AI Citation Patterns by Platform
Wikipedia's dominance, Reddit's outsized role, YouTube on Perplexity, G2 for SaaS, LinkedIn for professional queries, Forbes/Investopedia for finance — the citation atlas.
If you ask ChatGPT, Perplexity, Google AIO, and Claude the same question, you’ll see the same set of source domains over and over: Wikipedia, Reddit, YouTube, G2, LinkedIn, Forbes, Investopedia, NerdWallet. These are not coincidences. They are the citation watering holes — domains AI engines disproportionately trust — and earning a presence on them is a high-leverage GEO strategy regardless of how strong your own domain is.
TL;DR
- Wikipedia is the most-cited source on every English LLM, often 3–4x the next runner-up. A single Wikipedia mention with a citation back to your domain can outweigh ten conventional backlinks for AI visibility.
- Reddit is in the top 5 across every major surface — a complete reversal of its pre-2023 role in SEO. Authentic, upvoted, expert-voiced threads on relevant subreddits feed Google AIO and ChatGPT Search directly.
- Vertical watering holes dominate their niches. G2 and Capterra for SaaS, LinkedIn for B2B and “what does role X do,” Forbes/Investopedia/NerdWallet for finance, GitHub/Stack Overflow for developer queries.
The mental model
Imagine the AI engines as journalists who all read the same five newspapers and three trade journals. Even if your blog is excellent, if you’ve never been quoted in those publications, you’re invisible to the journalists. The citation watering holes are those publications. Earning a presence in them is the equivalent of being on the journalists’ regular reading list.
This reframes off-page SEO entirely. The traditional question — “how do I get backlinks?” — becomes “how do I show up in the publications the engines already read?” A Reddit thread upvoted to the top of r/webdev is, for ChatGPT Search citation purposes, often more valuable than a regular DR-50 backlink. A G2 listing with strong reviews can swing every “best SaaS for X” answer.
The watering holes are not gameable in the link-farm sense. They have editorial standards (Wikipedia), community moderation (Reddit), or transactional data (G2). But they are addressable through strategy: contribution, authentic participation, structured data submission.
Deep dive: the 2026 reality
Citation share data for May 2026, aggregated from Profound, Otterly, BrightEdge, and Ahrefs Brand Radar across ~1M sample queries:
| Domain | ChatGPT | Perplexity | Google AIO | Claude | Avg rank |
|---|---|---|---|---|---|
| wikipedia.org | 18% | 24% | 22% | 26% | 1 |
| reddit.com | 11% | 9% | 14% | 7% | 2 |
| youtube.com | 4% | 14% | 6% | 3% | 3 |
| github.com | 5% | 4% | 4% | 6% | 4 |
| linkedin.com | 3% | 3% | 4% | 3% | 5 |
| forbes.com | 3% | 2% | 3% | 2% | 6 |
| investopedia.com | 2% | 2% | 3% | 2% | 7 |
| nytimes.com | 2% | 2% | 2% | 3% | 8 |
| stackoverflow.com | 2% | 1% | 2% | 3% | 9 |
| medium.com | 2% | 1% | 1% | 1% | 10 |
(Numbers are approximate and shift quarter-to-quarter; the ordinal pattern is stable.)
Vertical-specific patterns (citation share within domain-specific queries):
| Vertical | Top citation watering holes |
|---|---|
| B2B SaaS comparison | G2, Capterra, TrustRadius, Gartner Peer Insights, Reddit (r/SaaS) |
| Finance / investing | Investopedia, NerdWallet, Bankrate, Forbes Advisor, Reddit (r/personalfinance), SEC filings |
| Healthcare / medical | Mayo Clinic, NIH/PubMed, WebMD, Healthline, CDC.gov |
| Developer / technical | GitHub, Stack Overflow, MDN, official docs, Hacker News |
| Local / travel | TripAdvisor, official tourism .gov sites, Google Maps reviews |
| E-commerce product | Wirecutter, Consumer Reports, RTINGS, Reddit (r/BuyItForLife) |
| Professional / careers | LinkedIn, Glassdoor, Indeed (for role definitions), Wikipedia |
| News / current events | Reuters, AP, BBC, NYT, WaPo |
Why each watering hole matters and how to earn presence:
Wikipedia. Editorial entries reference primary sources. A research paper, dataset, or original journalism that meets Wikipedia’s notability and verifiability standards gets cited as a source on the relevant article. Once Wikipedia cites you, every English LLM training corpus and live retrieval favors you for that topic.
Reddit. Genuine subject-matter expertise in the relevant subreddit. Not link-drop spam — sustained, helpful participation that earns upvotes. ChatGPT Search and Google AIO both pull from highly-upvoted threads. The Reddit-Google content licensing deal (Feb 2024) gave Reddit content priority in AIO retrieval.
YouTube. Transcript-driven retrieval. Perplexity in particular cites YouTube heavily because they ingest transcripts as first-class units. A 12-minute tutorial with clean captions and timestamps can outrank a 4,000-word article.
G2 / Capterra / TrustRadius. Structured product comparison data. AI engines lift G2’s “best of” lists into “best X for Y” queries. Active reviews, accurate categorization, claimed profile, and “Leader” badge in the relevant grid are the levers.
LinkedIn. Profile-driven citations for “what does a [role] do” queries, and article-driven citations for thought-leadership queries. LinkedIn’s content licensing deals with multiple LLM providers give native LinkedIn articles preference.
Forbes / Investopedia / NerdWallet. Editorial-trust gate for finance queries. Expert columns, contributor relationships, or being cited by their staff writers all earn presence. Forbes Advisor’s “best of” categories drive substantial citation share for personal-finance queries.
Visualizing it
flowchart TD
Query[User AI query] --> Engine[ChatGPT / Perplexity / AIO / Claude]
Engine --> Pull{Citation source pull}
Pull --> WP[Wikipedia 18-26%]
Pull --> RD[Reddit 7-14%]
Pull --> YT[YouTube 3-14%]
Pull --> Vert[Vertical sites]
Vert --> G2[G2/Capterra SaaS]
Vert --> LI[LinkedIn B2B]
Vert --> Fin[Forbes/Investopedia/NerdWallet finance]
Vert --> Dev[GitHub/SO developer]
Vert --> Med[Mayo/NIH medical]
WP --> Cite[Cited in synthesis]
RD --> Cite
YT --> Cite
G2 --> Cite
LI --> Cite
Fin --> Cite
Dev --> Cite
Med --> Cite
Bad vs. expert
The bad approach
Treating off-page SEO purely as backlink-building from anywhere with DR > 30. Buying guest posts on irrelevant blogs. Ignoring the watering holes because they’re “platforms, not sites I control.”
Off-page priorities (typical):
- Acquire 10 DR40+ backlinks via guest posts
- Submit to 50 directories
- Build PBN tier-2 links
- Buy a few sponsored placements on niche blogs
This fails because the AI engines don’t read those guest-post sites. Citation share comes from the watering holes, not from a long tail of mid-DR domains. A guest post on seoguide-2026.example does almost nothing for AI visibility.
The expert approach
Targeted presence-building on the actual citation watering holes for your vertical, treated as off-page strategy.
# Off-page strategy: citation watering hole presence
horizontal:
wikipedia:
target: "Get original research/data referenced as a source on 3 relevant articles"
method: "Publish citable primary research; submit edits via talk pages with sourcing"
reddit:
target: "Build author expertise + 2 highly-upvoted contributions on relevant subreddits"
method: "Sustained participation in r/SaaS, r/Entrepreneur, etc. as named SME"
youtube:
target: "Transcribed video for top 5 priority topics"
method: "Studio-produced, captioned, timestamps, source links in description"
vertical_b2b_saas:
g2:
target: "Claimed profile, 25+ reviews, Leader badge in primary category"
method: "Customer review campaign; G2 Marketing Solutions if budget allows"
capterra:
target: "Verified listing in 3 relevant categories"
trustradius:
target: "Verified listing + 10+ reviews"
linkedin:
target: "Founder-authored article every 2 weeks; SME-tagged"
<!-- Example: structured data on your own site that helps watering holes link to you -->
<script type="application/ld+json">
{
"@context": "https://schema.org",
"@type": "Dataset",
"name": "2026 SaaS Pricing Benchmark",
"description": "Pricing data for 200 SaaS tools, benchmarked Q1 2026",
"creator": {"@type": "Organization", "name": "Acme Research"},
"license": "https://creativecommons.org/licenses/by/4.0/",
"datePublished": "2026-03-15"
}
</script>
This wins because it builds presence in the exact places AI engines read. A Wikipedia source citation, an upvoted Reddit AMA, a captioned YouTube explainer, and a G2 Leader badge collectively move citation share faster than 50 conventional backlinks.
Do this today
- Run Profound or Otterly.ai on your top 25 priority queries across ChatGPT, Perplexity, Google AIO, and Claude. Log the cited domains. Find your watering-hole gap — domains cited in your space where you’re absent.
- Audit your Wikipedia presence. Search for your category, your key topics, your founder. Identify articles where you could legitimately be cited as a source. Read Wikipedia’s Reliable Sources policy before contributing.
- Pick two relevant subreddits for your category. Spend a month participating authentically before posting anything that mentions your product. Earn mod-friendly status.
- For B2B SaaS, claim profiles on G2, Capterra, and TrustRadius. Run a customer review campaign — even 15 fresh reviews moves the AI citation needle. Don’t bribe; G2 enforces this hard.
- Identify the top 5 YouTube channels in your space (use TubeBuddy or VidIQ for niche analysis). Pitch contributed appearances or sponsor segments. A 30-second segment in a popular tutorial can drive substantial Perplexity citation.
- For finance/B2B/professional categories, pitch contributed columns to Forbes Councils, Entrepreneur, or LinkedIn long-form. Authorship matters — bylined articles get cited; ghosted attributions don’t.
- Publish at least one piece of citable original research per quarter — survey, dataset, benchmark — and pitch it directly to journalists at the watering-hole domains in your space.
- Use HARO alternatives (Qwoted, Featured.com, SourceBottle) to get expert quotes published on Forbes, NerdWallet, Investopedia. One quote per outlet per quarter compounds quickly.
- Confirm your Wikidata entry is correct and linked to your Wikipedia article (if any). Wikidata’s structured data feeds knowledge graphs that LLMs use for entity disambiguation.
- Set up a monthly watering-hole audit in your reporting: count your appearances on the top 5 citation domains for your category. Trend monthly. This is your real off-page scorecard for AI visibility.
Mark complete
Toggle to remember this module as mastered. Saved to your browser only.
More in this part
Part 9: AI Search Optimization (GEO/AEO)
- 065 The AI Search Landscape: Where Discovery Goes Next 24m
- 066 Google AI Overviews 21m
- 067 Google AI Mode 26m
- 068 ChatGPT Search Optimization 22m
- 069 Perplexity Optimization 24m
- 070 Generative Engine Optimization (GEO) Principles 21m
- 071 Answer Engine Optimization (AEO) 20m
- 072 AI Citation Patterns by Platform You're here 17m
- 073 AI Crawler Management 19m
- 074 Earned Media for AI Visibility 16m
- 075 Measuring AI Visibility 20m
- 076 The Future: Agentic Search & AI Browsers 22m