News SEO & Google News
Google News inclusion, news sitemaps, NewsArticle schema, Top Stories optimization, real-time SEO for breaking news, and E-E-A-T as it applies to journalism.
News SEO is the highest-velocity discipline in search. The window between publish and peak traffic on a breaking story is often 15–60 minutes, and the rules are different: you are not optimizing for a stable index, you are racing for Top Stories on a query that did not exist an hour ago. Get the markup wrong, the deduplication signals weak, or the byline missing, and you watch a competitor with worse reporting outrank you for the rest of the news cycle.
TL;DR
- Top Stories is the prize, not Google News. The dedicated Google News surface drives ~10% of news traffic; Top Stories in regular SERP and News tab drive the rest. Eligibility is the same.
- Speed of indexing wins the cycle. Google promotes fresh, well-marked-up coverage within 1–5 minutes via real-time crawl. Sitemap pings, IndexNow, and clean
NewsArticleschema are the difference between rank 1 and rank 9. - E-E-A-T is policed harder for news. Author bios, masthead transparency, named editors, corrections policy, and named sources are now ranking inputs, not nice-to-haves, especially after the September 2024 Site Reputation Abuse enforcement.
The mental model
News SEO is like a wire service circa 1960. The first reporter to get a clean dispatch on the editor’s desk gets the front page. Late filings — even better-written ones — get spiked or buried inside. Google’s freshness signal works the same way: the first credible source published on a breaking entity earns the canonical position, and every subsequent publisher is graded against it.
The implication is that you optimize for time-to-rank, not just rank. Pre-built page templates, pre-validated NewsArticle schema, pre-warmed CDN edges, and a publish workflow that hits a news sitemap within 60 seconds of go-live are what separates Reuters from a regional paper. The reporting can be equal; the technical chassis is not.
The second mental model: clusters, not pages. Google detects a “story cluster” — a real-world event with multiple covering articles — and assembles a Top Stories carousel for it. You are competing inside that cluster. The signals that win there are byline expertise, original reporting (named sources, exclusive footage, on-the-ground quotes), and consistent updates as the story develops.
Deep dive: the 2026 reality
Three platform realities shape news SEO in 2026.
Google News inclusion is automatic. Since December 2019, publishers no longer apply. Any site that meets Google News content policies (originality, transparency, accountability) is eligible if Google’s crawler discovers and classifies it as news. The work shifted from “applying” to “demonstrating you are a news publisher” via clean Publisher Center profile, masthead, author pages, and consistent NewsArticle schema.
Top Stories now blends with AI Overviews. As of mid-2025, AI Overviews appear above Top Stories on roughly 18% of news-intent queries. AIO cites 3–5 sources; the cited sources also tend to appear in Top Stories. To be cited you need: explicit NewsArticle schema, dateline, named author with sameAs, and a lede that states the news in the first 50 words.
Site Reputation Abuse policy (effective May 2024, enforced from September 2024) targets “parasite SEO” where a high-authority news domain hosts third-party content (coupons, “best of” lists from external partners) under its subdomain or subdirectory. Forbes, USA Today, Time, and CNN took major hits. If you operate a news site, audit any /coupons/ or /reviews/ directory built by external partners — these are the highest-risk surfaces.
The crawler picture:
| Crawler | Role | Behavior on news |
|---|---|---|
| Googlebot-News | Discovers via sitemap and links | Crawls news sitemaps every 1–5 min |
| Googlebot | Standard web crawl | Indexes for regular SERP |
| GPTBot / OAI-SearchBot | OpenAI training and live retrieval | OAI-SearchBot reads breaking news for ChatGPT Search |
| PerplexityBot | Live retrieval for Perplexity | Heavy on breaking-news verticals |
| ClaudeBot | Anthropic training | Most news sites block via robots.txt |
A 2025 Reuters Institute study found AI assistants get news facts wrong 51% of the time; the publishers cited as sources also rank higher in conventional SERP. The incentive to be cited is real even if the click-through is lower than from organic.
News sitemaps must include articles published in the last 48 hours, capped at 1,000 URLs per sitemap. After 48 hours, articles drop out of the news sitemap but remain in the standard sitemap. The news:publication_date must be in W3C datetime format with timezone.
Visualizing it
flowchart TD
P["Article published<br/>(CMS commit)"] --> SM["News sitemap updated<br/>within 60s"]
P --> IN["IndexNow ping<br/>to Bing + Yandex"]
SM --> GBN["Googlebot-News<br/>fetch (1-5 min)"]
GBN --> CL["Story cluster<br/>classification"]
CL --> TS["Top Stories carousel"]
CL --> AIO["AI Overviews<br/>citation candidate"]
CL --> GN["Google News surface"]
TS --> CTR["Peak traffic window<br/>15-60 min"]
Bad vs. expert
The bad approach
<article>
<h1>Big news today!</h1>
<p>By Staff Writer | 2 hours ago</p>
<p>Something happened. Here is what we know so far. More to come.</p>
</article>
<!-- sitemap.xml: standard, no news namespace, last updated yesterday -->
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
<url><loc>https://example.com/post/big-news</loc></url>
</urlset>
No structured data. “Staff Writer” is a transparency violation per Google News policies. The lede says nothing — neither AIO nor Top Stories will cite it. The sitemap is not a news sitemap; even if Googlebot discovers it, the article is not eligible for the Top Stories real-time crawl. Publish at 9:00, the article hits Top Stories at 11:30, by which point a competitor has owned the cycle for two hours.
The expert approach
<article itemscope itemtype="https://schema.org/NewsArticle">
<header>
<h1 itemprop="headline">Federal Reserve Cuts Rates 25 Basis Points,
Citing Cooling Labor Market</h1>
<p class="dateline">WASHINGTON, May 7, 2026 —</p>
<p class="byline">
By <a rel="author" href="/staff/anya-patel" itemprop="author"
itemscope itemtype="https://schema.org/Person">
<span itemprop="name">Anya Patel</span>
</a>,
Senior Economics Correspondent · Updated <time itemprop="dateModified"
datetime="2026-05-07T14:32:00-04:00">2:32 p.m. ET</time>
</p>
</header>
<p class="lede" itemprop="description">The Federal Reserve cut its benchmark
rate by 25 basis points to 4.00-4.25% on Wednesday, the first cut of 2026,
citing softer payrolls data and easing wage growth.</p>
<!-- body, with named sources and updates -->
</article>
<script type="application/ld+json">
{
"@context": "https://schema.org",
"@type": "NewsArticle",
"headline": "Federal Reserve Cuts Rates 25 Basis Points, Citing Cooling Labor Market",
"datePublished": "2026-05-07T14:00:00-04:00",
"dateModified": "2026-05-07T14:32:00-04:00",
"author": [{
"@type": "Person",
"name": "Anya Patel",
"jobTitle": "Senior Economics Correspondent",
"url": "https://example.com/staff/anya-patel",
"sameAs": [
"https://www.linkedin.com/in/anyapatel",
"https://muckrack.com/anya-patel"
]
}],
"publisher": {
"@type": "NewsMediaOrganization",
"name": "Example News",
"logo": {"@type": "ImageObject", "url": "https://example.com/logo.png"},
"diversityPolicy": "https://example.com/about/diversity",
"ethicsPolicy": "https://example.com/about/ethics",
"correctionsPolicy": "https://example.com/about/corrections"
},
"image": ["https://example.com/img/fed-1200x675.jpg"]
}
</script>
<!-- news-sitemap.xml -->
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9"
xmlns:news="http://www.google.com/schemas/sitemap-news/0.9">
<url>
<loc>https://example.com/2026/05/07/fed-cuts-rates</loc>
<news:news>
<news:publication>
<news:name>Example News</news:name>
<news:language>en</news:language>
</news:publication>
<news:publication_date>2026-05-07T14:00:00-04:00</news:publication_date>
<news:title>Federal Reserve Cuts Rates 25 Basis Points</news:title>
</news:news>
</url>
</urlset>
The lede is a 38-word answer to the news query. Author has sameAs linking to LinkedIn and Muck Rack (the de facto journalist-verification graph). Publisher includes diversityPolicy, ethicsPolicy, correctionsPolicy — fields Google’s News policy team checks. The news sitemap fires within seconds, with timezone-aware datetime. This article is in Top Stories within 5 minutes; an AIO synthesis citing it within 20.
Do this today
- In Google Search Console, open Indexing → Sitemaps and verify a news sitemap is submitted at
/news-sitemap.xml. Confirm it includes only articles from the last 48 hours and updates on every publish. - In Google Publisher Center (publishercenter.google.com), confirm your publication is verified, the masthead is current, and the language/region settings match your editorial scope.
- Audit ten recent articles for
NewsArticleJSON-LD using the Rich Results Test. Fix any missing fields:datePublished,dateModified,author.sameAs,publisher.correctionsPolicy. - Replace any “Staff” or pseudonymous bylines with named authors. Build dedicated author pages with photo, bio, beats, social links, and
Personschema. Google’s reviewer guidelines explicitly weight this for news. - Add IndexNow integration. POST every new article URL to
https://api.indexnow.org/IndexNowto ping Bing, Yandex, and Naver. Google does not consume IndexNow but Bing’s news index drives ChatGPT Search citations. - Rewrite the lede on your last 20 articles to deliver the news in the first 50 words. AIO and Top Stories both lift from this region; throat-clearing here is a citation killer.
- Audit any
/coupons/,/reviews/, or partner-content subdirectories. If they were built by an external partner with separate editorial control, you are at risk under Site Reputation Abuse policy. Block from index or migrate to a different domain. - Subscribe to Glimpse or Exploding Topics alerts on your beat keywords. The first article on an emerging entity often owns it for the cycle.
- Set CDN cache TTL on article pages to 30 seconds with stale-while-revalidate, not 5 minutes. Updates during a story (corrections, additions) need to propagate fast enough that Googlebot-News sees them on its next pass.
- Track time-to-Top-Stories as a KPI. Use GSC → Performance → Search appearance → News to log when each article first appeared. Anything over 10 minutes is a workflow problem; anything over 30 is a technical one.
Mark complete
Toggle to remember this module as mastered. Saved to your browser only.
More in this part