You open Search Console. The Pages report shows 847 URLs as "Not indexed". You go to one specific URL, paste it into URL Inspection, and read "Crawled - currently not indexed". You hit "Request indexing". Nothing happens. You hit it again the next day. Still nothing. You start composing an angry forum post.
Stop. The 2026 honest answer for most of those 847 URLs is that Google made a deliberate decision and that decision is mostly correct. John Mueller has said it directly: "I strongly recommend not relying on trying to force indexing." The URL Inspection tool is rate-limited to 2,000 requests per property per day, and Google has no plan to raise it — the rate limit is a feature, not a bug. The premise of "force-indexing as the fix" is wrong; the fix is upstream of indexation.
That said: a meaningful minority of the URLs in your "Not indexed" report probably should be indexed and aren't. The checklist below tells you how to find which ones, what to do about them, and what to delete instead of fight.
One critical distinction up front. Search Console shows two very different "not indexed" statuses that look like one problem:
- Discovered — currently not indexed: Google knows the URL exists (probably from a sitemap or an internal link) but hasn't crawled it yet. This is a crawl-budget / server-capacity / site-quality-signal issue. Google is rationing crawl based on whether the site is worth crawling. Different fix path.
- Crawled — currently not indexed: Google crawled the URL, looked at it, and decided not to include it. This is a quality verdict, not a technical problem. Different fix path entirely.
Operators who treat these as one problem waste weeks. The first 4 items of this checklist split them.
Vocabulary
If you've read our disavow checklist or anchor portfolio checklist, you already have Penguin/SpamBrain. New terms in italics.
- SpamBrain: Google's AI link-spam classifier deployed December 2022. Works at donor-cluster granularity.
- Helpful Content System / HCU: Google's content-quality classifier (deployed 2022, made part of core algorithm in March 2024). Targets thin, AI-generated-without-edit, or unhelpful pages. Drives ranking suppression, not always deindexation.
- URL Inspection: Search Console feature that shows Google's index status for a specific URL plus a "Request Indexing" button (rate-limited to 2,000 requests/property/day).
- Crawl budget: the number of URLs Googlebot is willing to fetch from your site in a given time window. Illyes (2024): mostly a non-issue for sites under 10,000 pages.
- X-Robots-Tag: HTTP response header that controls indexing. Same effect as a meta robots tag but set at the server level — often the cause when the page HTML looks fine but Google still won't index.
- Canonical: HTML/HTTP signal that declares which version of a URL is the "main" one. A page that sets canonical to a different URL is asking Google to index that URL instead.
- Indexing API: Google's programmatic indexing endpoint. Restricted to JobPosting and BroadcastEvent schema only — using it for general pages is against TOS and will get the property quarantined.
- IndexNow: instant-indexation protocol adopted by Bing, Yandex, and Naver. Google has tested it since 2021 and not adopted — sending IndexNow to Google does nothing.
- hreflang: HTML/HTTP attribute that tells Google which language and region version of a page to serve to which audience. Misconfigured hreflang causes Google to consolidate language variants and "Crawled - not indexed" the wrong ones.
- E-E-A-T markers: signals of Experience, Expertise, Authoritativeness, Trustworthiness on a page. Concretely: named author with bio page, credentials, LinkedIn link, published date, fact-checked claims with sources, real photos. Used by Google's quality classifiers.
What success looks like
You finish the checklist with three buckets. Bucket A: pages that should be indexed and aren't — you identified the technical or quality blocker and fixed it. Bucket B: pages where Google's judgment is correct — you accept the verdict or delete the page. Bucket C: pages that truly don't need to be indexed (admin, thank-you, preview URLs) — you mark them noindex deliberately so they stop appearing in the "Not indexed" report and stop wasting your crawl budget. Sorting 847 URLs into those three buckets is the win.
Related reading: redirect chains are a common reason a page stalls — see redirect chains explained: what they are, why they hurt SEO, and how to fix them.
Four checks before you touch any technical setting. Most indexation troubleshooting work goes the wrong direction because the operator skipped this category and started fixing a technical problem when the issue was either (a) Google's correct quality judgment, or (b) the URL was never supposed to be indexed in the first place.
Five checks for pages that should be indexed and aren't. These catch the technical reasons Google can't (or won't) index a URL even when the content is fine. Run these in order; later checks assume earlier ones pass.
Four checks for pages where Google's index decision is content-driven, not technical. These are the harder fixes because they require rewriting or merging pages, not flipping a flag. The honest framing: if Google declined to index a thin page, Google was probably right.
Five execution steps. Run them in order. Skipping fix-then-request and going straight to Request Indexing is the most common operator mistake — you burn rate-limit quota requesting a page Google already saw and declined.
Four anti-patterns that show up in nearly every indexation post-mortem. If you're about to do any of these, stop.