Your organic traffic dropped 18% last quarter. Rankings barely moved. The clicks went to AI Overviews. You opened ChatGPT, asked your category's top query, and watched it cite three competitors and not you. Your CMO asks why traffic is down. You don't have a good answer that doesn't sound like "AI search ate it." You start Googling "how to optimize for AI search". You find 40 articles, 38 of them recycled SEO advice with "AI" sprinkled on top.
Three numbers anchor this checklist.
- Brand-mention frequency on third-party sites correlates with AI citation at r=0.664, about 3× stronger than backlinks (r=0.218). Source: Ahrefs' 75,000-brand study (Linehan & Guan, May 2025). The implication: every external mention of your brand on Reddit, in a podcast transcript, in a Substack write-up, in a partner blog post, even unlinked, moves the GEO needle more than a typical backlink does. Important caveat: correlation, not causation. Brand-mentioned companies also tend to be larger, more covered, and older. The 3× ratio is directionally robust; the absolute coefficient is one study.
- Cross-engine citation overlap is only 11-12% (Averi's 680M-query audit; Qwairy's 118k-response analysis). No single "GEO playbook" works everywhere. AI Overviews, ChatGPT Search, Perplexity, Claude, and Gemini are five separate citation pipelines pulling from different source distributions. Optimizing for one buys you partial credit for the others.
- Princeton's original GEO paper measures what happens AFTER your URL is retrieved. The viral "add citations, quotations, statistics" prescription only applies once you're in the candidate set. If you're not retrievable, the lift is zero. Most "GEO tactic" articles skip the retrievability precondition; this checklist starts there.
22 items, 5 stages: get the 2026 mental model right (pre-flight), build the authority signals AI engines weigh (authority), write content in patterns that get cited (content), shore up technical readiness (technical), avoid four anti-patterns being sold as GEO. Tier filter + browser-saved progress.
Vocabulary
If you've read our GEO vs SEO guide you have most of these. New terms in italics.
- GEO (Generative Engine Optimization): tactics for getting cited by AI search engines (AI Overviews, ChatGPT Search, Perplexity, Claude, Gemini). Adjacent to SEO, not the same.
- AIO (AI Overviews): Google's AI-generated summary at the top of search results. Pulls citations from the regular Google index plus its own retrieval ranking.
- Candidate set: the pool of URLs an AI engine retrieves before it decides which to cite. Like Google's index, but per-engine. If you're not in it, GEO tactics lift you by zero.
- Retrievability: the precondition for citation: whether your URL is in the candidate set. Indexation, robots.txt, crawl access, sitemap inclusion. Princeton's paper measures post-retrieval; this is pre-retrieval.
- Brand mention: any reference to your brand name on a third-party site (linked or unlinked). The dominant GEO signal per Ahrefs' 2025 75K-brand study.
- Citation source distribution: where a given AI engine pulls its citations from. Wikipedia 47.9% on ChatGPT, Reddit 46.7% on Perplexity, Reddit 21% on AIO (Profound's multi-platform audit).
- sameAs: schema.org property listing all your brand's other canonical URLs (LinkedIn, X, Crunchbase, G2), so Google links them as one entity.
- Person JSON-LD: schema.org/Person
structured data for an author, embedded inside an Article schema's
authorproperty. The Person carries name, url, sameAs array, jobTitle, knowsAbout. - E-E-A-T: Experience, Expertise, Authoritativeness, Trustworthiness. Google's quality-rater framework. Author bio + credentials + sameAs links are how a page shows it.
- SSR / SSG: server-side rendering / static site generation. Both deliver fully-rendered HTML to a crawler. SPA sites that depend on client-side JS to render content suffer in retrievability.
- TTFB: Time To First Byte. Measures how fast a server starts sending the page. AI crawlers retrieve at scale with strict budgets.
- Princeton GEO paper: the 2023 paper "GEO: Generative Engine Optimization" (Aggarwal et al., arXiv 2311.09735) that introduced citation/quotation/statistic tactics. Magnitudes don't transfer cleanly to production engines; directions replicate.
- llms.txt: proposed text-file standard to tell LLMs what to crawl. Google's John Mueller publicly compared it to meta-keywords. Google has stated it won't use it.
- GEO-16: an academic framework (arXiv 2509.10762) that measured structural quality across 16 pillars. A G-score of ≥0.70 with ≥12 pillars correlated with 78% cross-engine citation rate.
What success looks like
You finish with a documented audit across the 5 categories. The authority work is a 6-12 month project; brand mentions don't accumulate overnight. The content and technical work compound faster, usually visible within 4-8 weeks in citation trackers. The win: cited by name in 1-3 AI engines for your top commercial query, not appearing in their index alone.
Four mental-model checks before any tactical work. These rule out the three most common operator misconceptions: that AIO citation == ranking #1, that one GEO playbook serves all engines, and that the Princeton tactics work without retrievability first.
Five authority items. The first one (brand-mention frequency) is the strongest single signal in the published 2025-2026 data. Most GEO advice skips it because it can't be hacked in a week. We don't.
Five content items. Princeton's GEO tactics replicate in direction (statistics, citations, quotations help) but the magnitudes don't transfer cleanly to production engines. The items below pull what's defensible and skip the folklore.
Four technical items. JSON-LD schema is the only technical signal with published empirical lift on AI citation rates, and only in Google's ecosystem. Cross-engine technical work is more about not breaking retrieval than adding GEO magic.
Four anti-patterns sold as GEO in 2025-2026 vendor blogs that don't hold up against the published data. If you've spent budget on any of these, you're not behind — you're early enough to redirect. The data on each one became clear in 2026, not 2024.