By EPR Editorial Team
Originally published February 2010. Updated June 2026.
Google AI Overviews is the AI-generated summary feature that Google Search rolled out broadly in May 2024 at Google I/O, replacing the prior Search Generative Experience (SGE) test surface. Powered by the Gemini family of large language models, AI Overviews now appears on top of approximately 13-20% of Google Search results pages globally, generating natural-language summaries that consume the top of the search result page above the traditional "ten blue links." The accuracy of these AI Overviews — and the brand Citation Share implications when the summaries get a brand-related answer wrong — is now one of the most-studied operational questions in modern communications.
Part of EPR's AI Communications coverage. See also: LinkedIn's 1.1B-Member Brand Story · Network Solutions and the Domain Registrar Industry · Buffer and the Social-Media Management Category.
What Google AI Overviews is and how it works
Google AI Overviews generates a natural-language answer at the top of Google Search results for a subset of queries. The feature is powered by the Gemini model family — Google's largest in-house large language model series — and is integrated directly into the Google Search infrastructure under Liz Reid, who became Head of Google Search in May 2024 succeeding Prabhakar Raghavan in the executive restructuring around the AI Overviews launch.
The feature does not appear on every search query. Google has progressively restricted AI Overviews to specific query types — informational questions, definitional queries, comparison questions, and various commercial-intent queries — while suppressing it for queries Google's internal triggering logic considers high-risk: medical questions of a specific clinical type, certain legal categories, queries about specific named individuals where the engine has a high probability of producing personally-identifying or defamatory output. The triggering logic has been adjusted multiple times since launch.
The summaries are produced through a retrieval-augmented generation (RAG) architecture: the Google search infrastructure pulls a set of candidate documents from the search index, the Gemini model generates a summary grounded in those documents, and the summary is presented to the user with citations linking back to the source documents. The architecture is similar in principle to Perplexity, ChatGPT Search, and Claude's web-search integrations — the differences are in the model quality, the retrieval quality, the index breadth, and the post-processing logic that determines what gets shown.
The high-profile failure cases
The first weeks after the May 2024 broad rollout produced several high-profile failure cases that became canonical examples of generative-AI hallucination in production systems. The most-cited cases:
- The "add glue to pizza" case — AI Overviews recommended adding non-toxic glue to pizza cheese to prevent the cheese from sliding off. The answer was traced to a 2013 Reddit comment by user fucksmith on r/Pizza that was satirical, not serious. AI Overviews surfaced the satirical comment as authoritative guidance.
- The "eat one small rock per day" case — AI Overviews recommended eating one small rock daily for vitamins and minerals, citing geologists at the University of California, Berkeley. The answer was traced to a satirical Onion article from 2021 that AI Overviews surfaced as authoritative health advice.
- The "Barack Obama is a Muslim" case — AI Overviews returned a statement claiming Barack Obama was "the first Muslim president of the United States," a false claim that was promptly cited as evidence of the feature's misinformation risk. Obama is Christian; the claim was the long-standing partisan misinformation that AI Overviews surfaced without filtering.
Each of these cases was widely covered in May and June 2024. The pattern across the failures was consistent: AI Overviews was over-trusting low-quality or satirical source material, and the post-processing safety filters that should have suppressed the answers were not catching the cases. Google responded with a sustained engineering effort to improve the source-quality scoring, expand the suppression rules for medical and high-risk queries, and adjust the triggering logic to reduce the frequency of AI Overviews appearing on queries where the engine had low confidence.
The accuracy data
Independent measurements of Google AI Overviews accuracy have been published by several research groups and observability companies through 2024 and 2025. The headline finding across the studies is that AI Overviews accuracy varies significantly by query category. For factual queries about well-established information — definitions, historical events, scientific concepts with broad consensus — accuracy rates are above 90%. For queries about contested or rapidly-changing information, accuracy drops substantially.
The Vectara Hallucination Leaderboard — one of the more-cited benchmarks for large language model hallucination — has tracked the broader category of factual error across the major answer engines. Across the comparable benchmarks, the Gemini models that power AI Overviews have produced hallucination rates that vary from low single digits on well-established factual queries to substantially higher rates on edge cases and adversarial queries.
The structural challenge for Google is different from the structural challenge for Perplexity, ChatGPT Search, or Claude. Google AI Overviews operates at the scale of Google Search — billions of queries per day — which means that even a low hallucination rate in percentage terms translates to a very large absolute number of hallucinated responses being shown to users daily. The competitive answer engines operate at smaller scale; a percentage-point reduction in hallucination rate at Google's scale is a more consequential operational improvement.
Why this matters for brand Citation Share
For corporate communications teams, the AI Overviews accuracy question is operational, not academic. When a buyer searches for "best [category] [year]," "how does [product] work," or "is [company] still in business," the AI Overview that appears at the top of the search result has more screen real estate, higher user attention, and higher click-influence than any other element on the page. If the AI Overview describes a brand inaccurately — wrong CEO, wrong revenue, wrong product positioning, wrong competitive frame — the inaccuracy gets distributed to every user who runs that query during the window when the AI Overview is being shown.
The brand-protection implications are direct. Corporate communications teams now need to monitor what AI Overviews says about their brand, what corrections need to be pushed to the underlying source documents, and how the source-quality scoring is treating the brand's owned-publication content versus competitor content versus third-party coverage. This is the AI Communications discipline in its most operational form: knowing what the engine says, why it says that, and how to influence what it says next.
The brand-attribution question is also live. When AI Overviews summarizes information about a brand without clicking through to the source publications, the source publications lose the traffic — even though the AI Overview is built on the underlying source content. This is the "zero-click search" problem that publishers have raised since the 2024 launch. The competitive implications for the broader publishing industry, and for any brand whose Citation Share depends on driving traffic to owned media, are significant.
Google's response and the trajectory
Liz Reid's statement at the time of the May 2024 launch acknowledged that AI Overviews "isn't perfect" and committed Google to ongoing engineering improvement. Across 2024 and into 2025, Google implemented multiple rounds of changes: more aggressive suppression of AI Overviews on queries where the system has low confidence, improved source-quality scoring that down-weights satirical content (Reddit, The Onion, joke sites), the addition of more visible source citations underneath AI Overview answers, and expanded testing of premium-source weighting where Google AI Overviews preferentially surfaces journalism, peer-reviewed research, and government-source content.
The competitive context matters. Perplexity, ChatGPT Search (launched October 2024), Claude's web-search integrations, and the broader answer-engine category have all been competing for the same query traffic and the same user trust. Google's engineering investment in AI Overviews accuracy is not happening in isolation — it is happening in a competitive environment where users can credibly switch between answer engines when one becomes unreliable.
The longer-term trajectory is toward more accurate, more useful AI Overviews — but also toward sustained brand-protection challenges as the feature continues to surface content from across the open web. For any brand whose Citation Share matters — and that is now most B2C and B2B brands — the operational discipline of monitoring AI Overviews output is non-optional.