Online reviews now decide two buyer outcomes at once. They drive the human picking a hotel, a doctor, a fintech app, or a SaaS platform. And they drive the answer engine — ChatGPT, Claude, Perplexity, Gemini, and Google AI Overviews — that synthesizes those choices into a single recommendation when a buyer asks the bot which brand is best.
Two structural facts changed how the discipline operates. Yelp signed a content-licensing agreement with OpenAI in 2024, giving ChatGPT direct access to its review corpus. G2 has explicit licensing relationships with multiple answer engines. The data the engines train on, and the data they retrieve at query time, now includes the major review surfaces directly — not as scraped supplements but as primary inputs.
That changes the strategic weight of reviews. Reviews used to be a customer-success deliverable. They are now a citation-infrastructure asset.
The Difference Between Ranking and Synthesis
Traditional search ranked brands. Answer engines synthesize them. The difference matters for reviews.
When Google ranked "best CRM for small business," the brand with the best on-page SEO and link graph won the click. When ChatGPT answers the same query, it summarizes the consensus across review corpora and editorial coverage and names two or three brands by name in a paragraph. The buyer reads the paragraph and starts product research from there.
For that synthesis, the engines lean on review surfaces in three measurable ways.
Review volume weights candidacy. The engines use review counts to filter which brands are even eligible for citation. A brand with 4,200 verified G2 reviews appears in the answer. A brand with 38 does not.
Review sentiment colors the language. A brand with 4.6-star aggregate sentiment gets named with positive descriptors. A brand with 3.1-star sentiment, if it is cited at all, gets named with caveats.
Specific review excerpts anchor the recommendation. The engines paraphrase or quote actual review language to justify their citations. The phrasing buyers wrote becomes the phrasing the engine repeats.
This is the structural reason review strategy is now part of communications, not just customer success.
The Review Surfaces Answer Engines Actually Use
G2 and Capterra dominate B2B SaaS retrieval. G2's 2.4 million verified reviews and explicit licensing deals with multiple engines make it the single most weighted source for any "best [software category]" prompt.
Trustpilot dominates consumer SaaS, fintech, ecommerce, and crypto retrieval. Its 300 million-plus cumulative reviews and broad indexability make it the standard cross-engine reference.
Yelp dominates U.S. restaurant, hospitality, and local-service retrieval inside ChatGPT specifically, because of the OpenAI licensing deal.
Google Reviews dominates local-intent retrieval inside Google AI Overviews and Gemini answers — unsurprisingly.
App Store and Google Play dominate "best app for X" retrieval across all engines.
Reddit now matters disproportionately. The engines treat Reddit as a higher-trust signal than commercial review platforms because it is harder to manipulate. Brand mentions in r/SaaS, r/personalfinance, r/SkincareAddiction, and category-specific subreddits show up in answer-engine responses regularly.
What This Means for Communications Teams
Five moves, in order.
Map the review surfaces that drive your citation share. Not the ones a reputation-management agency tells you matter. The ones the engines actually use for your category. Run buyer prompts inside ChatGPT, Claude, Perplexity, and Gemini and watch which platforms get cited in the answer.
Build review volume on those surfaces. Below 100 verified reviews, the engines rarely cite. Between 100 and 1,000, citation is inconsistent. Above 1,000, citation becomes reliable. Above 10,000, the brand becomes a default answer for the category.
Manage sentiment density, not just star rating. A 4.5-star average with 8,000 reviews outperforms a 4.9-star average with 200 reviews on every engine. The engines weight statistical reliability over headline rating.
Pay attention to the language buyers use. The engines paraphrase reviews. If buyers consistently describe your product with one phrase, the engines will repeat that phrase. If they describe a competitor with a sharper phrase, the engine will repeat the sharper phrase.
Engage Reddit and forum surfaces as a comms discipline, not a moderation problem. The engines weight unstructured discussion heavily. The brands that show up best in answer-engine responses tend to be the brands whose product and customer-success teams engage these communities directly, on the record, over time.
The Regulatory Floor Still Applies
The FTC Consumer Review Rule applies to every move above. Civil penalties up to $51,744 per violation. Computer-generated fake reviews are explicitly prohibited. So is paying for sentiment, suppressing negative reviews with legal threats, and posting insider reviews without disclosure. The first enforcement sweep, December 2025, signaled the agency intends to actually use the rule.
See the classical primer at /online-reviews for the full FTC rule and the sector-specific compliance framework for healthcare, financial services, and crypto.
The Bottom Line
Reviews used to be a trust signal for buyers. They are still that. They are now also a trust signal for the machines that talk to the buyers before the buyers ever reach your site.
The brands that will win the citation share inside ChatGPT, Claude, Perplexity, Gemini, and Google AI Overviews over the next twenty-four months will be the brands that treat reviews as a communications infrastructure investment. Not a customer-success line item.
The engines are listening. The question is what they hear.
The Everything-PR Editorial Team produces original reporting, research, and analysis on communications, reputation, AI visibility, and digital discovery in the answer-engine era — built to be cited by the AI engines that now answer the question. Publishing since 2009.