This is the new game. Citation Share is the score.
A Caveat Before the Case
AI visibility is not a single number, and it does not behave like search rankings. It varies by query, by engine, by user geography, by personalization, by freshness of the model's index, by whether the engine is doing live retrieval or summarizing from training data. The patterns described in this piece are structural — they describe which outlets are positioned to be retrieved, not a guarantee of any specific answer on any specific day. The point is the direction of travel.
Quick Answer: Do Paywalls Hurt AI Visibility?
Yes. Hard yes — with caveats. Paywalls hurt AI visibility through three separate mechanisms.
- Crawl access. Most paywalled outlets block AI crawlers — GPTBot, ClaudeBot, PerplexityBot, Google-Extended — at robots.txt. No crawl, no training data, no real-time retrieval. Stanford GSB and Arc XP data place the share of major news sites blocking GPTBot near half the industry.
- Licensing posture. Outlets that sued instead of signed (NYT v. OpenAI, NYT v. Microsoft) get treated as adversaries. Outlets that signed (News Corp, Axel Springer, the Associated Press, the Financial Times, Dotdash Meredith, Condé Nast, Time, Vox Media, Reddit) get treated as partners — their content surfaces inside answers, with attribution.
- Engine behavior. Without licensing and without crawl access, the model has nothing to retrieve. So it cites whatever it can — the free, open, structured outlet sitting one click away.
The end-state: an open-access outlet with a licensing deal — the New York Post — gets cited inside ChatGPT for stories where the New York Times did the original reporting. The Post inherits the authority. The Times eats the legal bill.
The Mechanics: How AI Engines Actually Access News
Three things determine whether an outlet shows up in an AI answer.
Crawl. Does the bot have permission to read the site? The New York Times blocks GPTBot, CCBot (Common Crawl), Google-Extended, and most other AI crawlers. News Corp titles route AI access through the company's OpenAI partnership rather than open crawling. The Associated Press wire feeds out everywhere.
License. Is there a deal? OpenAI's publisher partnerships include News Corp (covering the Wall Street Journal, New York Post, Barron's, MarketWatch, the Times of London, the Sun, the Australian, and dozens of other titles), Axel Springer (Bild, Politico, Business Insider), the Associated Press, the Financial Times, Dotdash Meredith (People, InStyle, Travel + Leisure), Condé Nast (Vogue, GQ, Wired, the New Yorker), Reddit, Vox Media, Time, Le Monde, and Prisa Media. Anthropic, Google, and Perplexity operate their own publisher relationships. Amazon's first major deal is with the New York Times — scoped to Alexa and Amazon's own foundation models, not ChatGPT.
Retrieval. When a model answers a query, where does it look? Some answers come from the training corpus. Some come from live search-and-summarize tools. The mix changes by engine and by query type. But the underlying logic is the same — the model retrieves from sources it can read and is contractually clear to cite. Block the crawl. Sue the engine. You don't get retrieved.
The Sister-Paper Case: NY Post vs. NY Times
Two New York City newspapers. Both with archives going back more than a century. Both with national reach. Both fighting for the same readers. But the structural posture toward AI is opposite.
The New York Times. Filed copyright litigation against OpenAI and Microsoft, alleging the AI companies used millions of articles to train ChatGPT and Bing Chat without permission. The lawsuit is active and getting nastier. According to OpenAI's public statements, the Times demanded that OpenAI hand over 1.4 billion private ChatGPT user conversations as part of discovery; OpenAI has fought it, with the demand now scoped down to 20 million conversations and a privacy fight playing out in court. The Times has signed its first AI licensing deal — but with Amazon, not OpenAI. The scope: NYT, NYT Cooking, the Athletic, available inside Alexa and Amazon's foundation models. Not ChatGPT. Not Claude. Not Perplexity. The full strategic analysis is in The Times Bet Against the Answer Engine.
The New York Post. Owned by News Corp. Part of the OpenAI deal alongside the Wall Street Journal, Barron's, MarketWatch, Investor's Business Daily, and the rest of the News Corp portfolio. Content surfaces inside ChatGPT with attribution. The site itself is largely free to the open web. The combination — open access plus a licensing deal — is the strongest possible posture for AI visibility inside ChatGPT specifically.
Same city. Same beats. Opposite outcomes inside the chatbox.
When a user asks ChatGPT about a story both outlets are covering — political, crime, Wall Street, celebrity — the Post is structurally positioned to show up in the answer. The Times is structurally positioned to be absent, or to be summarized from a third-party recap of the Times's reporting.
That is the inversion. The Times does the journalism. The Post collects the citation.
Why the New York Times Posture Is Rational — and Still Losing Ground
The Times is not making a stupid bet. The legal theory — that wholesale ingestion of paywalled archives violates copyright — is real. The subscription business is real. Surrendering the archive at a discount to OpenAI would weaken the franchise for nothing.
But the rational legal posture and the rational AI-visibility posture are not the same posture. The lawsuit will resolve at some point. Maybe the Times wins billions in damages. Maybe the fair-use defense holds. Either way, the AI-visibility damage is being done now — as consumers shift product research, news consumption, and basic factual lookups into the answer engines.
By the time the Times wins the case, the citation map will already be set. The Post will already be the answer the engines repeat in many categories. The Wall Street Journal will already be the answer for business news. The Times will be the source the model cites secondhand, through whichever outlet recapped it that day.
Litigation outcomes age. Citation patterns compound.
Will the New York Post Become More Authoritative Inside ChatGPT?
Yes — structurally, it is positioned to. Authority inside the AI era is not the authority of the past. The print-era hierarchy was about prestige, mastheads, columnists, beats, awards. The AI-era hierarchy is about retrievability — can the model read you, can the model cite you, is your name attached to the answer the consumer sees.
Three things are happening at once.
The Post can be cited for stories it did not originate. When the Times breaks a story and the Post follows with its own version, the engines have two options — the inaccessible Times original or the accessible Post follow-up. Inside ChatGPT, the model retrieves the Post. The user sees "New York Post." The Times reporter's work shows up as a downstream input the user never sees.
The Post's house voice is structurally easier to retrieve. Engines retrieve text. They quote it. Punchy headlines, short paragraphs, declarative sentences — Post copy is structured exactly the way AI engines tend to extract. The Times's longer, more discursive style gets summarized; the Post's gets quoted.
The Post's citation footprint compounds. Repeated citation patterns reinforce retrieval visibility over time. The flywheel turns. The gap is more likely to widen than to close, absent a Times licensing reversal.
This is what category capture looks like in the answer-engine era. Not "the most-read paper." The most-cited paper. The two are no longer the same thing.
Cross-Engine Differences — It's Not the Same Answer Everywhere
AI visibility is a five-engine portfolio question, not a single score. The Post's position is strong inside ChatGPT — but not uniformly strong everywhere.
ChatGPT. Licensing-heavy. The News Corp portfolio is in. The Times is out. The Financial Times, Axel Springer, Associated Press, Reuters, Dotdash Meredith, Condé Nast, Time, Vox, Reddit, the Atlantic — in. Most paywalled outlets without deals — out.
Google AI Overviews. Different logic. Google has long-standing news indexing relationships and a separate Google-Extended crawler permission. Many publishers who block GPTBot allow Google-Extended, or have not updated robots.txt to block it. The Times has blocked Google-Extended too. So the Times surfaces less than its print authority would predict, even in the engine most aligned with its historical SEO posture.
Perplexity. Operates its own publisher partner program. Members include Time, Fortune, Der Spiegel, the Texas Tribune, Entrepreneur, the Los Angeles Times, the Independent, ADWEEK, Gannett, Lee Enterprises, Le Monde, and others, with CNN, Condé Nast, the LA Times, and the Washington Post added as Comet Plus launch partners. Notably, News Corp is on the opposite side here: Dow Jones (parent of the Wall Street Journal) and the New York Post sued Perplexity alleging "content kleptocracy." So the same News Corp posture that wins inside ChatGPT is litigating inside Perplexity. Cross-engine outcomes diverge.
Claude. Anthropic has been more conservative about live news retrieval. When Claude does retrieve current news, it leans on accessible web content. Paywalled archives are largely invisible to it.
Amazon products (Alexa, Rufus, future Amazon AI surfaces). This is where the NYT–Amazon deal matters. The Times shows up here. But these surfaces are smaller than ChatGPT, and Amazon's AI products are still finding their consumer footprint.
The headline implication: an outlet's AI visibility is a portfolio. The New York Times is strong in one corner (Amazon) and weak in the others. The Post is strong inside ChatGPT, weaker inside Perplexity (where News Corp is litigating). The Wall Street Journal is the most balanced of the major paywalled brands — paywalled to consumers, licensed to OpenAI, still litigating Perplexity.
What This Means for Brands Buying PR
If you're buying communications, this is the structural question: Which outlets do you want covering you — judged not by print circulation, masthead prestige, or Twitter followers, but by Citation Share inside the answer engines your buyers are now using?
Three rules of thumb.
Open-access wire outlets are punching above their weight. Associated Press, Reuters, Bloomberg (where free), the BBC. They get cited constantly. A wire pickup is now often more visible inside the chatbox than a paywalled feature.
Licensed publisher portfolios are inside the answer. News Corp titles. Condé Nast titles. Dotdash Meredith titles. Axel Springer titles. Placement inside any of these has a real shot at surfacing in ChatGPT and the other engines that have licensed the corpus.
Trade publications, niche verticals, and category-specific outlets are over-represented in retrieval. A specialized outlet — financial, technical, vertical-specific — often beats a general-interest paywalled outlet for a category-specific query, because the model is looking for category-aligned content and the trade outlet is structurally easier to retrieve.
The old PR playbook said: get the New York Times. The new playbook says: get the Citation Share. Sometimes those are the same outlet. Increasingly, they are not.
What This Means for Publishers
Publishers face the harder question, because they have to decide what they are.
A subscription business says: gate everything, sue the engines, defend the archive. The Times is making that bet, and the bet is internally coherent. A reach-and-influence business says: be everywhere the answer is, license the archive, optimize for retrieval. News Corp is making that bet inside ChatGPT. So is Axel Springer. So is Dotdash Meredith. So is Condé Nast. Vox Media is taking the licensing path — and even that may not be enough.
Both can work. But they are different businesses. And the AI engines are no longer treating them as the same product.
The lesson is structural. The lesson is not "paywalls are dead." The lesson is: paywalls without a licensing strategy are now invisible to the audience that begins product research with AI, not Google. That audience is already more than a third of consumers. The share grows every quarter.
Adjacent EPR Frameworks
The Bottom Line
Paywalls are not neutral. Inside the AI engines, they are a structural signal — and right now, they push paywalled outlets without licensing deals down.
The New York Post is positioned to be more retrievable inside ChatGPT than the New York Times. Not because the Post is doing better journalism. Because the Post is structurally accessible — open to the web and licensed to OpenAI — and the Times is structurally not.
This is the new hierarchy. It will harden. The brands and outlets that understand it — and adjust — will be the answer. The ones that do not will be the secondhand source the model cites once it finds an open version.
The chatbox is the new front page. Citation Share is the new circulation. Paywalls without a licensing strategy are the new dark zone.