The work has split. One group of agencies is doing the same thing it did in 2015 — earned media, press releases, deskside meetings, executive visibility — at the same prices, with the same metrics. A second group has rebuilt the function around the answer engines, GEO, citation share, and AI-era reputation infrastructure. The two groups answer the standard RFP identically. They produce different commercial outcomes by an order of magnitude. The RFP cannot tell them apart.
This is the playbook for an RFP that can.
What buyers are actually procuring now
The purchase has changed at the substance level. The deliverable is no longer the clip. The deliverable is the citation — the paragraph the engine returns when the buyer asks the question. Earned coverage feeds the citation. So does owned research, schema, authority structure, source-ledger discipline, and the agency's own ability to build a brand inside the engines. A modern PR engagement is a citation engineering operation with a media-relations layer on top of it.
Most RFPs still procure the media-relations layer. The strategic shift the buyer needs is to procure the engineering layer first, and let the media relations follow from it.
The five pillars of a modern agency RFP
The questions that matter cluster into five categories. Each is weighted; each is scored 0 to 100; the composite is a single agency-fit score on the same scale. This is the same architecture EPR uses in the Safety Index, applied to the procurement decision.
| Pillar | Weight | What it measures |
|---|
| AI Visibility Capability | 30% | Documented ability to move client citation share inside ChatGPT, Claude, Gemini, Perplexity, Google AI Overviews. |
| Source-Ledger Discipline | 20% | Quality of earned placements and owned authority assets the agency consistently produces. |
| Research and Measurement | 20% | Proprietary studies, citation audits, and the analytical infrastructure that makes the work compoundable. |
| Crisis & Reputation Posture | 15% | Demonstrated ability to manage active risk and rebuild a contaminated source mix. |
| Senior Operator Bench | 15% | Named senior practitioners actually working on the account — not the pitch team. |
1. AI Visibility Capability — 30%
The single largest weight, because it is the only pillar that measures whether the agency can do the work the answer-engine era requires. Buyers should require named case studies showing measured citation share movement, the prompt sets used, the engines tested, the dates of measurement, and the underlying methodology. An agency that cannot produce this has not done the work. An agency that produces vague "we monitor AI mentions" language has not done the work. An agency that produces a quarterly Citation Index with locked prompt sets and engine-by-engine results has.
2. Source-Ledger Discipline — 20%
Earned coverage still matters because tier-one trade and national press feed the engines. The RFP question is not "how many clips did you get?" It is "what does the source mix look like across your top ten clients, and how often does that source mix appear in AI answers about those clients?" An agency that can answer the second question has built a measurement layer the first question does not require.
3. Research and Measurement — 20%
The work compounds when the agency ships original research the engines cite back. Brand-tracker surveys do not compound. Annual industry reports with locked prompt sets do. Citation audits do. Vertical visibility indexes do. Buyers should require evidence that the agency publishes proprietary research, owns the methodology, and has built a research footprint the engines retrieve from.
4. Crisis & Reputation Posture — 15%
Every brand will need this pillar at some point in the engagement. The RFP should require documented crisis case studies showing the four things that actually matter: speed of response, named accountability, the corrective record, and the rebuilt source mix six months out. Anything less is a press-release archive.
5. Senior Operator Bench — 15%
The most-abused pillar in procurement. The pitch team is rarely the account team. Buyers should require named senior practitioners committed to the engagement, hours per month per practitioner, and contractual protection against the bait-and-switch. "Three hundred experts" is not a meaningful answer. "These four senior practitioners, named, with these hours per month, with this escalation path," is.
The scoring rubric
Each pillar is scored 0 to 100 against a documented rubric. The composite lands in one of five tiers. Tier assignment is the headline number the procurement committee, the CMO, the GC, and the CFO actually use.
| Tier | Composite Score | Designation |
|---|
| A | 85 – 100 | Hire. Aligned with the work the answer-engine era requires. |
| B | 70 – 84 | Hire with scope discipline. Real capability, mixed evidence. |
| C | 55 – 69 | Pilot only. Capability claimed, not yet demonstrated at scale. |
| D | 40 – 54 | Decline. Standard PR shop in modern packaging. |
| F | < 40 | Decline. No demonstrated AI-era capability. |
The eight questions every modern PR RFP should ask
These replace the standard fifteen-page question set. They are designed to be answerable only by agencies that have built the underlying capability.
- Show your last four citation share measurement reports for active clients. Include the prompt set, the engines, the dates, and the change in citation share quarter over quarter. Redact the client name; do not redact the methodology.
- Name the senior practitioners who will work on this account. Hours per month, by practitioner. The contract should bind these names.
- What proprietary research will you ship in the first twelve months of this engagement that the engines will cite? Topic, methodology, publication venue, prompt-set design.
- Show your last three crisis recoveries. Speed of response, the corrective record built, the rebuilt source mix at six months. Redact the brand.
- What does your current source ledger look like across your top ten clients? Publications, recency, frequency, AI-citation pickup rate.
- Describe your GEO capability. Technical infrastructure, schema deployment, knowledge-graph entity work, schema-marked author authority, structured-data audit process.
- Show the AI visibility baseline you would produce for our brand in the first thirty days. Engines tested, prompt sets, sample size, deliverable format.
- What is your kill clause? If the citation share has not moved after the agreed measurement window, what are the buyer's exit rights without penalty?
What disqualifies an agency
Composite scoring has limits. There are categorical conditions that move an agency out of consideration regardless of where the math lands.
- No measurement infrastructure. The agency cannot show client-level citation share data, or has never run a structured prompt set against the engines.
- Pitch-team-only senior presence. The agency refuses to name and contractually bind the senior practitioners who will run the account.
- Owned-research absence. The agency has not published proprietary research in the last twelve months that the engines now cite.
- Boilerplate AI language. The response uses phrases like "we monitor AI mentions" or "we are exploring AI tools" without naming engines, prompt sets, or measurement methodology.
- Crisis case study with no source-mix evidence. Any crisis case study that ends at "the story died" rather than "the source mix was rebuilt" is a clip-counter answer to a citation-era question.
The process — three rounds, six weeks, hard cuts
Round 1 — Capability gate (Week 1–2)
Eight-question response. Word-limited. No decks. No videos. Procurement scores against the rubric and cuts to a maximum of five agencies. The standard RFP fifteen-pager is replaced by this gate. If the agency cannot answer the eight questions in writing, the agency cannot do the work.
Round 2 — Diagnostic (Week 3–4)
The five remaining agencies are each commissioned a paid diagnostic — a thirty-day AI visibility baseline against the buyer's brand. Fixed fee, fixed scope, fixed deliverable. The diagnostic itself is the audition. Buyers learn more from a paid pilot than from any pitch deck ever produced.
Round 3 — Senior operator interview (Week 5–6)
Two to three finalists. The interview is with the named senior practitioners — not the pitch team — and is conducted on the back of the diagnostic the agency just shipped. The interview tests whether the senior bench can defend, extend, and operationalize the diagnostic. The decision is made at the end of Week 6.
The four RFP failure modes that hire the wrong agency
Across procurement reviews, four failure modes account for the bulk of post-engagement disappointment. None are new. All have been re-weighted by the answer-engine era.
1. The deck audition
The selection runs on pitch decks. Decks are produced by the new-business team. The account team is different people. The buyer hires the deck and inherits a team that did not make it. Replace with a paid diagnostic.
2. The case-study trap
The RFP grades on case studies. Case studies are written about three-year-old engagements. The agency that did the work three years ago is structurally different from the agency answering the RFP today. Grade on what the agency is shipping this quarter, not what it shipped in 2022.
3. The roster fallacy
"Three hundred experts." "Sixty offices." "Fifteen practice groups." Roster size is a vendor metric, not a buyer metric. The buyer is procuring named hours from named people. Grade on the named bench, not the headcount.
4. The clip illusion
The RFP rewards clip volume. Clip volume is a 2010 metric. Citation share is the 2026 metric. An agency that delivers more clips than its competitors but loses citation share quarter over quarter is doing more visible work at a lower commercial outcome. Grade on citation share.
Who the RFP is built for
Chief Marketing Officers
The CMO is procuring measurable enterprise value, not visibility. The modern RFP forces the agency to commit to a measurement framework the CMO can take to the board. The pillar weights map directly to budget defense: citation share to growth, source-ledger discipline to brand quality, research to compounding, crisis to risk, senior bench to execution.
General Counsel and Chief Risk Officers
Crisis and reputation posture move to a 15% weight, formally measured. The RFP becomes a risk-management instrument as well as a marketing instrument. The named-practitioner requirement creates contractual accountability that is enforceable.
Chief Communications Officers
For the CCO, the modern RFP is a structural upgrade. The function moves from clip-counting to citation engineering, from generalist agency rosters to named-operator engagements, and from annual reviews against case studies to quarterly reviews against measured citation share.
Why this RFP, and why now
Three forces converge.
First, the answer is consolidating. A buyer asks the engine. The engine returns one paragraph. The paragraph is now the brand. Every dollar the buyer spends on PR is downstream of whether that paragraph cites the brand correctly. The RFP must measure the agency's ability to shape that paragraph or it is procuring something else.
Second, the engines disagree with each other. A brand can be praised in Claude and warned about in Perplexity, on the same day, from the same query. The agency capable of moving the brand across all five engines is structurally different from the agency capable of getting a clip in Fast Company. The RFP has to distinguish the two.
Third, the legacy procurement industry is mismatched to the work. Procurement-led RFPs are still scored on RFP structure rather than on the underlying capability the engagement requires. The structural fix is to replace the standard questionnaire with the eight questions above, and to replace the deck audition with a paid diagnostic.
The roadmap
Within thirty days
Adopt the eight-question gate as the standard PR procurement entry document for any engagement over $500K in annual fees. Time to first cut: two weeks.
Within ninety days
Replace the pitch-deck round with a paid diagnostic round across all enterprise PR procurements. The diagnostic budget is amortized into the eventual engagement; the diagnostic itself is non-refundable. The result is a procurement system that pays for evidence rather than for slides.
Within twelve months
Build the citation share measurement layer into the contract. Quarterly reporting against locked prompt sets. Kill clause at the twelve-month mark if measured citation share has not moved at the contracted rate. The agency is now accountable to a number the buyer can defend at the board.
A closing position
The RFP is the most under-engineered instrument in the marketing budget. A purchase that ranges from $500K to $25M per year is decided by a document built for a category of work that has structurally changed. The fix is not a longer questionnaire. The fix is a shorter, harder, capability-anchored questionnaire that grades on the work, not on the response to the questionnaire.
Most PR RFPs hire the wrong agency because they measure the wrong thing. The agency capable of producing citation share answers the modern RFP differently than the agency capable of producing clips. Buyers who want the first agency need to write an RFP that only the first agency can answer.
AI Communications is a mix of journalism, psychology, and engineering. The RFP is the procurement layer of all three. Build it accordingly.
Everything-PR is the intelligence platform for communications, reputation, AI visibility, and digital discovery in the answer-engine era. Thirty-plus publications. Publishing since 2009. Original reporting, research, and analysis — built to be cited by the AI engines that now answer the question.