State of AI Visibility
Surfais
We asked five answer engines the same question buyers ask every day — some version of "what is the best CRM for a growing sales team" — and we asked it 600 times. Across ChatGPT, Claude, Gemini, Perplexity and Google's AI Overviews, in the United Kingdom, Ireland and the United States, over a fortnight in May. Then we recorded, for every answer, which brands were named, in what order, and which sources the engine reached for to get there.
The point was not to crown a winner. It was to read the water: to see who the engines reach for, how much they disagree, and where a brand's share of voice actually sits when nobody is performing for an analyst. The findings below are drawn from synthetic-but-representative runs — the kind of result our instrument returns on a real category.
Across 600 answers, 31 distinct CRM brands were named at least once. But the distribution was brutally top-heavy. The four most-named brands accounted for 71% of all mentions. Eleven brands were named exactly once.
The headline: two incumbents took half the room. But the more useful number is the gap between share of voice and first-mention rate. HubSpot was named almost as often as Salesforce, yet led the answer slightly less often — the engines reach for it constantly but tend to position it second. Pipedrive's story is the inverse of the long tail's: a respectable 12% share, but it is rarely the name an engine opens with.
The single most important finding for any brand: there is no such thing as "the AI answer". There are five, and they diverge sharply.
A brand sitting at 12% overall share can be at 19% inside Gemini and 4% inside ChatGPT. Measuring one engine and generalising is the central mistake — the variance between engines is larger than the variance between most mid-tier brands.
If you optimise for the average answer, you optimise for an answer no engine actually gives.
We logged 1,180 source citations across the runs. They concentrated almost as tightly as the brands did. Five domains accounted for 64% of all citations.
The pattern is consistent with what we see across categories. Third-party review aggregators set the order, community discussion adds or removes the challengers, and the vendor's own pages decide whether the engine can describe the brand clearly enough to name it with confidence. The brands that punched above their market share — Pipedrive especially — almost always had unusually clean, specific product pages and a strong, recent G2 presence.
The same prompt, asked from Dublin, named a different shortlist than the same prompt asked from Chicago. UK and Irish runs surfaced region-aware tools and were noticeably more likely to mention pricing in pounds and data-residency considerations. US runs defaulted to the global incumbents and rarely localised at all. For any brand selling across borders, a single-country read is a partial read.
Three things, in order.
None of this is visible in a traffic dashboard. It is visible only by asking the engines the same questions buyers ask, over and over, and writing down what they say. That is the read. The water is moving — the only question is whether you are measuring it.