Methodology

How the AI Presence Index Works

This page is designed to be cited. Every decision in our methodology is documented, weighted, and explained. No black boxes.

The Problem With Measuring AI Visibility

Traditional SEO has Domain Authority. Backlink profiles. Rank tracking. Decades of tooling that tells you exactly where you stand.

GEO has nothing. Until now.

When a buyer asks ChatGPT "what's the best video hosting platform for my SaaS?" is your brand in the answer? Are you first, or fifth? Are you described as a top pick, or just mentioned in passing? Is ChatGPT the only platform that knows you, or are you visible everywhere?

The AI Presence Index answers all of these questions with a single number: your score from 0 to 100.

What the Score Actually Measures

The score is not a vanity metric. It is a proxy for one thing: how likely is a buyer to encounter your brand when they ask an AI system for a recommendation in your category.

It measures four dimensions:

DimensionWeightWhat It Captures
Mention Rate30 ptsAre you in the AI consideration set at all? Weighted by how strongly AI recommends you.
Position30 ptsAre you named first, or fourth? Buyers rarely look past the second recommendation.
Sentiment20 ptsWhen AI mentions you, is it positive, neutral, or cautionary?
Platform Breadth20 ptsAre you visible on all four major AI platforms, or just one?

Total: 0 to 100.

The 5-Step Pipeline

1

Brand Context Inference

When you enter a brand name or URL, we do not guess. We fetch your live homepage and extract the title, meta description, and page content. This is fed to GPT-4o alongside your brand name.

The model returns a structured profile:

  • Specific category — "Video Hosting and Streaming SaaS", not "tech company"
  • Ideal customer profile — who the product is built for
  • Top 5 competitors — used in competitive prompts
  • Key differentiators — used in feature-specific queries
  • Primary use cases — used in use-case queries

Wrong context produces wrong prompts, which produce useless results. This is why we fetch the actual website instead of inferring from a brand name.

2

84 Buyer-Intent Queries

We run 7 competitive prompts across 4 platforms, 3 times each. 84 total query executions per brand.

The 7 Competitive Prompts:

#Prompt TypeExample
C1Category recommendation"What are the best video hosting platforms for SaaS products?"
C2Purchase decision"I need a video hosting platform. Which one should I choose?"
C3Market comparison"Compare the top video hosting tools available right now."
C4Competitor alternative"Best alternatives to Vimeo for businesses"
C5Competitor alternative"Best alternatives to Wistia for developer teams"
C6Feature-specific"Video hosting platform with DRM protection"
C7Use-case specific"Video hosting for online course platforms"

Why 3 Runs Per Query:

AI APIs are non-deterministic. A single run can place your brand first in one call and fourth in the next. By running each prompt three times and taking the median result, score variance collapses from plus/minus 8 points to plus/minus 2 to 3 points. All scoring queries run at temperature 0.

The 4 Platforms:

PlatformModelWhy It Matters
ChatGPTGPT-4o900M+ weekly users. The dominant AI assistant.
PerplexitySonarAI search with real-time web citations. Growing fast in B2B.
ClaudeSonnetStrong enterprise and technical user base.
Gemini1.5 FlashGoogle ecosystem. Deep integration with Search.

Branded Queries — Report Only, Not Scored:

We also run 4 branded queries per platform. These generate verbatim quotes and perception data for your report but do not feed the score. Why? If branded queries counted, any recognisable brand would score well regardless of whether AI actually recommends them to buyers. The score must reflect discovery, not fame.

3

Structured Response Extraction

Each of the 84 competitive query responses is individually analysed by GPT-4o-mini using a strict JSON extraction prompt. For each response, we extract:

  • Mentioned — is the brand specifically named?
  • Position — in what order does it appear relative to other brands?
  • Sentiment — positive, neutral, or negative framing?
  • Recommendation Strength — top pick, recommended, mentioned, or mentioned negatively?
  • Competitors Found — every other brand named in the same response
  • Verbatim Quote — the exact sentence where the brand is first named

After 3 runs of the same prompt, we resolve to a single data point using majority vote for categorical fields and median for position integers.

4

Scoring

Mention Rate (0 to 30 points)

How often does your brand appear across the 28 competitive prompt-platform combinations?

Recommendation strength multiplier:

StrengthMultiplier
Top pick1.0
Recommended0.8
Mentioned0.5
Mentioned negatively0.1
Not mentioned0.0

Query intent multiplier:

Query TypeMultiplier
Category recommendation (C1)1.5x
Purchase decision (C2)1.3x
Competitor alternatives (C4, C5)1.3x
Market comparison (C3)1.2x
Feature-specific (C6)1.1x
Use-case specific (C7)1.0x

Position (0 to 30 points)

PositionPoints
1st30
2nd22
3rd15
4th8
5th or later3
Not mentioned0

The drop-off is intentional and steep.

Sentiment (0 to 20 points)

SentimentWeight
Positive1.0
Neutral0.5
Negative0.0

Calculated only from prompts where your brand was mentioned.

Platform Breadth (0 to 20 points)

PlatformsScore
4/420
3/414
2/48
1/43
0/40

A platform counts if it mentions your brand in at least 2 of the 7 competitive query types.

5

Report Generation

7-section report from all 100 query results:

  1. Executive Summary
  2. Score Breakdown
  3. Platform Intelligence
  4. Competitive Landscape
  5. Brand Perception Analysis
  6. Prompt Gap Analysis
  7. Recommendations

Score Interpretation

ScoreLabelWhat It Means
85 to 100DominantAI consistently recommends you first across most platforms and query types
70 to 84StrongHigh visibility with gaps in specific query types or platforms
55 to 69ModerateAI knows you but competitors are winning most recommendation moments
40 to 54WeakRarely appearing in buyer-intent queries
0 to 39InvisibleNot in the AI consideration set for your category

Benchmarks and Percentiles

Once a category has 8 or more scored profiles, every brand receives: Category Rank, Category Percentile, Category Average, Top Competitor Score.

Score History

Every time a brand is scored, the result is stored and never overwritten. Profile pages display score history as a trend.

Limitations

LimitationImpactHow We Handle It
AI non-determinismplus/minus 2 to 3 point variance after 3-run medianAcceptable. Disclosed here.
Model updatesScores shift when models updateModel version logged. Recalibration notices issued.
Category inference errorsWrong category = wrong promptsHomepage fetched on every run. Lower-confidence runs flagged.
Platform API downtimeFewer than 4 platforms scoredUnavailable platform excluded. Confidence level flagged.
Newer brandsScore may not reflect product qualityScore reflects AI training data, not product merit.

One More Thing

Improving your score requires genuinely improving your AI visibility. There is no shortcut. That is the point. A metric worth having should be hard to fake.

Ready to see where you stand?

Want to improve your score? DerivateX specializes in Citation Engineering for B2B SaaS brands.