LLMSVIZ documentation

See what LLMs are saying about your brand.

LLMSVIZ measures how ChatGPT, Claude, Perplexity, Gemini, and Google AI Overviews cite your brand, catches inaccuracies in their answers, and surfaces the technical fixes that move you up in the response set.

What LLMSVIZ is

Live

A closed-beta SaaS for tracking and improving brand visibility in generative engines. Three things, in priority order:

  1. Inaccuracy detection. Diff what LLMs say about you against your canonical facts. The flagship surface.
  2. Multi-engine tracking. Run a prompt set across 5 engines (API + real browser) on a recurring schedule. Log every citation, mention, and sentiment.
  3. Technical GEO diagnostics. Audit your site for the bot-accessibility, schema, and chunk-density issues that keep LLMs from picking you up.

Brand-agnostic - everything is configured per tenant in the app. No code is required: all configuration happens through the brand setup form.

Quickstart

Live
  1. Get approved. Submit the form at /request-access. Once you are approved, an admin emails you a magic-link invite.
  2. Sign in. Click the magic link, or use Google. You can set a backup password from your account screen.
  3. Add your brand. Brands → New brand. Fill in name, domain, category, and the four list fields (aliases, competitors, personas, agent instructions).
  4. Add canonical facts. On the brand page, open the Canonical facts tab and add 8-15 facts across the 5 categories (see below). These are the ground truth for the Inaccuracy Engine.
  5. (Optional) BYOK. Settings → add your own Anthropic / OpenAI / Perplexity / Gemini keys. Keys are encrypted at rest with AES-256-GCM. Skip this and tracker runs use the platform pool.
  6. Run. Trigger a tracker run from the brand page, or wait for the weekly schedule. (Tracker runs become available in the next release.)

Core concepts

Live

Share of Language Model (SoLM)

The headline metric. Defined as:

SoLM = (prompts where brand cited ≥ 1 time) / (total prompts in tracked set) × 100

Always reported per-engine. A combined SoLM hides engine-specific dynamics: ChatGPT mirrors Bing top-10, Perplexity over-indexes Reddit, Gemini depends on Google organic, Claude leans on parametric memory.
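
As a minimal sketch of the formula above, computed per engine: the record fields `engine`, `prompt`, and `citations` are assumptions made for illustration, not the LLMSVIZ data model.

```python
from collections import defaultdict

def solm_by_engine(results):
    """Return {engine: SoLM percentage} for a list of result dicts.

    Each result is assumed (hypothetically) to look like
    {"engine": "chatgpt", "prompt": "best crm", "citations": 2}.
    """
    total = defaultdict(set)  # engine -> prompts in the tracked set
    cited = defaultdict(set)  # engine -> prompts where the brand was cited >= 1 time
    for r in results:
        total[r["engine"]].add(r["prompt"])
        if r["citations"] >= 1:
            cited[r["engine"]].add(r["prompt"])
    return {engine: 100 * len(cited[engine]) / len(total[engine]) for engine in total}

results = [
    {"engine": "chatgpt", "prompt": "best crm", "citations": 2},
    {"engine": "chatgpt", "prompt": "crm pricing", "citations": 0},
    {"engine": "perplexity", "prompt": "best crm", "citations": 1},
    {"engine": "perplexity", "prompt": "crm pricing", "citations": 1},
]
print(solm_by_engine(results))  # {'chatgpt': 50.0, 'perplexity': 100.0}
```

A single blended figure for this sample would be 75%, which hides the fact that one engine cites the brand twice as often as the other.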

Citation position

Where in the answer your brand appears. First mention drives materially more downstream conversion than buried mentions. Tracked as first | top_3 | mentioned | absent.
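
One possible reading of the four buckets, sketched under a loud assumption: position is taken to be the brand's rank among all tracked names (brand plus competitors) by first appearance in the answer text. The function name and the ranking rule are hypothetical; the actual classifier may work differently.

```python
def citation_position(answer_text, brand, competitors):
    """Bucket the brand as first | top_3 | mentioned | absent (assumed ranking rule)."""
    text = answer_text.lower()
    # Character offset of each tracked name's first occurrence; -1 means not found.
    offsets = {name: text.find(name.lower()) for name in [brand, *competitors]}
    if offsets[brand] == -1:
        return "absent"
    # Rank is 1 plus the number of competitors that appear earlier in the answer.
    rank = 1 + sum(
        1 for name, offset in offsets.items()
        if name != brand and 0 <= offset < offsets[brand]
    )
    if rank == 1:
        return "first"
    if rank <= 3:
        return "top_3"
    return "mentioned"

print(citation_position("Try Globex, then Initech, or Acme.", "Acme", ["Globex", "Initech"]))  # top_3
```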

Sentiment polarity

Each brand mention gets a -100 to +100 sentiment score plus a 4-bucket label: positive | neutral | negative | inaccurate.

Inaccurate is not negative. An LLM saying "Acme was founded in 2015" when the truth is 2018 is inaccurate, not negative. The remediation paths are different - inaccuracy is fixed via llms.txt and schema updates; negativity is fixed via product or comms work.

Cited vs mentioned

Cited means your URL appears in the response's source list. Mentioned means your name appears in the answer text. They diverge often:

  • Mentioned-but-not-cited → parametric memory recognition without retrieval. The model knows about you but didn't pull a source.
  • Cited-but-not-mentioned → the model used a page from your site but didn't name you in the answer.
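
The distinction can be sketched as two independent checks on one response. The function and parameter names are hypothetical, and the alias match is a naive substring test chosen for brevity.

```python
from urllib.parse import urlparse

def classify_presence(answer_text, source_urls, aliases, domain):
    """Return (mentioned, cited) for one engine response.

    mentioned: any brand alias appears in the answer text (naive substring match).
    cited: any source URL is on the brand's domain or one of its subdomains.
    """
    text = answer_text.lower()
    mentioned = any(alias.lower() in text for alias in aliases)
    cited = False
    for url in source_urls:
        host = (urlparse(url).hostname or "").lower()
        if host == domain or host.endswith("." + domain):
            cited = True
    return mentioned, cited

# Mentioned-but-not-cited: the name appears, but no source from the brand's site.
print(classify_presence("Acme is a popular choice.", ["https://example.org/review"], ["Acme"], "acme.com"))
# Cited-but-not-mentioned: a page from the brand's site is a source, but the name is absent.
print(classify_presence("Several tools offer this.", ["https://www.acme.com/docs"], ["Acme"], "acme.com"))
```

A production matcher would likely need word-boundary handling so that short aliases do not match inside unrelated words.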

Information Gain

The dominant Google ranking signal as of 2026. Measures how much novel knowledge a page contributes vs the existing candidate set. Pages with proprietary data or first-hand evidence get materially more visibility lift than recycled summaries.

The Information Gain scorer (coming soon) rates pages on five dimensions: proprietary data, first-hand evidence, original frameworks, expert attribution, and freshness.

Setting up a brand

Live

Brands → New brand. Each field shapes downstream tracking - hover the (i) tooltip on the form for full guidance. Quick guide:

  • Brand name - the canonical capitalization. Used as the ground truth for spelling and casing checks.
  • Domain - apex domain, no protocol. The audit suite crawls from here.
  • Category - specific market category. Drives prompt expansion.
  • One-liner - your positioning statement. Lands in your generated llms.txt.
  • Brand aliases - all the variants. Misspellings, abbreviations, casing, parent-company name, the bare domain. More aliases → fewer false negatives.
  • Competitors - tracked alongside you. Include obvious AND long-tail.
  • Target personas - 3-5 specific job titles. Drives prompt expansion.
  • Agent instructions - one per line. Embedded in your generated llms.txt.
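
For orientation, the form fields above map naturally onto a small record. This is an illustrative sketch only: the class, its field names, and the two checks are assumptions derived from the field guidance, not the app's actual schema or validation.

```python
from dataclasses import dataclass, field

@dataclass
class BrandConfig:
    name: str        # canonical capitalization
    domain: str      # apex domain, no protocol
    category: str
    one_liner: str
    aliases: list = field(default_factory=list)
    competitors: list = field(default_factory=list)
    personas: list = field(default_factory=list)
    agent_instructions: list = field(default_factory=list)

    def problems(self):
        """Return a list of human-readable issues (hypothetical checks)."""
        issues = []
        if "/" in self.domain:  # catches "https://acme.com" and "acme.com/pricing"
            issues.append("domain should be the apex domain, with no protocol or path")
        if not 3 <= len(self.personas) <= 5:
            issues.append("aim for 3-5 target personas")
        return issues

brand = BrandConfig(
    name="Acme",
    domain="https://acme.com",
    category="CRM for field sales teams",
    one_liner="The CRM built for reps on the road.",
    personas=["VP of Sales"],
)
print(brand.problems())
```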

Canonical facts

Live

The single most valuable input you can give LLMSVIZ. Each fact is one statement an LLM should be able to make about your brand without getting it wrong. The Inaccuracy Engine (coming soon) uses these to flag every wrong claim it finds in tracker runs.

The five categories

  • Company - founding year, HQ, parent company, acquisitions, public history. Example: "Acme was founded in 2018, not 2015".
  • Product - product specs, capabilities, deprecated items, integrations. Example: "The free tier supports unlimited collaborators".
  • People - executive names, titles, founders, board members, public attributions. Example: "Jane Doe has been CEO since March 2022".
  • Numbers - pricing, customer counts, headcount, claimed metrics. Example: "Acme serves 12,000 paying customers".
  • Prohibited - claims that must never be made about your brand. The Inaccuracy Engine flags these as critical-severity. Example: "Acme does not collect biometric data".

Aim for 8-15 facts to start. One claim per fact. Be specific - the more verifiable the fact, the better the diff.
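
To make "one claim per fact" concrete, here is a toy sketch of a fact record and a single check against it. The Inaccuracy Engine's real matching logic is not described here, so the class, the regex, and the function are purely hypothetical and cover only the founding-year example.

```python
import re
from dataclasses import dataclass

CATEGORIES = {"company", "product", "people", "numbers", "prohibited"}

@dataclass(frozen=True)
class CanonicalFact:
    category: str
    statement: str

    def __post_init__(self):
        # Reject anything outside the five documented categories.
        if self.category not in CATEGORIES:
            raise ValueError(f"unknown category: {self.category}")

def founding_year_mismatches(answer_text, canonical_year):
    """Return founding years claimed in the answer that differ from the canonical one."""
    claimed = re.findall(r"founded in (\d{4})", answer_text, flags=re.IGNORECASE)
    return [int(year) for year in claimed if int(year) != canonical_year]

fact = CanonicalFact("company", "Acme was founded in 2018")
print(founding_year_mismatches("Acme was founded in 2015 in Austin.", 2018))  # [2015]
```

A specific, single-claim fact like this one is easy to check mechanically; a compound fact ("founded in 2018 in Austin by two ex-Googlers") would need to be split before a check this simple could work.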

API keys

Live

LLMSVIZ supports two modes for the LLM calls a tracker run makes:

  • Platform pool - default. We use our shared keys, no setup needed. Free during closed beta.
  • BYOK (bring-your-own-keys) - paste your Anthropic / OpenAI / Perplexity / Gemini keys in Settings. Keys are encrypted at rest with AES-256-GCM. We display only the last four characters, and a key is decrypted only at run time. Useful if you want isolated billing or higher rate limits.

Replacing or removing a key takes effect on the next tracker run. We log every save / replace / delete / decrypt event to the audit log.
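
The display and logging behaviour described above can be sketched with the standard library alone. Encryption itself is deliberately left out, and the function names and the shape of the log entry are assumptions made for illustration.

```python
from datetime import datetime, timezone

AUDIT_EVENTS = ("save", "replace", "delete", "decrypt")

def mask_key(api_key):
    """Hide everything except the last four characters of a key."""
    if len(api_key) <= 4:
        return "*" * len(api_key)
    return "*" * (len(api_key) - 4) + api_key[-4:]

def audit_entry(event, provider, user_id):
    """Build one audit-log record (hypothetical shape) for a key event."""
    if event not in AUDIT_EVENTS:
        raise ValueError(f"unsupported audit event: {event}")
    return {
        "event": event,
        "provider": provider,
        "user_id": user_id,
        "at": datetime.now(timezone.utc).isoformat(),
    }

print(mask_key("sk-test-abcd1234"))  # ************1234
print(audit_entry("decrypt", "openai", "user_42")["event"])  # decrypt
```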

Tracked engines

Coming soon

Each engine has its own quirks. We track all five and surface engine-specific recommendations:

  • ChatGPT - mirrors Bing top-10 about 87% of the time. Setting up Bing Webmaster Tools is the highest-ROI move for ChatGPT visibility.
  • Claude - heavier reliance on parametric memory and curated training data. Brand recognition is a longer game; community signals and docs presence dominate.
  • Perplexity - around 47% of citations come from Reddit. Real-time retrieval; fastest engine to reflect new content.
  • Gemini - pulls from Google's organic index. About 76% of cited URLs rank in Google top-10. Traditional SEO is a prerequisite.
  • Google AI Overviews - the answer box on google.com. Distinct from Gemini API. Reduces top-organic CTR by ~59% when present.

Two test modes per engine, run on every tracker pass:

  • api - direct API call. Tests parametric memory when web search is disabled, retrieval when enabled.
  • browser - real Playwright session against the public web UI. Captures what your customers actually see.
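
Since both modes run for every engine on every pass, the size of a tracker pass grows as engines × modes × prompts. A rough sketch, with hypothetical engine identifiers and job shape:

```python
from itertools import product

# Identifiers are illustrative; the tracker's internal names are not documented here.
ENGINES = ["chatgpt", "claude", "perplexity", "gemini", "google_ai_overviews"]
MODES = ["api", "browser"]

def plan_run(prompts):
    """Expand a prompt set into one job per (engine, mode, prompt) combination."""
    return [
        {"engine": engine, "mode": mode, "prompt": prompt}
        for engine, mode, prompt in product(ENGINES, MODES, prompts)
    ]

jobs = plan_run(["best crm for field sales", "acme pricing"])
print(len(jobs))  # 20
```

A 50-prompt set therefore produces 500 calls per pass under these assumptions, which is worth keeping in mind when choosing between the platform pool and your own keys.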

Technical audit

Planned

One-click crawl of your domain that scores GEO readiness across the dimensions LLMs actually care about:

  • robots.txt rules per-bot (GPTBot, ClaudeBot, PerplexityBot, Google-Extended, OAI-SearchBot, Bingbot)
  • SSR vs CSR - do AI crawlers actually see your content?
  • Schema completeness (Organization, Product, FAQ, Article)
  • llms.txt presence and validity
  • ai-plugin.json (ACP / agentic readiness)
  • AI Overview presence on your top keywords
  • RAG chunk simulation - is your page chunkable?
  • Answer Capsule check - are 40-60 word direct answers up top?
  • First Contentful Paint (FCP) via Lighthouse

Each check returns pass/fail/warn with a remediation note and severity ranking.
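
As an illustration of the per-bot robots.txt check, the sketch below uses Python's standard `urllib.robotparser` on a subset of the bots listed above. The function name and the pass/fail mapping are assumptions; the audit's own implementation and severity rules are not described here.

```python
from urllib.robotparser import RobotFileParser

AI_BOTS = ["GPTBot", "ClaudeBot", "PerplexityBot", "OAI-SearchBot", "Bingbot"]

def bot_access(robots_txt, url, bots=AI_BOTS):
    """Return {bot: "pass" | "fail"} depending on whether robots.txt lets the bot fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return {bot: "pass" if parser.can_fetch(bot, url) else "fail" for bot in bots}

robots = """\
User-agent: GPTBot
Disallow: /

User-agent: *
Allow: /
"""

report = bot_access(robots, "https://acme.com/pricing")
print(report["GPTBot"], report["ClaudeBot"])  # fail pass
```

Here a blanket rule written for one crawler blocks GPTBot while every other bot falls through to the wildcard group, which is the kind of per-bot difference the check is meant to surface.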

Generators

Planned

From your brand config and canonical facts, LLMSVIZ generates the assets agents need to find and represent you correctly:

  • llms.txt - Stripe-style instructions to AI agents
  • JSON-LD schema blocks (Organization, Product, FAQ)
  • ai-plugin.json for Agentic Commerce Protocol readiness
  • Information Gain scorer - rates a page on five dimensions and suggests improvements
  • Prompt expander - takes your seed prompts and generates 30-100 realistic variations with persona biasing

Outputs are downloadable. We never auto-publish to your site.
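
As a rough sketch of what a generated schema block might contain, the snippet below builds a schema.org Organization object from brand-config values. The function name and the choice of properties are assumptions; the real generator's output may include more fields.

```python
import json

def organization_jsonld(name, domain, one_liner, aliases):
    """Return a JSON-LD Organization block as a formatted string."""
    doc = {
        "@context": "https://schema.org",
        "@type": "Organization",
        "name": name,
        "url": f"https://{domain}",
        "description": one_liner,
        "alternateName": aliases,
    }
    return json.dumps(doc, indent=2)

block = organization_jsonld(
    name="Acme",
    domain="acme.com",
    one_liner="The CRM built for reps on the road.",
    aliases=["ACME", "Acme Inc", "acme.com"],
)
print(block)
```

The resulting string would be placed inside a `<script type="application/ld+json">` tag on your site, by you, since nothing is published automatically.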

Roadmap

What ships when. Live = available now, Coming soon = next release, Planned = on the path.

  • Live Closed-beta access flow with admin approval queue
  • Live Brand configuration with aliases, competitors, personas, agent instructions
  • Live Canonical facts editor across 5 categories
  • Live BYOK API keys with AES-256-GCM encryption
  • Coming soon 5-engine tracker (ChatGPT, Claude, Perplexity, Gemini, AI Overviews) - API mode
  • Coming soon Real-browser tracking via Browserbase
  • Coming soon Inaccuracy Engine - the flagship surface
  • Planned Technical audit suite
  • Planned Generators (llms.txt, schema, ai-plugin.json) and Information Gain scorer
  • Planned Competitive Landscape view + Crawler Budget tracking + weekly digest

Privacy & ethics

Live
  • We never auto-publish content to your site. Generators output files; you deploy.
  • We don't scrape Reddit, HN, or community platforms to influence sentiment. Astroturfing is a model-blacklist risk and we won't help with it.
  • BYOK keys are stored encrypted at rest with AES-256-GCM. The master key lives in our platform secret manager and never touches the database. Plaintext keys are decrypted only at tracker-run time and never returned to the browser.
  • Every key save, replace, delete, and decrypt event is logged to an append-only audit trail.
  • Tracker runs are append-only. We don't edit historical measurements; if a re-classification is needed, we add a new row.
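
The append-only rule for measurements can be illustrated with an in-memory SQLite table in which a re-classification is a new row pointing at the one it supersedes. The table layout and column names are hypothetical, not the production schema.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    """CREATE TABLE measurements (
           id INTEGER PRIMARY KEY AUTOINCREMENT,
           prompt TEXT NOT NULL,
           engine TEXT NOT NULL,
           label TEXT NOT NULL,
           supersedes INTEGER REFERENCES measurements(id)
       )"""
)

def record(prompt, engine, label, supersedes=None):
    """Insert a measurement; existing rows are never updated or deleted."""
    cur = conn.execute(
        "INSERT INTO measurements (prompt, engine, label, supersedes) VALUES (?, ?, ?, ?)",
        (prompt, engine, label, supersedes),
    )
    conn.commit()
    return cur.lastrowid

def current_rows():
    """Return (id, label) for rows that no later row supersedes."""
    return conn.execute(
        """SELECT id, label FROM measurements
           WHERE id NOT IN (SELECT supersedes FROM measurements WHERE supersedes IS NOT NULL)
           ORDER BY id"""
    ).fetchall()

first = record("best crm", "chatgpt", "negative")
second = record("best crm", "chatgpt", "inaccurate", supersedes=first)
print(current_rows())  # [(2, 'inaccurate')]
```

Both rows remain in the table, so the original classification stays available for review.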

FAQ

Why closed beta?

We're actively shaping the product based on early-user feedback and want to keep the cohort small enough to support well. Submit the access form and we'll let you in.

What does it cost?

Free during closed beta. The long-term plan is hybrid: a platform-paid tier and a BYOK tier. Pricing will be finalised closer to general availability (GA).

What about Gemini API limits / costs?

Gemini's free tier covers light use. The platform pool budget is generous; if you run a large prompt set, BYOK with your own quota is the safer move.

Can I export my data?

Not yet, but export is on the roadmap: CSV and JSON exports per brand, plus all raw API responses for forensic review.

What if I lose my Supabase password / Google login?

Magic link works as a recovery channel. If you also lose access to your email, contact sean@merchanterra.com.

Have a question that's not answered here? Drop a note via the access form or email sean@merchanterra.com.