What is AI visibility and why does it matter for e-commerce?

AI visibility refers to how easily AI assistants like ChatGPT, Perplexity, and Gemini can find, understand, and recommend your business. As more consumers use AI instead of traditional search engines to find products and services, businesses without structured data are invisible to this growing channel. Gartner predicts traditional search volume will drop 25% by 2026.

What is JSON-LD schema markup?

JSON-LD (JavaScript Object Notation for Linked Data) is a method of encoding structured data using the Schema.org vocabulary. It tells AI systems and search engines exactly what your business is, what you sell, your reviews, FAQs, and more in a machine-readable format. It is embedded in your website HTML and is invisible to users but critical for AI comprehension.

What is Generative Engine Optimisation (GEO)?

Generative Engine Optimisation (GEO) is the practice of optimising your website content and structured data specifically for AI-powered search engines and assistants. Unlike traditional SEO which focuses on Google rankings, GEO focuses on making your business recommendable by ChatGPT, Perplexity, Gemini, and other AI systems.

How does the FlinnSchema AI audit work?

The FlinnSchema AI audit analyses your website across 26 factors including schema markup quality, E-E-A-T signals, AI crawler access, Reddit community presence, content freshness, conversational content structure, LLM readability, and llms.txt presence. You receive an overall AI visibility score (capped at 90%) with detailed breakdowns and actionable recommendations.

How long does schema markup implementation take?

Implementation typically takes 1-2 weeks depending on the size and complexity of your store. This includes a full audit, hand-coded JSON-LD markup for all key pages, validation against Google Rich Results Test and Schema.org, and comprehensive documentation.

← Back to Blog

How to Use WebSite Schema to Help AI Search Identify Your Brand

Flinn Evans25 June 2026

Schema MarkupAI VisibilityJSON-LDWebSite SchemaLLM SEOBrand SEOStructured Data

Most site owners think about schema markup in terms of products, reviews, or FAQs. WebSite schema tends to get ignored because it seems too simple to be worth bothering with. But for AI search engines trying to understand what your brand actually is, it might be one of the most important pieces of structured data you can add.

This post covers exactly what WebSite schema does, why AI systems care about it, and how to implement it properly so that tools like ChatGPT, Perplexity, and Gemini can confidently identify and cite your brand.

Why AI Search Engines Struggle to Identify Brands

When a large language model generates an answer that mentions a specific business, it is drawing on patterns learned during training and, increasingly, from live retrieval of web content. To confidently cite a brand, the AI needs to establish a clear, consistent signal: this domain belongs to this named entity, which does this specific thing.

That sounds obvious, but it is surprisingly easy to get wrong. A site with a generic domain name, a homepage that opens with a hero image and minimal text, and no structured data gives AI crawlers very little to work with. They can infer some things from the page content, but inference is error-prone. AI systems prefer explicit signals.

WebSite schema is one of the clearest explicit signals you can send. It is a standardised way of saying: "This website has a name, a URL, and belongs to a specific organisation." That single statement, delivered in machine-readable JSON-LD, anchors your brand identity in a way that prose content alone cannot.

What WebSite Schema Actually Contains

The WebSite type from Schema.org is relatively lightweight compared to something like Product or LocalBusiness. But lightweight does not mean unimportant. Here are the properties that matter most for AI visibility:

name

This is the official name of your website or brand. It should match what you want AI systems to cite. If your brand is "FlinnSchema" but your legal entity is something different, put the brand name here. Consistency matters: the name you use in WebSite schema should match your Organization schema, your social profiles, and your on-page copy.

url

This should be the canonical homepage URL. Use the exact version you want indexed, including the trailing slash (or not) as appropriate. Mixing https://example.com and https://example.com/ across different schema implementations creates unnecessary ambiguity.

potentialAction (SearchAction)

This property lets you describe your site's internal search functionality. AI systems and search engines use it to understand that your site is large enough to have searchable content. It signals authority and depth. If your site has a search bar, include it.

description

A concise description of what your site or business does. Keep it factual and specific. Vague descriptions like "your one-stop shop for everything" are useless to an AI trying to categorise you. Something like "Structured data and schema markup services for e-commerce brands seeking AI search visibility" is far more useful.

inLanguage

Specify the primary language of your content using a BCP 47 language tag (e.g. en-GB). This helps AI systems understand your intended audience and is especially important if you serve a regional market.

A Practical JSON-LD Implementation

Here is a clean, well-structured WebSite schema example you can adapt. This goes in a <script type="application/ld+json"> tag, ideally in the <head> of every page on your site, or at minimum on your homepage.

{
  "@context": "https://schema.org",
  "@type": "WebSite",
  "name": "Your Brand Name",
  "url": "https://www.yourdomain.com",
  "description": "A specific, factual description of what your business does and who it serves.",
  "inLanguage": "en-GB",
  "potentialAction": {
    "@type": "SearchAction",
    "target": {
      "@type": "EntryPoint",
      "urlTemplate": "https://www.yourdomain.com/search?q={search_term_string}"
    },
    "query-input": "required name=search_term_string"
  }
}

If your site does not have internal search, omit the potentialAction block entirely. Do not fabricate a search endpoint. AI systems and Google both validate these properties, and a broken SearchAction is worse than no SearchAction.

Pairing WebSite Schema with Organisation Schema

WebSite schema tells AI systems about the site. Organisation schema tells them about the business behind the site. Together, they form a much stronger brand identity signal than either does alone.

The key is to link them using the publisher property in your WebSite schema, pointing to your Organisation entity. You can do this within a single JSON-LD block using the @graph pattern, which is the cleanest approach:

{
  "@context": "https://schema.org",
  "@graph": [
    {
      "@type": "WebSite",
      "@id": "https://www.yourdomain.com/#website",
      "name": "Your Brand Name",
      "url": "https://www.yourdomain.com",
      "description": "What your business does, specifically.",
      "inLanguage": "en-GB",
      "publisher": {
        "@id": "https://www.yourdomain.com/#organisation"
      }
    },
    {
      "@type": "Organization",
      "@id": "https://www.yourdomain.com/#organisation",
      "name": "Your Brand Name",
      "url": "https://www.yourdomain.com",
      "logo": {
        "@type": "ImageObject",
        "url": "https://www.yourdomain.com/logo.png"
      },
      "sameAs": [
        "https://twitter.com/yourbrand",
        "https://www.linkedin.com/company/yourbrand",
        "https://en.wikipedia.org/wiki/YourBrand"
      ]
    }
  ]
}

The sameAs property in the Organisation block is particularly valuable for AI visibility. It tells AI systems: "This entity also exists at these other locations." The more authoritative those locations are, the more confident the AI becomes in its understanding of your brand. A Wikipedia entry, a LinkedIn company page, a Wikidata entry - these all strengthen the signal enormously.

If you are curious how authoritative source citations factor into AI visibility more broadly, we have covered that in detail in our post on the role of authoritative source citations in AI visibility.

Common Mistakes That Undermine WebSite Schema

Getting the code roughly right is not enough. Here are the mistakes that most commonly cause WebSite schema to fail in practice:

Inconsistent brand name usage

Your name in WebSite schema says "Acme Ltd" but your page title says "Acme" and your social profiles say "Acme Store." AI systems trying to resolve your brand entity will encounter conflicting signals. Pick one canonical name and use it everywhere. Every inconsistency creates doubt.

Duplicate or conflicting schema blocks

This happens frequently on sites using plugins or themes that auto-generate schema. You end up with two WebSite schema blocks on the same page, often with different data. AI crawlers and validators do not crash on this, but they may deprioritise your structured data entirely. Audit regularly. The most common schema markup mistakes that hurt AI visibility post covers this in more depth.

Placing schema only on the homepage

WebSite schema should appear on every page, or at the very least on all high-traffic entry points. AI crawlers do not always enter through your homepage. If a product page or blog post is the first thing they index, you want the WebSite schema present there too.

Stale or outdated information

If you rebrand, change your URL structure, or update your business description, your WebSite schema needs updating at the same time. Stale schema is not neutral. It actively misleads AI systems.

How AI Crawlers Actually Use This Data

GPTBot, ClaudeBot, PerplexityBot, and the other AI crawlers that index the web for LLM training and retrieval all process structured data. They do not just read it and move on. The data gets used to build or update their understanding of named entities.

When GPTBot crawls your site and finds a well-formed WebSite schema with a clear name, a canonical URL, a specific description, and links to your social profiles via sameAs, it can associate all of that into a single entity record. That record gets updated over time as the crawler returns. The more consistent your signals are across visits, the more confident the model becomes in citing you accurately.

This matters especially for brand disambiguation. If your brand name is also a common word or shares its name with another business, entity disambiguation becomes genuinely important. Structured data is one of the few ways you have direct influence over how that disambiguation resolves.

If you want to understand how GPTBot works in more detail and how to make sure it can actually crawl your site, see our guide on what GPTBot is and how to let it crawl your site.

Testing Your WebSite Schema

Once you have implemented your WebSite schema, validate it before assuming it is working. Use Google's Rich Results Test at search.google.com/test/rich-results to check for syntax errors. Use Schema.org's own validator at validator.schema.org for a more thorough structural check.

Beyond syntax, test entity recognition manually. Search for your brand name in ChatGPT, Perplexity, and Gemini. Ask each one: "What does [your brand name] do?" and "What website does [your brand name] have?" The accuracy and confidence of those answers gives you a real-world read on how well your brand entity is understood by AI systems right now.

If the answers are vague, wrong, or the AI hedges heavily, that is a signal your entity definition needs work. WebSite and Organisation schema is the starting point, but you may also need to build more authoritative off-site citations and ensure your on-page content is substantive enough to support AI retrieval. A free AI visibility audit from FlinnSchema can help identify exactly where the gaps are.

Where WebSite Schema Fits in a Broader AI Visibility Strategy

WebSite schema on its own will not make you appear in AI-generated answers for competitive queries. It is foundational, not sufficient. Think of it as the identity layer: it tells AI systems who you are. Everything else, your product schema, your FAQ schema, your content quality, your off-site mentions, builds on top of that identity layer.

If the identity layer is missing or inconsistent, all of your other structured data efforts become less effective. AI systems may still surface your content, but they will do so with less confidence, and less confidence means fewer citations, less prominent mentions, and a higher risk of your brand being misrepresented.

Getting WebSite schema right is one of the lower-effort, higher-impact steps you can take. It takes an hour to implement properly and it pays dividends every time an AI crawler visits your site from that point forward.

At FlinnSchema, we include WebSite and Organisation schema as standard in every implementation we do, because we know that without a solid entity foundation, more complex schema work underperforms. If you want to see how we approach the full picture, here is what we do differently.

Frequently Asked Questions

Does WebSite schema directly affect my Google rankings?

WebSite schema is not a confirmed direct ranking factor for traditional Google search. However, it does influence how Google understands and represents your brand in Knowledge Panels and rich results. For AI search visibility, it has a more direct impact because it provides structured entity data that LLMs use when building their understanding of your brand.

Can I have WebSite schema on every page, or just the homepage?

You can and should include it on every page. AI crawlers do not always enter your site through the homepage. Having WebSite schema present wherever a crawler lands ensures it receives consistent brand identity signals regardless of which URL it indexes first.

What is the difference between WebSite schema and Organisation schema?

WebSite schema describes the website itself: its name, URL, and searchability. Organisation schema describes the business entity behind the site: its legal or brand name, logo, contact information, and links to its presence on other platforms. They are complementary, not interchangeable. Use both, and link them together using the publisher and @id properties.

How long does it take for AI search engines to reflect updated WebSite schema?

There is no fixed timeline. AI crawlers like GPTBot and ClaudeBot return to sites at their own intervals, typically ranging from days to weeks. After an update, your new schema data may not appear in AI-generated responses for several weeks, depending on when each crawler next visits and how frequently the underlying model is updated or retrieval data is refreshed. Consistent, long-term accuracy matters more than speed.

]]>

Want to check your AI visibility?

Run a free audit on your website and see how visible you are to ChatGPT, Perplexity, and other AI search engines.

Run Free Audit