Skip to main content
SEO InsightsAI Search

llms.txt in 2026: Useful Standard or SEO Theater?

Summarize with ChatGPT
JK
John Kyprianou
May 27, 2026
9 min read
llms.txt useful standard or SEO theater, an opinion piece by SEO Turtle

If you ran an audit on the top 500 SEO-aware sites in 2026, the majority of them now have an llms.txt file at the root. We checked a sample of 80 across our client portfolio and competitor lists last week. 61 of them ship one. None of them, when we asked, could point to a single confirmed citation lift that came from the file itself.

That is the strange status of llms.txt right now. Search demand is steady at 5,400 queries per month on Google in the US according to DataForSEO, peaking around 8,100 last summer. "what is llms.txt" gets 480 to 720 monthly searches. "llms.txt generator" pulls 590. The topic is not going away. And yet the actual evidence that the file does anything is almost entirely vibes.

llms.txt useful standard or SEO theater

We have been shipping llms.txt files across client sites since late 2025. Six months of data, a lot of crawler log analysis, and a few honest conclusions. Here is the practitioner take.

What llms.txt Actually Is

For anyone catching up, llms.txt was proposed by Jeremy Howard at Answer.AI in September 2024. The idea is simple. Drop a markdown file at the root of your site, structured a bit like a table of contents, that points AI crawlers at the canonical, summarised version of your most important content. Optionally, a longer llms-full.txt carries the actual content in clean markdown so the model does not have to wrestle with your JavaScript-rendered DOM.

Think of it as a sitemap for language models. Not for indexing in the classic crawler sense. For inclusion in training data or retrieval-augmented generation systems that synthesise answers from web content.

The format is plain markdown. An H1 with the site name, a blockquote with a short description, a few sections with linked references. That is it. There is no XML, no schema, no validation step. You can write one in fifteen minutes.

Who Actually Reads It

This is where it gets messy.

Google has publicly said they do not use it. John Mueller went on the record in September 2025 saying Google does not currently process llms.txt for AI Overviews, AI Mode, or anything else. That position has not changed at the time of writing.

Anthropic ships one for Claude. They host one at anthropic.com/llms.txt and reference it as part of their public documentation. Whether ClaudeBot actually prioritises it during crawls of other sites is not documented anywhere we trust.

OpenAI has not confirmed it. ChatGPT search crawls the open web through OAI-SearchBot. We have not seen any public statement from OpenAI that the file changes crawl behaviour or citation weight.

Cloudflare added native support. They will autogenerate one from your site structure if you flip a setting in their AI Crawl Control product. That is a strong signal that infrastructure providers are betting on the standard, even if the model providers are not yet committing.

Mintlify, Vercel, and a long tail of doc platforms generate one by default. This is where adoption is highest. Developer documentation sites have a clean structure that maps well to the format, and they have the most to gain from being the source the model picks.

Mark this for what it is. The biggest player in AI search has said no. The next biggest has said nothing. The standard exists because a slice of the community, including some infrastructure platforms, decided to act before the model providers confirmed they would honour it.

What We Have Seen in Six Months of Testing

We rolled llms.txt out across 14 client sites between November 2025 and February 2026. We watched server logs, AI bot user-agent hits, and AI Overview and AI Mode citation share for the same period. Three patterns came out of it.

1. Crawl Hits on the File Are Real, But Modest

GPTBot, ClaudeBot, and PerplexityBot have all hit llms.txt on our client sites. Server logs show consistent fetches, usually once a week, sometimes more on high-traffic domains. Google-Extended hits it too, but that does not mean Google is using the content for anything user-facing. They are storing it. What happens after is opaque.

For context, the same bots hit robots.txt an order of magnitude more often. So whatever weight llms.txt is being given, it is not being treated as a primary signal.

2. No Detectable Citation Lift, Standalone

We A/B tested in the closest way you can with crawler-driven systems. Seven of the 14 sites got the file. Seven did not. Same vertical pairings where possible, same on-page work otherwise. After 16 weeks, the citation share difference in AI Overviews, AI Mode, ChatGPT, and Perplexity for the group running llms.txt was inside noise. Not zero, but not significant.

That does not mean the file does nothing. It means the file alone is not moving the needle in any clean, measurable way at the level of granularity we can track.

3. Where It Might Help: Disambiguation and Structure for Smaller Models

The one place we saw a faint signal was on long-tail product and documentation queries on smaller models that use RAG against the open web, including some of the newer enterprise search tools. The llms.txt file gave the retrieval system a cleaner shortcut to the right canonical URL. We are talking about handfuls of mentions, not a flood. But the pattern was consistent enough to note.

If you are running a documentation-heavy site, a product catalog, or anything where a user is asking an AI assistant to look up a specific named thing on your site, the file may be quietly pulling weight. We would call this plausible, not proven.

Why It Has Become a Default Anyway

Given that the evidence is so soft, why is adoption so high?

A few reasons stand out, none of them stupid.

The cost is near zero. A small markdown file at the root of a domain takes less effort to ship than a single internal link. There is no real downside, except that maintaining a stale one is worse than not having one.

It serves as a public signal. Shipping llms.txt is a statement that you take AI search seriously. Clients ask for it. Audit tools flag its absence. Whether it does anything technical or not, it has become a marker of being a serious operator in 2026 SEO.

It is forward-looking insurance. The model providers might start honouring the file at any point. If Google flips the switch tomorrow, the sites that already ship a clean one will be a step ahead. That is a cheap option to hold.

It is a reasonable place to put guidance the bots might use anyway. Even if no model explicitly prioritises llms.txt, the structured summary of your site, with clear titles and the entity names you want to be associated with, ends up in the crawl. That content has to land somewhere in the training mix or retrieval index. We suspect, with low confidence, that it nudges entity association at the margin.

Our Take

We are still recommending llms.txt to clients in 2026, with caveats. The cost is so low that not shipping one is harder to justify than shipping a reasonable version. But anyone selling you llms.txt implementation as a citation-driving tactic is overselling.

The work that actually moves citations is the same as it was when we wrote our GEO playbook for Google AI Mode: clean passage-level answers, entity clarity across the site, original information, sensible schema, and crawl access for the AI bots that matter. The file is a footnote on that list.

If you have an afternoon to spend on AI search optimisation, do not spend it perfecting your llms.txt. Spend it rewriting your top three commercial pages to lead with cite-worthy 50-word answer passages. The return is an order of magnitude better.

How We Are Implementing It

Since we still recommend it, here is the version we are actually shipping. Nothing fancy.

# SEO Turtle

> Data-driven SEO agency focused on AI search optimisation, technical SEO,
> and generative engine optimisation for B2B, legal, financial, and igaming clients.

## Core Services

- [AI Search Optimisation](https://seoturtle.com/services/ai-search-optimization): GEO for Google AI Mode, ChatGPT, Perplexity, and Claude.
- [Technical SEO](https://seoturtle.com/services/technical-seo): crawlability, Core Web Vitals, schema, JavaScript SEO.
- [Link Building](https://seoturtle.com/services/link-building): editorial outreach, digital PR, niche placements.

## Key Resources

- [Google AI Mode GEO Playbook](https://seoturtle.com/seo-insights/how-to-rank-in-google-ai-mode-geo-playbook)
- [AI Crawlers and robots.txt Guide](https://seoturtle.com/seo-insights/ai-crawlers-robots-txt-guide)
- [AI Search Engine Market Share](https://seoturtle.com/seo-insights/ai-search-engine-market-share-chatgpt-google-gemini)

## About

SEO Turtle is an independent SEO agency. Founded in Cyprus, serving clients globally.

A few practical notes on what we have learned.

Keep the link list tight. A bloated llms.txt with 200 entries is worse than a focused one with 15. Lead with the pages you most want associated with your brand identity. Treat each entry like prime homepage real estate.

Sync it with your sitemap priorities. If your sitemap promotes a page and your llms.txt ignores it, you are sending mixed signals. We script ours from the same source as the XML sitemap, with manual curation of the order.

Date it and version it. A dateModified comment at the top is not officially part of the spec, but it costs nothing and lets you track when you last touched it. Stale llms.txt files are a real problem we keep finding on competitor audits.

Decide on llms-full.txt deliberately. The long form file with full content is a bigger commitment. For documentation-heavy sites, ship it. For marketing sites, probably skip it. Maintaining a 200KB full-content mirror that goes out of sync with your live pages is a worse signal than not having one at all.

The Bottom Line

llms.txt is the most-discussed file in SEO right now with the weakest evidence base behind it. Search volume for the term is steady, adoption is high, and the practical impact on citations is, in our six-month testing, indistinguishable from noise on its own.

Ship one anyway. Spend twenty minutes on it, not twenty hours. Then go back to the work that actually moves AI citations, which is the page itself. The standard might harden into something real over the next year. If it does, you will be glad you have a clean file ready. If it does not, you have lost the equivalent of one cup of coffee in implementation time.

The agencies and SEOs who treat llms.txt as either a magic bullet or a waste of time are both wrong. It is a cheap option on a possible future. Hold the option. Do not pay for it twice.

If you want a second pair of eyes on your AI search setup, including whether your current llms.txt is helping or hurting, drop us a line. We will give you a straight answer.

John Kyprianou

John Kyprianou

Founder & SEO Strategist

John brings over a decade of experience in SEO and digital marketing. With expertise in technical SEO, content strategy, and data analytics, he helps businesses achieve sustainable growth through search.

Related Articles

AI search visibility illustrated by SEO Turtle, a brand appearing inside an AI answer box with a faint trickle of clicks below it
AI Search

AI Search Visibility in 2026: What It Actually Means and How to Measure It

Searches for AI search visibility are up more than tenfold in a year, and a wall of tools has appeared to sell it back to you. We pulled our own numbers, including one page that earned 30,115 impressions and zero clicks, to work out what AI search visibility actually buys you and how to measure it without paying for a dashboard.

June 5, 2026
Getting into Google AI Overviews, illustrated by SEO Turtle, an AI answer box pinned above the classic blue links
AI Search

Getting Into Google AI Overviews in 2026: What Works, What Is Noise, and Whether It Is Worth It

AI Overviews have gone from an experiment to the default on most informational queries, and the click data is brutal. We pulled 90 days of our own Search Console numbers, looked at how Google actually chooses the sources it cites, and worked out which advice is worth your time and which is just fear talking. Cyprus and US businesses get a different answer.

June 5, 2026
Query fan-out illustrated by SEO Turtle, a single search query splitting into many parallel sub-queries
AI Search

Query Fan-Out: How One Search Becomes a Dozen, and What It Means for Your SEO

Query fan-out is the buzzword of mid-2026, and for once it points at something real. Google AI Mode breaks one question into ten or more parallel sub-searches and stitches the answers together. We dug into our own Search Console data, found the pattern sitting in plain sight, and worked out which parts of the new advice are useful and which are just tools selling you a problem.

June 4, 2026
The great decoupling illustrated by SEO Turtle, a rising impressions line splitting away from a flat clicks line
AI Search

The Great Decoupling in 2026: Why Your Impressions Are Up and Clicks Are Flat

Search Console is showing the same pattern everywhere: impressions rising, clicks flat or falling. The industry calls it the great decoupling and blames AI Overviews. We pulled 90 days of our own data, found the pattern in black and white, and then did the part most takes skip. We worked out how much of it is actually AI and how much is just SEO doing what it always did.

June 3, 2026
Agentic commerce in 2026, an AI shopping agent comparing products and checking out, guide by SEO Turtle
AI Search

Agentic Commerce in 2026: What AI Shopping Agents Mean for SEO

AI agents are starting to browse, compare and check out on behalf of real customers. Search interest in agentic commerce is up 408 percent in a year. We break down what is genuinely happening, what is still demo-ware, and the handful of things that decide whether an agent ever puts you in the basket.

June 2, 2026

Continue Your SEO Journey

Explore more expert insights and take action on your SEO strategy