Skip to content

feat: robots.txt — allow all AI crawlers#40

Draft
TaprootFreak wants to merge 1 commit into
developfrom
feat/robots-allow-ai-crawlers
Draft

feat: robots.txt — allow all AI crawlers#40
TaprootFreak wants to merge 1 commit into
developfrom
feat/robots-allow-ai-crawlers

Conversation

@TaprootFreak

Copy link
Copy Markdown
Contributor

What

Adds a repo-controlled robots.txt at src/.vuepress/public/robots.txt (VuePress copies public/ verbatim to the site root → served at /robots.txt) that explicitly allows all crawlers, including AI agents.

Why

The documentation is public content and should be discoverable by both search engines and AI agents. Sets User-agent: * → Allow: /, a positive Content-Signal: search=yes, ai-input=yes, ai-train=yes (no ai-train=no), and explicit Allow: / records for ClaudeBot, GPTBot, Google-Extended, CCBot, Bytespider, Amazonbot, Applebot-Extended, meta-externalagent.

No Sitemap: line: docs.deuro.com/sitemap.xml is not served (404) and the VuePress build generates none.

Note

This file becomes the live crawl policy once the site's Cloudflare "Manage robots.txt / Block AI bots" managed setting is disabled for the zone.

Allow search engines and AI agents to crawl the public documentation: wildcard
Allow: / with a positive content signal (no ai-train=no) and explicit Allow
records for the major AI crawlers (ClaudeBot, GPTBot, Google-Extended, CCBot,
Bytespider, Amazonbot, Applebot-Extended, meta-externalagent).

VuePress copies src/.vuepress/public/ verbatim to the published site root, so
this is served at /robots.txt.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant