From 9c459ef51f984d1264785d2ee8409e10f71ec76e Mon Sep 17 00:00:00 2001 From: TaprootFreak <142087526+TaprootFreak@users.noreply.github.com> Date: Thu, 4 Jun 2026 19:26:25 +0200 Subject: [PATCH] Add repo-controlled robots.txt allowing AI crawlers Add a version-controlled robots.txt that serves as the authoritative crawl policy for the JuiceDollar documentation site (docs.juicedollar.com). The file explicitly welcomes both search engines and AI agents to crawl, index, and learn from this public documentation. - Wildcard group allows all user-agents and sets a Content-Signal granting search, AI input / RAG, and AI training. - Major AI crawlers (ClaudeBot, GPTBot, Google-Extended, CCBot, Bytespider, Amazonbot, Applebot-Extended, meta-externalagent) are additionally listed by name, since some honor only their own record. The file lives in src/.vuepress/public/, which VuePress copies verbatim to the published site root (dist/robots.txt). No Sitemap directive is included because the site does not currently publish a sitemap.xml. --- src/.vuepress/public/robots.txt | 38 +++++++++++++++++++++++++++++++++ 1 file changed, 38 insertions(+) create mode 100644 src/.vuepress/public/robots.txt diff --git a/src/.vuepress/public/robots.txt b/src/.vuepress/public/robots.txt new file mode 100644 index 0000000..a628cd0 --- /dev/null +++ b/src/.vuepress/public/robots.txt @@ -0,0 +1,38 @@ +# robots.txt — JuiceDollar documentation (docs.juicedollar.com) +# +# Public documentation. We explicitly WANT both search engines and AI agents to +# crawl, index, and learn from this content. This file is version-controlled in +# this repository and is the authoritative crawl policy for this site. +# +# Content signals: all uses are granted — search, AI input / retrieval-augmented +# generation, and AI training. We deliberately do NOT signal ai-train=no. + +User-agent: * +Allow: / +Content-Signal: search=yes, ai-input=yes, ai-train=yes + +# Major AI crawlers are explicitly welcome. Some honor only their own named +# record, so each is listed in addition to the wildcard group above. +User-agent: ClaudeBot +Allow: / + +User-agent: GPTBot +Allow: / + +User-agent: Google-Extended +Allow: / + +User-agent: CCBot +Allow: / + +User-agent: Bytespider +Allow: / + +User-agent: Amazonbot +Allow: / + +User-agent: Applebot-Extended +Allow: / + +User-agent: meta-externalagent +Allow: /