Skip to content

DOCS-1374 - Implement LLM markdown mirror#6768

Draft
kimsauce wants to merge 1 commit into
mainfrom
docs-1374-md-mirror
Draft

DOCS-1374 - Implement LLM markdown mirror#6768
kimsauce wants to merge 1 commit into
mainfrom
docs-1374-md-mirror

Conversation

@kimsauce
Copy link
Copy Markdown
Collaborator

@kimsauce kimsauce commented Jun 5, 2026

Purpose of this pull request

Adds a Docusaurus postBuild plugin that copies every .md and .mdx file from docs/ into {outDir}/llm/docs/ on every production build. This creates a raw-markdown mirror of the full documentation at predictable URLs like /help/llm/docs/search/get-started-with-search.md, making source content directly accessible to AI crawlers without HTML parsing.

Also generates /llm/docs/index.txt listing every mirrored path for easy discovery.

Supporting changes:

  • static/llms.txt — updated to advertise the mirror URL pattern and index
  • static/robots.txt — added User-agent: Algolia Crawler block to block /llm/ (Algolia shouldn't index raw markdown; AI crawlers like GPTBot/ClaudeBot are unaffected)

Select the type of change

  • Minor Changes - Typos, formatting, slight revisions
  • Update Content - Revisions, updating sections
  • New Content - New features, sections, pages, tutorials
  • Site and Tools - .clabot, version updates, maintenance, dependencies, new packages for the site (Docusaurus, Gatsby, React, etc.)

Ticket (if applicable)

https://sumologic.atlassian.net/browse/DOCS-1374

Adds a postBuild plugin that copies all .md/.mdx files from docs/ to
{outDir}/llm/docs/ on every production build, making raw source
accessible to AI crawlers at /help/llm/docs/{path}.md. Also generates
an index.txt listing every mirrored path.

Updates llms.txt to advertise the mirror and robots.txt to block
Algolia from crawling /llm/ (it's for AI crawlers, not search indexing).

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
@kimsauce kimsauce self-assigned this Jun 5, 2026
@cla-bot cla-bot Bot added the cla-signed Contributor approved, listed in .clabot file label Jun 5, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

cla-signed Contributor approved, listed in .clabot file

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant