docling-skill

docling-skill is a local, agent-first ingestion layer built on top of Docling. It converts local documents into a stable source.* sidecar contract that LLM agents can inspect before consuming.

中文 README

Use it when an agent needs risk-aware local PDF, Office, HTML, text, or Markdown conversion before downstream reasoning, retrieval, wiki ingestion, or handoff. The key output is not just Markdown; it is an inspectable manifest with minimum viability gates, risk level, warnings, and evidence signals.

What It Does

Supported local inputs: pdf, docx, xls, xlsx, csv, html, txt, and md.

Each successful conversion writes:

Artifact	Purpose
`source.manifest.json`	Quality risk, routing, remediation, and evidence metadata
`source.md`	Default agent-readable Markdown
`source.docling.json`	Authoritative structured Docling export from the same conversion result
`source.images.json`	Always-written image sidecar list; empty when extraction is unavailable or no images are found
`source.meta.json`	Lightweight ingestion metadata for downstream workflows

Downstream rule:

Read source.manifest.json first.
Inspect quality.status, quality.risk_level, quality.warnings, and quality.signals.
If quality.agent_ready is true, source.md is usable as the default agent input.
Use source.docling.json when structure, recovery, or deeper inspection matters.
Resolve image placeholders such as [[image:picture-p2-1]] through source.images.json.

The automatic quality model is a risk screen, not a semantic audit. A low-risk result means no hard failure was detected; it does not prove source fidelity or complete source-to-Markdown alignment.

docling-skill intentionally does not fetch remote URLs, chunk documents, or emit downstream knowledge fields such as tags, keywords, categories, or summaries.

Install

pip install "git+https://github.com/realraelrr/docling-skill.git@v1.1.0"
docling-skill "/path/to/file.pdf" "/tmp/docling-sidecar"

If your environment uses SOCKS proxies:

pip install "docling-skill[proxy] @ git+https://github.com/realraelrr/docling-skill.git@v1.1.0"

For local development:

git clone https://github.com/realraelrr/docling-skill.git
cd docling-skill
pip install -e ".[proxy]"

Use

CLI:

docling-skill "<input_path>" "<output_dir>"

Equivalent module entrypoint:

python -m docling_skill.cli "<input_path>" "<output_dir>"

PDF-oriented OCR options:

--ocr-engine auto|tesseract|ocrmac|rapidocr
--ocr-lang <lang>
--force-full-page-ocr
--no-ocr-remediation

Manifest check:

python3 -c 'import json, pathlib; p = pathlib.Path("/tmp/docling-sidecar/source.manifest.json"); m = json.loads(p.read_text(encoding="utf-8")); q = m["quality"]; print({"status": q["status"], "risk_level": q["risk_level"], "agent_ready": q["agent_ready"], "warnings": q["warnings"], "selected_attempt": m["selected_attempt"]})'

Python API:

from pathlib import Path

from docling_skill import convert_document_to_ingestion_outputs

outputs = convert_document_to_ingestion_outputs(
    input_path=Path("/path/to/file.html"),
    output_dir=Path("/tmp/docling-sidecar"),
)

manifest = outputs["manifest"]
if not manifest["quality"]["agent_ready"]:
    raise RuntimeError(manifest["quality"])

if manifest["quality"]["risk_level"] != "low":
    print(manifest["quality"]["warnings"])
    print(manifest["quality"]["signals"])

markdown_text = outputs["markdown_text"]
structured_document = outputs["docling_document"]
images = outputs["images"]
meta = outputs["meta"]

Skill Integration

This repository supports both Codex and Claude Code from the same source skill.

Codex: .codex/skills/docling-skill/SKILL.md
Claude Code: .claude/skills/docling-skill/SKILL.md
Root source: SKILL.md

For global installs, use the same target layout under the user's home directory:

Codex: ~/.codex/skills/docling-skill/SKILL.md
Claude Code: ~/.claude/skills/docling-skill/SKILL.md

Agent Setup

If you are Codex, Claude Code, or another LLM agent setting up this project from scratch:

Treat this repository as the source of truth.
Clone it into the user's local code workspace.
Install it into the existing conda environment named docling, using the proxy extra when SOCKS proxies are configured.
Point both agent skill entrypoints at this repo's root SKILL.md, preferably with symlinks instead of copies.
Validate the root skill and both installed skill paths with the validator available in the active runtime.
Run the test suite.
Do not commit .state/ or other agent working files.

Expected verification:

# If the Codex skill validator is available:
conda run -n docling python "$HOME/.codex/skills/.system/skill-creator/scripts/quick_validate.py" .
conda run -n docling python "$HOME/.codex/skills/.system/skill-creator/scripts/quick_validate.py" .codex/skills/docling-skill
conda run -n docling python "$HOME/.codex/skills/.system/skill-creator/scripts/quick_validate.py" .claude/skills/docling-skill

conda run -n docling python -m pytest

Scope

docling-skill is a thin workflow layer on top of official docling, not a Docling fork or official distribution.

The skill workflow contract lives in SKILL.md. Docling supports more formats than this project exposes; new formats should only be added when they preserve the local source.* contract, risk evidence model, and tests.

Acknowledgements

Built on top of Docling, which provides the parser, document model, and format support.

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
.claude/skills/docling-skill		.claude/skills/docling-skill
.codex/skills/docling-skill		.codex/skills/docling-skill
src/docling_skill		src/docling_skill
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
README.zh-CN.md		README.zh-CN.md
SKILL.md		SKILL.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

docling-skill

What It Does

Install

Use

Skill Integration

Agent Setup

Scope

Acknowledgements

About

Uh oh!

Releases 5

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

docling-skill

What It Does

Install

Use

Skill Integration

Agent Setup

Scope

Acknowledgements

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 5

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages