DEIDPROOF

Re-identification risk assessment that computes k-anonymity, l-diversity, and HIPAA Safe Harbor compliance on a dataset.

Healthcare & Life-Sciences — HIPAA, PHI, FHIR/HL7, and clinical data.

pip install cognis-deidproof

deidproof scan .            # → prioritized findings in seconds

Usage — step by step

Install the CLI:
```
pip install cognis-deidproof
```
Check a CSV dataset for re-identification risk, naming your quasi-identifier and sensitive columns:
```
deidproof check dataset.csv --quasi-identifiers zip,age,sex --sensitive diagnosis
```
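To make the two metrics concrete, here is a minimal Python sketch of what k-anonymity and distinct l-diversity measure on a handful of rows. It illustrates the standard definitions only and is not deidproof's implementation; the function name `k_and_l` and the sample rows are hypothetical.

```python
from collections import defaultdict


def k_and_l(rows, quasi_identifiers, sensitive):
    """Return (k, l) for a list of dict rows.

    k: size of the smallest group of rows that share the same
       quasi-identifier values (the equivalence class).
    l: smallest number of distinct sensitive values found in any
       such group (distinct l-diversity).
    """
    groups = defaultdict(list)
    for row in rows:
        key = tuple(row[col] for col in quasi_identifiers)
        groups[key].append(row[sensitive])
    if not groups:
        return 0, 0
    k = min(len(values) for values in groups.values())
    l = min(len(set(values)) for values in groups.values())
    return k, l


# Hypothetical, already-generalized sample data.
rows = [
    {"zip": "021**", "age": "30-39", "sex": "F", "diagnosis": "asthma"},
    {"zip": "021**", "age": "30-39", "sex": "F", "diagnosis": "diabetes"},
    {"zip": "021**", "age": "40-49", "sex": "M", "diagnosis": "asthma"},
    {"zip": "021**", "age": "40-49", "sex": "M", "diagnosis": "asthma"},
]
k, l = k_and_l(rows, ["zip", "age", "sex"], "diagnosis")
print(k, l)  # 2 1
```

Here every quasi-identifier combination covers two rows, so k is 2, but the second group holds only one diagnosis, so l is 1: anyone known to be in that group has their diagnosis revealed even though k looks acceptable. That is why the two metrics are checked together.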

Enforce thresholds — require a minimum k-anonymity and l-diversity:

deidproof check dataset.csv --quasi-identifiers zip,age,sex --sensitive diagnosis --min-k 5 --min-l 2

Read the output. Add --format json for machine-readable results:

deidproof check dataset.csv --quasi-identifiers zip,age,sex --format json > risk.json

Wire it into CI — block a data release that fails k/l targets (non-zero exit):

deidproof check dataset.csv --quasi-identifiers zip,age,sex --sensitive diagnosis --min-k 5 || exit 1

Why deidproof?

Checks whether your 'de-identified' export actually meets k-anonymity, l-diversity, and HIPAA Safe Harbor criteria, and emits a signed risk report: the safety net researchers cite before publishing or sharing data.

deidproof is single-purpose, scriptable, and self-hostable: point it at a target, get prioritized results in the format your workflow already speaks (table · JSON · SARIF), gate CI on it, and let agents drive it over MCP.

Features

✅ k-anonymity
✅ l-diversity
✅ Safe Harbor scan
✅ Analyze rows
✅ Analyze CSV
✅ Runs on Linux/macOS/Windows · Docker · devcontainer
✅ Ports in Python, JavaScript, Go, and Rust (ports/)
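As an illustration of what a Safe Harbor scan looks for, the sketch below checks three of the 18 HIPAA Safe Harbor identifier categories that can be tested mechanically in tabular data: ZIP codes more specific than the first three digits, ages over 89, and dates more specific than a year. It is a simplified sketch, not deidproof's rule set. The function name, column names, and date format are assumptions, and it ignores the population threshold that governs when three-digit ZIP prefixes may be kept.

```python
import re

# Assumes ISO-style dates (YYYY-MM or YYYY-MM-DD); a bare year is acceptable.
FULL_DATE = re.compile(r"^\d{4}-\d{2}(-\d{2})?$")
# Five-digit ZIP or ZIP+4, i.e. more than the first three digits.
FULL_ZIP = re.compile(r"^\d{5}(-\d{4})?$")


def safe_harbor_findings(row, zip_col="zip", age_col="age", date_cols=()):
    """Return a list of human-readable Safe Harbor concerns for one row."""
    findings = []

    if FULL_ZIP.match(str(row.get(zip_col, ""))):
        findings.append(f"{zip_col}: more than the first three ZIP digits")

    # CSV values arrive as strings; non-numeric ages such as "90+" are skipped.
    try:
        age = int(row.get(age_col, ""))
    except (TypeError, ValueError):
        age = None
    if age is not None and age > 89:
        findings.append(f"{age_col}: age over 89 not aggregated into 90+")

    for col in date_cols:
        if FULL_DATE.match(str(row.get(col, ""))):
            findings.append(f"{col}: date more specific than a year")

    return findings


# Hypothetical row that trips all three checks.
risky = {"zip": "02139", "age": "93", "admitted": "2021-03-14"}
print(safe_harbor_findings(risky, date_cols=("admitted",)))
```

A row generalized to `{"zip": "021", "age": "90+", "admitted": "2021"}` yields no findings under these three checks. The remaining identifier categories (names, contact details, record numbers, and so on) need their own detection logic.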

Quick start

pip install cognis-deidproof

deidproof --version

deidproof scan .                       # scan current project

deidproof scan . --format json         # machine-readable

deidproof scan . --fail-on high        # CI gate (non-zero exit)

Example


$ deidproof scan .
  [HIGH    ] DEI-001  example finding             (./src/app.py)
  [MEDIUM  ] DEI-002  another signal              (./config.yaml)

  2 findings · risk score 5 · 38ms

Architecture

flowchart LR
  IN[sources] --> P[deidproof<br/>curate + validate]
  P --> OUT[query / analysis]

Use it from any AI stack

deidproof is interoperable with every popular way of using AI:

MCP server — deidproof mcp (Claude Desktop, Cursor, Cognis.Studio, uncensored-fleet)
OpenAI-compatible / JSON — pipe deidproof scan . --format json into any agent or LLM
LangChain · CrewAI · AutoGen · LlamaIndex — wrap the CLI/JSON as a tool in one line
CI / scripts — exit codes + SARIF for non-AI pipelines
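One way to wrap the CLI as an agent tool is a small Python function that shells out to the documented `deidproof scan . --format json` command and returns the parsed result. This is a sketch under assumptions: the function name `deidproof_scan` and the returned dictionary shape are hypothetical, the JSON schema is not documented here so the report is passed through untouched, and the `runner` parameter exists only so the call can be swapped out in tests.

```python
import json
import subprocess


def deidproof_scan(path=".", runner=subprocess.run):
    """Run `deidproof scan <path> --format json` and return its result.

    Returns a dict holding the process exit code and the parsed JSON
    report (None if the command printed nothing on stdout).
    """
    result = runner(
        ["deidproof", "scan", path, "--format", "json"],
        capture_output=True,
        text=True,
    )
    stdout = result.stdout.strip()
    report = json.loads(stdout) if stdout else None
    return {"exit_code": result.returncode, "report": report}
```

Most agent frameworks accept a plain callable like this as a tool. Returning the exit code alongside the report lets the caller distinguish a clean run from one that tripped a gate.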

How it compares

| Capability | Cognis deidproof | ARX Data Anonymization Tool |
|---|:---:|:---:|
| Self-hostable, no account | ✅ | varies |
| Single command, zero config | ✅ | ⚠️ |
| JSON + SARIF for CI | ✅ | varies |
| MCP-native (AI agents) | ✅ | ❌ |
| Polyglot ports (JS/Go/Rust) | ✅ | ❌ |
| Open license | ✅ COCL | varies |

Built in the spirit of ARX Data Anonymization Tool, re-framed the Cognis way. Missing a credit? Open a PR.

Integrations

Pipes into your stack: SARIF for code-scanning, JSON for anything, an MCP server (deidproof mcp) for AI agents, and a webhook forwarder for SIEM/Slack/Jira. See docs/INTEGRATIONS.md.

Install — every way, every platform

pip install "git+https://github.com/cognis-digital/deidproof.git"    # pip (works today)

pipx install "git+https://github.com/cognis-digital/deidproof.git"   # isolated CLI

uv tool install "git+https://github.com/cognis-digital/deidproof.git" # uv

pip install cognis-deidproof                                          # PyPI (when published)

docker run --rm ghcr.io/cognis-digital/deidproof:latest --help        # Docker

brew install cognis-digital/tap/deidproof                             # Homebrew tap

curl -fsSL https://raw.githubusercontent.com/cognis-digital/deidproof/main/install.sh | sh

Related Cognis tools

phiscrub — Stream-scan logs, CSVs, and free-text notes for PHI (names, MRNs, SSNs, dates, addresses) and redact or tokenize in place.
dicomsweep — De-identify DICOM imaging studies per the DICOM PS3.15 Annex E profile, scrubbing tags and burned-in pixel text.
fhirlint — Validate FHIR R4/R5 resources and bundles against profiles (US Core, etc.) with precise, line-level error reporting.
hl7tap — Parse, pretty-print, diff, and replay HL7 v2 messages over MLLP from the terminal.
consentledger — Maintain a tamper-evident, hash-chained audit log of patient-data access and consent events.
synthcohort — Generate statistically realistic synthetic patient cohorts (FHIR/CSV) from a schema spec for dev and testing.

Explore the suite → 🗂️ all 170+ tools · ⭐ awesome-cognis · 🔗 cognis-sources · 🤖 uncensored-fleet · 🧠 engram

Contributing

PRs, new rules, and demo scenarios are welcome under the collaboration-pull model — see CONTRIBUTING.md and SECURITY.md.

⭐ If deidproof saved you time, star it — it genuinely helps others find it.

Interoperability

deidproof composes with the 300+ tool Cognis suite through JSON in/out and a shared OpenAI-compatible /v1 backbone. See INTEROP.md for the suite map, composition patterns, and reference stacks.

License

Source-available under the Cognis Open Collaboration License (COCL) v1.0 — free for personal, internal-evaluation, research, and educational use; commercial / production use requires a license (licensing@cognis.digital). See LICENSE.

Cognis Digital · one of 170+ tools in the Cognis Neural Suite · Making Tomorrow Better Today
