Odin — Dataflow-Guided AI Code Review

Open-source AI code review with intra-procedural taint analysis, a public FP-rate leaderboard, and a learning feedback loop.

Odin implements the LLift (OOPSLA 2024) / INFERROI (ICSE 2025) architecture: cheap taint propagation narrows the search space, then an LLM reasons about exploitability only on real candidates. A feedback loop suppresses known false-positive (source, sink) pairs before the LLM runs — so cost and noise drop together over time.

FP rates on 193 clean-code samples: dataflow taint tracker 0.0% (narrow: ~8 sink categories), deterministic rules 8.8% (full ruleset). Semgrep: 2.1% on the same corpus, wider ruleset. The odin-rules number is the fair head-to-head; the dataflow number is scoped to sinks where a full source→sink path is required. Every number is reproducible — see leaderboard.md for methodology and caveats.

One-line install

# No Docker, no server, no checkout — just works (BYOK via env var)
uvx odin-review review path/to/file.py

# Rules only — instant, no LLM
uvx odin-review review path/to/file.py --rules-only

Set your provider once:

export ODIN_LLM_PROVIDER=openrouter
export ODIN_OPENROUTER_API_KEY=sk-or-v1-...
export ODIN_OPENROUTER_MODEL=anthropic/claude-sonnet-4-5

GitHub App — one-click install

Install the GitHub App and Odin automatically reviews every PR in your repos — no webhook setup, no token management.

https://your-odin-instance/api/github/app/install

Or self-host and register your own App:

# .env
ODIN_GITHUB_APP_ID=123456
ODIN_GITHUB_APP_PRIVATE_KEY_PATH=/run/secrets/odin-app.pem
ODIN_GITHUB_APP_WEBHOOK_SECRET=your-secret

Odin posts structured reviews with inline comments, severity badges, and fix suggestions on every PR automatically.

Features

Feature	Details
Dataflow triage	Intra-procedural taint analysis → LLM reasons about exploitability on narrowed candidates only (LLift/INFERROI architecture)
27 deterministic rules	Python, JS, TS, Go, Rust, Java — zero cost, instant
Learning feedback loop	Mark a finding false-positive twice → that (source, sink) pair is suppressed before the LLM runs next time
Honest leaderboard	Public FP-rate benchmark on 60 clean samples + CVE recall; every number reproducible
uvx one-binary install	`uvx odin-review review <file>` — works from a clean machine, BYOK
GitHub App	One-click install, auto-registers webhook, reviews every PR
GitHub webhook	Manual webhook setup if you prefer
MCP server	Use Odin as a tool inside Claude Code or Cursor
Local LLMs	LM Studio, Ollama, or any OpenAI-compatible endpoint
BYOK	OpenRouter, OpenAI, Anthropic
6 languages	Python, JavaScript, TypeScript, Go, Rust, Java

How it works

Client / GitHub PR ──▶ FastAPI + LangGraph
                              │
                   tree-sitter AST parse
                              │
                     LangGraph fan-out (parallel)
         ┌────────────────────┼──────────────────┬──────────────────┐
         ▼                    ▼                  ▼                  ▼
   SecurityAgent        QualityAgent        DocsAgent       DataflowTriage
   (LLM call)           (LLM call)         (LLM call)      taint→LLM triage
         │                    │                  │                  │
         └────────────────────┴──────────────────┴──────────────────┘
                              │
                        Rules Engine
                     (27+ instant checks)
                              │
                         synthesize()
                    (dedup + score + sort)
                              │
               GitHub PR review / Web UI / CLI / SSE stream

DataflowTriage pipeline:

Walk each function body with an intra-procedural taint tracker (assignment-chain + call-arg propagation)
Check the (source_sig, sink_sig) suppression table — skip known-FP pairs before the LLM runs
LLM reasons about exploitability for remaining candidates only (≤20 per file, asyncio.Semaphore(4))
Confirmed false positives feed back into the suppression table — noise drops over time

Benchmarks

FP rate is a first-class metric. We report where Odin loses.

False Positive Rate — 193 clean-code samples

Tool	FP Rate	Notes
`odin-dataflow`	0.0%	0/193 — dataflow taint tracker refuses to fire without a source→sink path
`semgrep`	2.1%	4/193 — the open-source reference point
`odin-rules`	8.8%	17/193 — pattern rules trade precision for coverage

Recall on real CVEs

Tool	SecVulEval (14)	CVE-Bench crits (50)	SWE-bench Verified (50)
`odin-rules`	86%	32% (SOTA ~13%)	100%
`semgrep`	50%	26%	2%
`odin-dataflow`	21%	6%	2%

Every number is reproducible — dataset SHAs pinned, seed fixed at 42:

cd backend
python -m bench.harness --seed 42                           # full head-to-head
python -m bench.harness --dataset clean_corpus --tool semgrep --seed 42
python -m bench.harness --dataset cvebench --tool odin-rules --seed 42
python -m bench.harness --json                              # machine-readable

Full methodology + every number: bench/reports/leaderboard.md

CodeRabbit, Greptile, Qodo, CodeQL, and Copilot runners are wired into the same harness but require API keys / hosted access — once enabled they drop into the tables above.

Quick Start — self-hosted

Option 1: LM Studio (local, fully private)

git clone https://github.com/RahulModugula/odin
cd odin
cp .env.example .env

# .env:
# ODIN_LLM_PROVIDER=lmstudio
# ODIN_LMSTUDIO_MODEL=qwen2.5-coder-32b

docker compose -f docker-compose.yml -f docker-compose.lmstudio.yml up

Open http://localhost:3000

Option 2: OpenRouter (BYOK)

# .env:
# ODIN_LLM_PROVIDER=openrouter
# ODIN_OPENROUTER_API_KEY=sk-or-v1-...
# ODIN_OPENROUTER_MODEL=anthropic/claude-sonnet-4-5

docker compose up

Option 3: OpenAI / any OpenAI-compatible API

# ODIN_LLM_PROVIDER=openai
# ODIN_LLM_API_KEY=sk-...
# ODIN_LLM_MODEL=gpt-4o-mini
docker compose up

CLI

# Install once (no checkout required)
uvx odin-review review path/to/file.py

# Rules only — instant, no LLM, no server
uvx odin-review review path/to/file.py --rules-only

# Staged changes (pre-push check)
uvx odin-review review --staged --rules-only

# Fail CI on high+ severity
uvx odin-review review --staged --fail-on high

# JSON output for scripting
uvx odin-review review path/to/file.py --json | jq .

# Filter by severity and confidence
uvx odin-review review backend/ --min-severity high --min-confidence 0.8

# Noise budget — keep only the top N findings (severity DESC, confidence DESC)
uvx odin-review review backend/ --max-findings 10

# AI Code Validator mode — sharpens the review for Copilot/ChatGPT-authored code
uvx odin-review review generated_file.py --local --ai-generated

Install as a git pre-push hook:

bash cli/install-hook.sh

Flags: --staged · --diff REF · --rules-only · --local · --quiet · --min-severity · --min-confidence · --fail-on · --fail-on-score · --max-findings · --ai-generated · --json · --sarif

Editor + demo surfaces

VS Code extension: vscode-extension/ — on-save rules, one-click full AI review, hover squiggles.
Live demo instance: deploy the public, rate-limited demo with the config in infra/demo/ (ODIN_DEMO_ENABLED=true exposes POST /api/demo/review).
Policy-as-code: drop YAML files in .odin/rules/ to add custom pattern-based rules without writing Python — see app/rules/custom_loader.py for the schema.

GitHub Webhook (manual setup)

openssl rand -hex 32  # generate webhook secret

In your GitHub repo: Settings → Webhooks → Add webhook

Payload URL: https://your-odin/api/webhook/github
Content type: application/json
Events: Pull requests, Issue comments

# .env
ODIN_GITHUB_TOKEN=ghp_...
ODIN_GITHUB_WEBHOOK_SECRET=your-secret

Bot commands in PRs: @odin review · @odin help

Deterministic Rules Reference

ID	Name	Severity	Language
PY001	Bare except clause	HIGH	Python
PY002	Mutable default argument	HIGH	Python
PY003	eval()/exec()	CRITICAL	Python
PY004	Hardcoded secret/credential	CRITICAL	Python
PY005	SQL string formatting	CRITICAL	Python
PY006	High cyclomatic complexity	MEDIUM	Python
PY007	Overly long function	MEDIUM	Python
PY008	Excessive nesting depth	MEDIUM	Python
PY009	Missing type hints	LOW	Python
JS001	Use of var	LOW	JS/TS
JS002	console.log in code	LOW	JS/TS
JS003	XSS via innerHTML	HIGH	JS/TS
JS004	Deep callback nesting	MEDIUM	JS/TS
JS005	JWT decode without verify	HIGH	JS/TS
JS006	Prototype pollution	HIGH	JS/TS
TS001	TypeScript `any` type	MEDIUM	TypeScript
TS002	Non-null assertion overuse	MEDIUM	TypeScript
GO001	Error return value ignored	HIGH	Go
GO002	panic() in library code	HIGH	Go
GO003	Goroutine leak	MEDIUM	Go
GO004	SQL injection via fmt.Sprintf	CRITICAL	Go
GO005	Mutex without deferred Unlock	MEDIUM	Go
GO006	context.Context not first param	LOW	Go
GO007	Hardcoded IP address	LOW	Go
GO008	Unbuffered channel send deadlock	MEDIUM	Go
CL001	TODO/FIXME comment	INFO	All
CL002	File too large	MEDIUM	All
CL003	Magic number	LOW	All
CL004	Hardcoded credential	CRITICAL	All

Configuration (.odin.yaml)

provider:
  name: openrouter
  model: anthropic/claude-sonnet-4-5

review:
  agents: [security, quality, docs]
  severity_threshold: low

ignore:
  paths: [vendor/, node_modules/, "*.min.js"]
  rules: [CL001]

quality_gate:
  min_score: 70
  max_critical: 0
  block_on_fail: false

MCP Server

Use Odin as a tool inside Claude Code or Cursor:

{
  "mcpServers": {
    "odin": {
      "command": "python",
      "args": ["-m", "app.mcp.stdio_runner"],
      "cwd": "/path/to/odin/backend"
    }
  }
}

Available tools: review_code, analyze_file, get_findings, query_codebase

Development

cd backend
uv venv && uv pip install -e ".[dev]"
uvicorn app.main:app --reload

# Tests
pytest tests/ -v --cov=app

# Lint
ruff check . && ruff format --check . && mypy --strict app/

# Benchmark
python -m bench.harness --dataset clean_corpus

vs CodeRabbit

The honest comparison — including where CodeRabbit wins.

	Odin	CodeRabbit Free	CodeRabbit Pro
Price	Free	Free (limited)	$24/dev/mo
Open source	✅ MIT	❌	❌
Self-hostable	✅	❌	❌
Data privacy	✅ never leaves your infra	❌	❌
Local LLMs (LM Studio, Ollama)	✅	❌	❌
BYOK (OpenRouter, OpenAI)	✅	❌	✅
Published FP rate	✅ 0.0% on 60 clean samples	❌	❌
Reproducible benchmarks	✅ `python -m bench.harness`	❌	❌
Taint-guided triage (LLift/INFERROI)	✅	❌	❌
Learning feedback loop	✅ suppresses FPs at generator level	❌	limited
GitHub App one-click install	✅	✅	✅
GitHub webhook	✅	✅	✅
CLI (`uvx odin-review review`)	✅ no Docker needed	❌	❌
PR summary & walkthrough	✅	✅	✅
Inline comments	✅	✅	✅
Deterministic rules	✅ 29 rules, 6 languages	✅ 40+	✅ 40+
MCP server	✅ Claude Code / Cursor	❌	❌
GitLab / Bitbucket	❌ (GitHub only)	✅	✅

Where CodeRabbit wins: more platform integrations (GitLab, Bitbucket, Azure DevOps), more rules out of the box, more mature bot UX, and a larger team maintaining it. If you're on GitLab or want something fully managed, CodeRabbit is a better fit today.

Where Odin wins: if your code can't leave your infrastructure, if you want to understand and audit what's running, if you want FP rates that actually drop over time, or if you want to run it free with your own model — Odin is the only open-source option in this space with published, reproducible benchmarks and a research-backed triage architecture.

License

MIT — use it, fork it, make it better.

Name		Name	Last commit message	Last commit date
Latest commit History 145 Commits
.github		.github
.odin/rules		.odin/rules
backend		backend
cli		cli
frontend		frontend
infra		infra
plans		plans
vscode-extension		vscode-extension
.env.example		.env.example
.gitignore		.gitignore
BLOG_POST.md		BLOG_POST.md
COMPETITIVE_ANALYSIS.md		COMPETITIVE_ANALYSIS.md
CONTRIBUTING.md		CONTRIBUTING.md
LAUNCH_CHECKLIST.md		LAUNCH_CHECKLIST.md
LAUNCH_PLAYBOOK.md		LAUNCH_PLAYBOOK.md
LICENSE		LICENSE
README.md		README.md
TODO_LAUNCH.md		TODO_LAUNCH.md
docker-compose.lmstudio.yml		docker-compose.lmstudio.yml
docker-compose.yml		docker-compose.yml
finalplan.md		finalplan.md
odin.code-workspace		odin.code-workspace
plan.md		plan.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Odin — Dataflow-Guided AI Code Review

One-line install

GitHub App — one-click install

Features

How it works

Benchmarks

False Positive Rate — 193 clean-code samples

Recall on real CVEs

Quick Start — self-hosted

Option 1: LM Studio (local, fully private)

Option 2: OpenRouter (BYOK)

Option 3: OpenAI / any OpenAI-compatible API

CLI

Editor + demo surfaces

GitHub Webhook (manual setup)

Deterministic Rules Reference

Configuration (.odin.yaml)

MCP Server

Development

vs CodeRabbit

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Odin — Dataflow-Guided AI Code Review

One-line install

GitHub App — one-click install

Features

How it works

Benchmarks

False Positive Rate — 193 clean-code samples

Recall on real CVEs

Quick Start — self-hosted

Option 1: LM Studio (local, fully private)

Option 2: OpenRouter (BYOK)

Option 3: OpenAI / any OpenAI-compatible API

CLI

Editor + demo surfaces

GitHub Webhook (manual setup)

Deterministic Rules Reference

Configuration (.odin.yaml)

MCP Server

Development

vs CodeRabbit

License

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages