CUSTODIAN

High-Performance Distributed Misinformation Detection System

Misinformation spreads in milliseconds, while verification takes minutes.

CUSTODIAN bridges this latency gap using a high-concurrency, tiered inference architecture. Built on a Go backend, it provides instant protection by resolving known rumors through a Redis cache in under a millisecond.

For new or unseen content, the system applies asymmetric confidence gating. A lightweight local DistilBERT model rapidly filters high-volume or obvious misinformation, while only ambiguous claims are escalated asynchronously to a cloud-based LLM for deeper reasoning. Human reviewers retain final authority over disputed cases by design.
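
The README does not spell out what makes the gating "asymmetric". One plausible reading is that each label has its own confidence bar, so the local model can act quickly on one class while being more cautious about the other. The sketch below assumes that reading. The type names, the `Gate` function, and the threshold values are hypothetical and are not taken from the CUSTODIAN source.

```go
package main

import "fmt"

// Label is the local classifier's predicted class (hypothetical names).
type Label string

const (
	Misinformation Label = "misinformation"
	Credible       Label = "credible"
)

// Decision says what the orchestrator does with a prediction.
type Decision string

const (
	Accept   Decision = "accept"   // return the local model's verdict
	Escalate Decision = "escalate" // hand off to the async LLM tier
)

// Thresholds holds one confidence bar per label. The values used in main
// are illustrative assumptions, not tuned settings from the project.
type Thresholds struct {
	Misinformation float64
	Credible       float64
}

// Gate accepts a prediction only if its confidence clears the bar for its
// own label; anything below the bar is treated as ambiguous and escalated.
func Gate(label Label, confidence float64, th Thresholds) Decision {
	bar := th.Credible
	if label == Misinformation {
		bar = th.Misinformation
	}
	if confidence >= bar {
		return Accept
	}
	return Escalate
}

func main() {
	th := Thresholds{Misinformation: 0.90, Credible: 0.97}
	fmt.Println(Gate(Misinformation, 0.93, th)) // accept
	fmt.Println(Gate(Credible, 0.93, th))       // escalate
}
```

With these example values, the same 0.93 confidence is enough to flag a claim but not enough to clear one, which is the asymmetry being illustrated.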

By decoupling user response time from expensive AI processing, CUSTODIAN enables fast, scalable misinformation flagging while controlling cost and preserving accuracy.

Key Features

Zero-Cost Infrastructure: Runs entirely on local hardware behind a university firewall, using Cloudflare Tunnels to expose the service from behind CGNAT.
Tiered Inference:
- Layer 1 (Fast): Redis Cache (<1ms)
- Layer 2 (Medium): Local DistilBERT Model (~50ms)
- Layer 3 (Slow/Deep): Async LLM + Web Search Agent (~2s)
Anti-Fragile: The system degrades gracefully. If the LLM is unavailable, the BERT verdict stands on its own. If the Internet is down, verdicts already in the cache are still served.
Eventual Consistency: Users get an instant preliminary result (BERT), which is silently updated to a high-confidence result (LLM) in the background via the "Invisible Intelligence" pattern.
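
The graceful-degradation behaviour can be pictured as a fallback chain: ask the most capable tier first and fall through to the next one when it fails. The following is a minimal sketch of that idea only. `Verdict`, `Tier`, and `Resolve` are hypothetical names, the tiers are stubs, and the sketch does not model the normal cache-first lookup or the asynchronous LLM hand-off.

```go
package main

import (
	"errors"
	"fmt"
)

// Verdict is a hypothetical result type; Source records which tier answered.
type Verdict struct {
	Label  string
	Source string
}

// Tier is any layer that can try to answer a query.
type Tier func(query string) (Verdict, error)

var errUnavailable = errors.New("tier unavailable")

// Resolve tries the tiers in the order given and returns the first verdict
// it can get, so an outage in one tier falls through to the next.
func Resolve(query string, tiers ...Tier) (Verdict, error) {
	for _, tier := range tiers {
		if v, err := tier(query); err == nil {
			return v, nil
		}
	}
	return Verdict{}, errors.New("no tier could answer")
}

func main() {
	// Simulated LLM outage; the other two tiers are healthy stubs.
	llm := func(string) (Verdict, error) { return Verdict{}, errUnavailable }
	bert := func(string) (Verdict, error) {
		return Verdict{Label: "misinformation", Source: "bert"}, nil
	}
	cache := func(string) (Verdict, error) {
		return Verdict{Label: "misinformation", Source: "cache"}, nil
	}

	v, err := Resolve("some claim", llm, bert, cache)
	fmt.Println(v.Source, err) // bert <nil>
}
```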

System Architecture

Ingestion: Query hits the Go Orchestrator.
Cache Layer: Checks Redis. If hit -> Return Instantly.
Scatter-Gather: If miss -> Query dispatched to Postgres (Audit) and BERT Service (Python) simultaneously.
Fast Path: BERT analyzes the text. If confidence > threshold, result is returned and cached.
Async Escalation: If BERT is uncertain, the job is pushed to a Redis Queue.
Deep Blue Worker: A background worker picks up the job:
- Optimizes the search term.
- Queries live web sources.
- Synthesizes a final verdict using Gemini LLM.
Write-Through: The new, high-confidence verdict overwrites the cache and DB, ensuring the next user gets the "Verdict" instantly.
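
The request path above can be sketched in Go as follows. This is an illustration under stated assumptions, not the orchestrator's actual code: an in-memory map stands in for Redis, a slice for the Postgres audit table, a buffered channel for the Redis queue, and `Orchestrator`, `Handle`, and `WriteThrough` are hypothetical names.

```go
package main

import (
	"fmt"
	"sync"
)

// Verdict is a hypothetical result type.
type Verdict struct {
	Label      string
	Confidence float64
}

// Orchestrator holds in-memory stand-ins for the real backing services.
type Orchestrator struct {
	mu        sync.Mutex
	cache     map[string]Verdict   // stands in for Redis
	auditLog  []string             // stands in for the Postgres audit table
	queue     chan string          // stands in for the Redis queue (buffered here)
	classify  func(string) Verdict // stands in for the BERT service
	threshold float64
}

// Handle follows the documented order: cache check, then audit write and
// model call in parallel, then either the fast path or async escalation.
// The boolean reports whether the verdict came from the cache.
func (o *Orchestrator) Handle(query string) (Verdict, bool) {
	o.mu.Lock()
	cached, hit := o.cache[query]
	o.mu.Unlock()
	if hit {
		return cached, true
	}

	var result Verdict
	var wg sync.WaitGroup
	wg.Add(2)
	go func() { // scatter: audit write
		defer wg.Done()
		o.mu.Lock()
		o.auditLog = append(o.auditLog, query)
		o.mu.Unlock()
	}()
	go func() { // scatter: local model inference
		defer wg.Done()
		result = o.classify(query)
	}()
	wg.Wait() // gather

	if result.Confidence > o.threshold {
		o.WriteThrough(query, result) // fast path: cache the confident result
	} else {
		o.queue <- query // uncertain: leave a job for the background worker
	}
	return result, false
}

// WriteThrough is what a background worker would call with its final verdict.
func (o *Orchestrator) WriteThrough(query string, v Verdict) {
	o.mu.Lock()
	o.cache[query] = v
	o.mu.Unlock()
}

func main() {
	o := &Orchestrator{
		cache: map[string]Verdict{},
		queue: make(chan string, 8),
		classify: func(q string) Verdict {
			if q == "obvious hoax" {
				return Verdict{Label: "misinformation", Confidence: 0.98}
			}
			return Verdict{Label: "unknown", Confidence: 0.55}
		},
		threshold: 0.9, // illustrative value
	}
	_, fromCache := o.Handle("obvious hoax")
	fmt.Println(fromCache) // false
	_, fromCache = o.Handle("obvious hoax")
	fmt.Println(fromCache) // true
	o.Handle("ambiguous claim")
	fmt.Println(len(o.queue)) // 1
}
```

In this sketch the caller of an uncertain query still receives the preliminary result immediately; a later call to `WriteThrough` replaces it for subsequent requests.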

Backend: Go (Golang) - Chosen for high concurrency and low latency.
ML/AI: Python, DistilBERT, Gemini API, DuckDuckGo Search.
Data: Redis (Cache), PostgreSQL (Persistent Storage).
Infrastructure: Docker Compose, Cloudflare Zero Trust.

How to run

Prerequisites

Docker & Docker Compose installed.
A .env file with GOOGLE_API_KEY and POSTGRES_PASSWORD.

Note: The ML model is not included in the repository due to size constraints and must be fetched separately.

Deploy

# 1. Clone the repository
git clone https://github.com/souls-syntax/Custodian
cd Custodian

# 2. Clone the model from Hugging Face
git clone https://huggingface.co/souls-syntax/kohai_v1

# 3. Place the model in the expected location
mkdir -p intelligence/models
cp -r kohai_v1/* intelligence/models/

# 4. Build and start the containers
docker-compose up -d --build

# 5. Verify status
docker ps

Limitations

As with all statistical models, CUSTODIAN does not produce absolute truth.
Its outputs should be treated as probabilistic assessments designed to aid verification, not replace human judgment.
