SHIFA — AI Clinical Assistant for Humanitarian Crisis Settings

Built for community health workers in Sudan, DRC, Somalia, Nigeria, and Rwanda, because a wrong triage call in a displacement camp can cost a child's life.

SHIFA comes from the Arabic word شفاء (shifa), meaning healing.

Offline-capable clinical decision support for crisis settings, with 100% urgent recall in validation and deterministic WHO/IMCI safety guardrails.

The Problem

Community health workers in crisis settings make life-or-death decisions with almost nothing. No doctor nearby. No reliable power. Internet that drops in and out, if it exists at all.

The wrong call — keeping a child home who needed urgent referral, or sending someone on a dangerous journey they didn't need — has real consequences. SHIFA exists to give that CHW one more reliable tool.

What makes SHIFA different:

Fine-tuned Gemma 4 E2B runs fully offline on Android after setup — no cloud dependency for clinical decisions.
Two-layer safety architecture: learned model reasoning plus deterministic WHO/IMCI guardrails that model output cannot override.
Built for the actual field: six languages, five crisis-country contexts, physical Android offline smoke testing, SMS/BLE Guard relay, and dashboard outbreak monitoring.

What It Does

Clinical Triage

The CHW speaks or types symptoms. SHIFA listens — in Arabic, Somali, French, Lingala, Kinyarwanda, or Hausa — runs a fine-tuned Gemma 4 model on the device, and gives a clear answer: treat at home, refer routinely, or refer urgently right now. The result is read aloud in the CHW's language, using the closest local or regional voice available on the device so the guidance sounds familiar and easy to understand.

When the models are downloaded, this runs with no internet at all. When they're not, it falls back to the Gemini API. Case data syncs to a coordinator dashboard whenever connectivity returns.

SHIFA Guard

If a CHW encounters a threat in the field, Guard lets them capture photo or video evidence. The offline detector analyzes still images for visible weapons and armed individuals, while Gemini fallback handles richer scene context such as checkpoints or convoy-like situations when connectivity is available. If a threat is confirmed, SHIFA attaches GPS coordinates, queues an SMS alert to saved coordinator numbers via Africa's Talking, logs the event locally, and attempts a Bluetooth mesh relay to other nearby SHIFA devices — so the alert can still propagate even without a cell signal.

Guard also publishes and downloads a compact offline firearm detector (.tflite, 5.35 MB). On Android, still-image Guard evidence can run through a native TFLite bridge before cloud fallback. The current validated release gates alerts on visible firearms: GUN mAP50 is 0.725 against a 0.60 release target. Knife detection is treated as experimental and never triggers dispatch by itself.
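The gating rule described above (firearm detections can trigger dispatch, knife detections alone never do) can be sketched as follows. This is a minimal illustration, not the app's code: `Detection`, `should_dispatch`, `passes_release_gate`, the class label strings, and the `min_score` threshold of 0.5 are all assumptions made for the example.

```python
from dataclasses import dataclass

DISPATCH_CLASSES = {"gun"}        # firearm detections gate alerts
EXPERIMENTAL_CLASSES = {"knife"}  # reported, but never dispatch on their own


@dataclass(frozen=True)
class Detection:  # hypothetical detector output record
    label: str
    score: float


def should_dispatch(detections, min_score=0.5):
    """Return True only if a firearm-class detection clears the score threshold.

    min_score is an assumed value for illustration, not the shipped setting.
    """
    return any(
        d.label in DISPATCH_CLASSES and d.score >= min_score
        for d in detections
    )


def passes_release_gate(map50, target=0.60):
    """Compare a measured mAP50 against the release target quoted above."""
    return map50 >= target
```

With the figures quoted above, `passes_release_gate(0.725)` holds, while a frame containing only a high-confidence knife detection does not dispatch.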

Outbreak Monitoring

Every case logged in the field feeds a coordinator dashboard. Spatial DBSCAN clustering runs over the case records to flag potential hotspots, giving early warning for cholera, meningitis, measles, and other conditions that move fast in displacement settings. On the dashboard map, country boundaries are lightly outlined and regions with active outbreak alerts are highlighted in red for fast coordinator triage.

Current backend rules use condition-specific DBSCAN windows:

Cholera/AWD: 5 cases within 3 km over 48 hours.
Meningitis: 2 cases within 5 km over 168 hours (7 days).
Measles: 3 cases within 10 km over 336 hours (14 days).
Mpox: 2 cases within 5 km over 336 hours (14 days).

Backend tests cover clustered AWD/cholera and Nigeria meningitis alert scenarios.
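The condition-specific rules above can be sketched as a small density check. This is a minimal illustration under stated assumptions, not the backend's implementation: the `RULES` table copies the thresholds given above, while `Case`, `haversine_km`, and `find_clusters` are hypothetical names. The sketch also assumes that the time window means "reported within the last N hours" and that the case count is the DBSCAN minimum-points parameter with the point itself included.

```python
from dataclasses import dataclass
from math import asin, cos, radians, sin, sqrt

# condition -> (minimum cases, radius in km, time window in hours)
RULES = {
    "cholera": (5, 3.0, 48),
    "meningitis": (2, 5.0, 168),
    "measles": (3, 10.0, 336),
    "mpox": (2, 5.0, 336),
}


@dataclass(frozen=True)
class Case:  # hypothetical record shape, for illustration only
    condition: str
    lat: float
    lon: float
    hours_ago: float


def haversine_km(a, b):
    """Great-circle distance between two cases in kilometres."""
    dlat = radians(b.lat - a.lat)
    dlon = radians(b.lon - a.lon)
    h = (sin(dlat / 2) ** 2
         + cos(radians(a.lat)) * cos(radians(b.lat)) * sin(dlon / 2) ** 2)
    return 2 * 6371.0 * asin(min(1.0, sqrt(h)))


def find_clusters(cases, condition):
    """Plain DBSCAN over the recent cases of one condition."""
    min_cases, eps_km, window_h = RULES[condition]
    pts = [c for c in cases
           if c.condition == condition and c.hours_ago <= window_h]
    # Each point's neighbourhood includes the point itself (distance 0).
    neighbours = [
        [j for j, q in enumerate(pts) if haversine_km(p, q) <= eps_km]
        for p in pts
    ]
    labels = [None] * len(pts)  # None = unvisited, -1 = noise, >= 0 = cluster
    cluster_id = -1
    for i in range(len(pts)):
        if labels[i] is not None:
            continue
        if len(neighbours[i]) < min_cases:
            labels[i] = -1  # provisionally noise; may become a border point
            continue
        cluster_id += 1
        labels[i] = cluster_id
        queue = list(neighbours[i])
        while queue:
            j = queue.pop()
            if labels[j] == -1:
                labels[j] = cluster_id  # border point reached from a core point
            if labels[j] is not None:
                continue
            labels[j] = cluster_id
            if len(neighbours[j]) >= min_cases:
                queue.extend(neighbours[j])  # j is a core point: keep expanding
    return [
        [p for p, lab in zip(pts, labels) if lab == k]
        for k in range(cluster_id + 1)
    ]
```

Each returned list is one candidate hotspot. A production version would more likely rely on a tested library implementation with a haversine metric; the pure-Python loop here only keeps the example self-contained.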

Try It

Install the Android preview build

Open the live coordinator dashboard

On first setup, the app downloads the offline E2B clinical model, the Whisper base voice-input model, and the compact Guard firearm detector. Skip those and it runs in cloud fallback mode.

git clone https://github.com/Mist-Labs/shifa.git

# Run the clinical validation suite
cd shifa/ml
pip install unsloth boto3
# Gemma base model access requires Hugging Face approval:
# https://huggingface.co/google/gemma-4-e2b-it
# Published SHIFA artifacts are hosted on public R2 URLs;
# no R2 credentials are needed for download.
python scripts/download_artifacts.py
python finetune/validate.py

# Run the mobile app
cd ../shifa-mobile
npm install
npx expo run:android

Demo video coming before the submission deadline.

Validation Results

Tested on a 60-case WHO IMCI set. The mobile E2B clears every clinical target. E4B clears the decision-accuracy, urgent-recall, dosing, and protocol targets but falls short on danger sign detection (88.3% against a 92% target); the danger-sign naming gap is documented in the results report.

Metric	E2B Mobile	E4B Server	Target
Decision accuracy	95.0%	96.7%	88% ✅
Urgent recall	100.0%	100.0%	95% ✅
Urgent miss rate	0.0%	0.0%	— ✅
Danger sign detection	95.0%	88.3%	92%
Drug dose accuracy	100.0%	100.0%	95% ✅
Protocol adherence	93.3%	100.0%	90% ✅

Neither model missed an urgent case in this 60-case validation set. For a triage tool, that is the number that matters most.
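For readers who want to see how the headline numbers relate, the sketch below computes decision accuracy, urgent recall, and urgent miss rate from (expected, predicted) pairs. It is an illustration only: `urgent_metrics` and the label string `REFER_URGENT` are assumed names, and the actual definitions live in `finetune/validate.py`, which may differ.

```python
def urgent_metrics(pairs, urgent_label="REFER_URGENT"):
    """Compute triage metrics from (expected, predicted) decision pairs.

    urgent_label is a hypothetical label name used for illustration.
    """
    pairs = list(pairs)
    if not pairs:
        raise ValueError("at least one case is required")
    correct = sum(1 for expected, predicted in pairs if expected == predicted)
    urgent = [(e, p) for e, p in pairs if e == urgent_label]
    caught = sum(1 for _, predicted in urgent if predicted == urgent_label)
    # Recall is undefined when the set contains no urgent cases.
    recall = caught / len(urgent) if urgent else float("nan")
    return {
        "decision_accuracy": correct / len(pairs),
        "urgent_recall": recall,
        "urgent_miss_rate": 1 - recall,
    }
```

Urgent miss rate is simply one minus urgent recall, which is why the two rows in the table move together. On a 60-case set each case is worth about 1.7 percentage points, so a single extra error is visible in every metric.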

How It Works

CHW speaks or types symptoms
        ↓
Whisper base STT transcribes offline
        ↓
Fine-tuned Gemma 4 E2B (LiteRT) reasons over the case
        ↓
Deterministic WHO/IMCI guardrails apply
        ↓
Decision + voice response in CHW's language
        ↓
Case logged locally → syncs to dashboard when back online

Two layers of safety: the model handles reasoning and produces structured clinical JSON, and the guardrails enforce hard protocol rules on top of that. Findings such as MUAC < 11.5 cm, bilateral edema, neonatal danger signs, convulsions, sexual violence, meningitis signs, maternal danger signs, and altered consciousness always trigger urgent referral, regardless of what the model output says.
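The override behaviour described above can be sketched as a pure function applied after the model has produced its JSON. This is a minimal sketch, not the contents of `guardrails.py`: the `Observations` type, the sign identifiers, the `REFER_URGENT` label, and the `guardrail_reasons` field are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import Optional

URGENT = "REFER_URGENT"  # hypothetical label; the real schema may differ

# Findings listed above that always force urgent referral.
HARD_DANGER_SIGNS = {
    "bilateral_edema",
    "neonatal_danger_sign",
    "convulsions",
    "sexual_violence",
    "meningitis_signs",
    "maternal_danger_sign",
    "altered_consciousness",
}
MUAC_URGENT_CM = 11.5


@dataclass
class Observations:  # hypothetical structured intake, for illustration
    muac_cm: Optional[float] = None
    signs: set = field(default_factory=set)


def apply_guardrails(model_output, obs):
    """Return the model output, forced to urgent referral if any hard rule fires."""
    reasons = sorted(obs.signs & HARD_DANGER_SIGNS)
    if obs.muac_cm is not None and obs.muac_cm < MUAC_URGENT_CM:
        reasons.append("muac_below_11.5cm")
    if not reasons:
        return model_output  # no hard rule fired: keep the model's decision
    # The rules depend only on observations, never on the model's decision.
    return {**model_output, "decision": URGENT, "guardrail_reasons": reasons}
```

Because the function reads only the observations when deciding whether to override, a model that outputs "treat at home" for a child with a MUAC of 11.0 cm is still routed to urgent referral.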

The Models

Gemma 4 E2B's edge-oriented architecture and LiteRT-LM export path are what make offline clinical inference possible on a mid-range Android device without a cloud dependency.

	E2B	E4B
Base	`google/gemma-4-e2b-it`	`google/gemma-4-e4b-it`
Fine-tuning	QLoRA via Unsloth	QLoRA via Unsloth
Training time	56 min on Kaggle T4	103 min on Kaggle T4
Train loss	0.1759	0.0599
Mobile runtime	LiteRT-LM `.litertlm` · 3.1 GB + Whisper base · 142 MB	GGUF Q4_K_M · 5.0 GB
Target	Mid-range Android (6GB+ RAM)	High-end device / server

2,000 synthetic training cases across 6 languages, 5 countries, 11 clinical conditions. Before the final run, we cleaned out invalid decision aliases (MONITOR, OBSERVE, REFER_NON_URGENT) that had crept into the synthetic data — that single cleanup pushed E2B raw model accuracy from 73% to 83%.
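A cleanup of this kind can be sketched as a filter over the synthetic cases. The sketch assumes the invalid cases were dropped rather than remapped, which the text above does not specify; `VALID_DECISIONS`, `clean_decisions`, and the case dictionary shape are hypothetical.

```python
from collections import Counter

# Hypothetical canonical labels for the three triage outcomes.
VALID_DECISIONS = {"TREAT_AT_HOME", "REFER_ROUTINE", "REFER_URGENT"}


def clean_decisions(cases):
    """Split cases into those with a valid decision and a tally of rejected labels."""
    kept, rejected = [], Counter()
    for case in cases:
        label = str(case.get("decision", "")).strip().upper()
        if label in VALID_DECISIONS:
            kept.append({**case, "decision": label})  # store the normalised label
        else:
            rejected[label] += 1  # e.g. MONITOR, OBSERVE, REFER_NON_URGENT
    return kept, rejected
```

Returning the tally alongside the kept cases makes it easy to confirm which aliases were present and how many training examples each one affected.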

The LiteRT export ran on a Vast.ai A100 SXM4, after two OOM failures on Kaggle T4. The full story is in SHIFA_Technical_Challenges.md.

For the full training flow, validation procedure, and artifact evidence, see ml/TRAINING_AND_VALIDATION_PROCESS.md and ml/TRAINING_AND_VALIDATION_RESULTS.md.

Model Artifacts

Published weights and mobile runtime artifacts are hosted on Cloudflare R2:

Artifact	Link
E2B LiteRT-LM primary mobile runtime	shifa-gemma4-e2b-finetuned.litertlm
E2B LoRA adapter weights	adapter_model.safetensors
E2B GGUF fallback runtime	shifa-gemma4-e2b-q4km.gguf
E2B validation metrics	validation_metrics.json
E2B training manifest	training_manifest.json
Guard firearm detector TFLite	shifa-guard-weapon-detector.tflite
Guard validation metrics	guard/validation_metrics.json

Clinical Coverage

Acute watery diarrhea / cholera · Severe and moderate acute malnutrition · Neonatal danger signs · Severe and non-severe pneumonia · Malaria · Meningitis · Infected conflict wounds · Sexual violence (GBV) · Maternal danger signs · Suspected measles

Field Notes

Physical Android testing confirmed first-run model download, offline E2B analysis, Kinyarwanda output, TTS playback, regional/local voice preference fallback, local case logging, and sync to the backend when connectivity returned. LiteRT-LM .litertlm is now the primary mobile runtime. GGUF stays as a documented fallback.

Offline STT — Whisper base — is part of the first-run setup. Voice recordings try offline transcription first. If that fails and there's no typed input, the app blocks silent analysis and asks the CHW to type or reconnect. No guessing.
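The input rule described above (transcribe offline first, fall back to typed text, otherwise refuse to analyse) can be sketched as follows. This is an illustrative sketch rather than the mobile app's code, which is not Python: `resolve_symptom_text`, `InputRequired`, and the `transcribe` callable are assumed names.

```python
class InputRequired(Exception):
    """Raised when there is no reliable symptom text to analyse."""


def resolve_symptom_text(audio, typed_text, transcribe):
    """Pick the text to analyse, or refuse rather than guess.

    transcribe is any callable taking the audio and returning a string;
    it stands in for the offline speech-to-text step.
    """
    if audio is not None:
        try:
            transcript = transcribe(audio)
        except Exception:
            transcript = None  # offline transcription failed
        if transcript and transcript.strip():
            return transcript.strip()
    if typed_text and typed_text.strip():
        return typed_text.strip()
    # Neither source produced usable text: block the analysis.
    raise InputRequired("Transcription failed. Type the symptoms or reconnect.")
```

Raising an exception instead of returning an empty string forces the caller to handle the blocked case explicitly, so an empty transcript can never reach the model by accident.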

The Guard firearm detector is also part of the first-run offline pack. Android uses a native TFLite bridge for still-image firearm screening, then keeps Gemini Guard analysis as fallback for richer visual context and video evidence. iOS native detector inference is still a separate bridge task. Knife detection remains experimental until a better-balanced dataset is trained.

Setup

# Training pipeline
cd shifa/ml
pip install -r requirements-gpu.txt
python scripts/download_artifacts.py
python finetune/finetune_unsloth.py
python finetune/validate.py

# Mobile (run from the directory that contains the clone)
cd shifa/shifa-mobile
npm install
npx expo run:android   # Android
npx expo run:ios       # iOS simulator smoke test

For iOS device builds and TestFlight, see shifa-mobile/IOS_RUNBOOK.md. The model is not bundled in the IPA — same first-run download flow as Android.

Project Structure

shifa/
├── ml/
│   ├── finetune/
│   │   ├── finetune_unsloth.py        # Training
│   │   ├── validate.py                # Validation suite
│   │   ├── guardrails.py              # WHO/IMCI safety overrides
│   │   └── common.py
│   ├── data/
│   ├── reports/
│   ├── TRAINING_AND_VALIDATION_PROCESS.md
│   └── TRAINING_AND_VALIDATION_RESULTS.md
├── shifa-mobile/
│   └── IOS_RUNBOOK.md
├── SHIFA_Technical_Challenges.md
└── README.md

Built by

Okoli Arinze Evans — Mist Labs · Kigali, Rwanda github.com/OkoliEvans

Made for health workers in the hardest places on earth.
