
Agentic Periodontal Digital Twin

Intro

Run locally

Optional

  • AI Summary: create .env.local with GEMINI_API_KEY=...
  • Chat endpoint: add LLM_ENDPOINT=your_url (defaults to http://216.81.248.15:8000/generate)
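
Combined, a `.env.local` enabling both features looks like (values are placeholders; the endpoint default is from above):

```shell
GEMINI_API_KEY=...
LLM_ENDPOINT=http://216.81.248.15:8000/generate
```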

Use

  • Single-click tooth to select, double-click to zoom
  • “AI Summary” for a clinical note; “Chat” to ask questions about the selected tooth
  • Transform panel: move/rotate/scale → Save to persist (localStorage)

Models (optional)

  • Put .glb files in public/ (falls back to procedural teeth if missing)

Charting sequence

Charting Sequence

Voice + tools

ElevenLabs Tools

Automation flow (n8n)

n8n Flow

App screenshot

Webapp Screenshot

Audio walkthrough

https://raw.githubusercontent.com/Arnie016/Project-OrvaX/main/public/audio/how-it-works.mp3

Training dataset

Quick load example (Python):

from datasets import load_dataset

ds = load_dataset("Wildstash/periodontal-reasoning-40k", split="train")

# Each row: {"prompt": str, "completion": str, "label": 1 or -1}
print(ds[0])

DPO/KTO prep (optional):

pos = ds.filter(lambda x: x["label"] == 1)
neg = ds.filter(lambda x: x["label"] == -1)

Off-policy KTO training (brief)

We fine-tuned off-policy with KTO using the positive/negative labels:

from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import KTOConfig, KTOTrainer

base_model = "mistralai/Mistral-7B-Instruct-v0.3"  # example; pick what fits your GPU
tokenizer = AutoTokenizer.from_pretrained(base_model, use_fast=True)
tokenizer.pad_token = tokenizer.eos_token

ds = load_dataset("Wildstash/periodontal-reasoning-40k", split="train")

# TRL's KTO format expects a boolean label: True = desirable, False = undesirable
def map_row(row):
    return {
        "prompt": row["prompt"],
        "completion": row["completion"],
        "label": row["label"] == 1,
    }

train_ds = ds.map(map_row, remove_columns=ds.column_names)

model = AutoModelForCausalLM.from_pretrained(base_model, torch_dtype="auto")

# Sequence-length limits live on KTOConfig, not on the trainer itself
args = KTOConfig(
    output_dir="kto-out",
    per_device_train_batch_size=1,  # single-GPU friendly
    gradient_accumulation_steps=8,
    learning_rate=5e-6,
    num_train_epochs=1,
    bf16=True,
    logging_steps=50,
    save_steps=1000,
    max_prompt_length=128,
    max_completion_length=256,
)

trainer = KTOTrainer(
    model=model,
    args=args,
    train_dataset=train_ds,
    processing_class=tokenizer,  # `tokenizer=` on older TRL releases
)

trainer.train()
trainer.save_model("kto-out")

Notes

Training run: single H100 (Prime Intellect.ai)

  • Hardware: 1× NVIDIA H100 (80 GB) via Prime Intellect.ai
  • Precision: bf16; gradient accumulation to keep microbatch small
  • Model: mistralai/Mistral-7B-Instruct-v0.3 (example)
  • Data: Wildstash/periodontal-reasoning-40k (SFT warm-start on label==1 → KTO)
  • Key settings: per_device_train_batch_size=1, gradient_accumulation_steps=8, lr=5e-6, max_prompt_length=128, max_completion_length=256
  • Rationale: off-policy KTO leverages preference-style labels without online rollouts; single GPU is sufficient with careful context/batch sizing
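
With these settings, the effective batch size works out to the per-device batch times the accumulation steps times the GPU count:

```python
# Effective batch size implied by the key settings above (single H100)
per_device_train_batch_size = 1
gradient_accumulation_steps = 8
num_gpus = 1
effective_batch = per_device_train_batch_size * gradient_accumulation_steps * num_gpus
print(effective_batch)  # 8
```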

1️⃣ Overview

  • Orva is the PerioCharting Agent responsible for periodontal dictation, structured data logging, and webhook persistence.
  • Position in system: ElevenLabs (voice) → n8n (routing) → Supabase (storage) → App (visualization).
  • Purpose: Converts clinical voice dictation into JSON payloads in real time.

2️⃣ Personality & Behavior

  • Tone: calm, professional; pace ~120–140 wpm; concise confirmations only.
  • Responses limited to confirmations/errors; no small talk.
  • Confirmation style: one-line ack; brief 1s pause between prompts.
Dentist: "Pocket depths buccal one seven three-one-two..."
Orva: "Got it, logged successfully." (pause 1s)

3️⃣ Session Goal

  • Collect full-mouth PD and REC in a fixed sequence.
  • Capture flags per site: bleeding, plaque; and per tooth: mobility, furcation.
  • High-pocket summary rule: any site with PD > 4 mm is flagged.
  • Integrates with missingMap renumbering for omitted teeth.

4️⃣ Tool Calling Logic

| Tool | Type | Trigger | Notes |
| --- | --- | --- | --- |
| addPocketDepth2 | Webhook | After PD dictation | Source of truth |
| addRecession2 | Webhook | After REC dictation | |
| addBleeding2 | Webhook | When bleeding flagged | |
| addPlaque2 | Webhook | When plaque flagged | |
| addMobility2 | Webhook | When mobility flagged | |
| addFurcation2 | Webhook | When furcation flagged | |
| addMissingTeeth2 | Webhook | On missing-tooth announcement | Renumbering |
| storePatientMemory2 | Webhook | Non‑clinical remark | mem0/Gemini opt‑in |
| scheduleAppointment | Webhook | Appointment intent | |
| identifyHighPockets | Client | After final REC | Generates summary |

5️⃣ Dictation Parsing Rules

  • Triplets: "3 3 3" → [3,3,3]
  • "Same" → reuse previous triplet
  • "Skip" → { skipped: true }
  • Site overwrite: "Mesial is four" → last triplet[Mesial]=4
  • Edge cases: invalid counts → ask once → skip; missing tooth uses renumbering.
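
The parsing rules above can be sketched in Python. The number-word table and the mesial/mid/distal index order are assumptions for illustration:

```python
import re

# Number words heard in dictation (assumption; extend as needed)
WORDS = {"one": 1, "two": 2, "three": 3, "four": 4, "five": 5,
         "six": 6, "seven": 7, "eight": 8, "nine": 9}

# Site order within a triplet is an assumption
SITE_INDEX = {"mesial": 0, "mid": 1, "distal": 2}

def parse_token(tok, prev):
    """Parse one dictated token into a PD triplet per the rules above."""
    tok = tok.strip().lower()
    if tok == "same":                 # "Same" -> reuse previous triplet
        return list(prev) if prev else None
    if tok == "skip":                 # "Skip" -> { skipped: true }
        return {"skipped": True}
    parts = [p for p in re.split(r"[-\s]+", tok) if p]
    depths = [int(p) if p.isdigit() else WORDS.get(p) for p in parts]
    if len(depths) != 3 or any(d is None for d in depths):
        return None                   # invalid count -> ask once -> skip
    return depths

def overwrite_site(triplet, site, value):
    """Site overwrite, e.g. 'Mesial is four' sets last triplet[mesial] = 4."""
    triplet[SITE_INDEX[site.lower()]] = value
    return triplet
```

For example, `parse_token("three-one-two", None)` yields `[3, 1, 2]`, and `overwrite_site([3, 1, 2], "Mesial", 4)` yields `[4, 1, 2]`.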

6️⃣ Example Input–Output

Input

"Buccal one seven three-one-two two-three-one three-two-two..."

Output

{
  "surface": "buccal",
  "quadrant": 1,
  "teeth": [{ "tooth": 7, "depths": [3,1,2] }]
}
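
A payload of this shape can be sanity-checked before posting. A minimal structural check, with field names taken from the example above (the allowed surface set is an assumption):

```python
def validate_payload(p):
    """Structural check for the PD webhook payload shown above."""
    return (
        p.get("surface") in {"buccal", "lingual", "palatal"}
        and p.get("quadrant") in {1, 2, 3, 4}
        and isinstance(p.get("teeth"), list)
        and all(
            isinstance(t.get("tooth"), int)
            and isinstance(t.get("depths"), list)
            and len(t["depths"]) == 3
            and all(isinstance(d, int) for d in t["depths"])
            for t in p["teeth"]
        )
    )
```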

7️⃣ Prompt Sequence

  • 16 steps: UR buccal PD → UL buccal PD → UR buccal REC → UL palatal REC → … → LL lingual REC.
  • Flags (BOP/plaque/mobility/furcation) prompted after each PD/REC segment.
  • First PD entry calls logFirstBuccalData with the same payload.

8️⃣ Confirmation Protocol

| Event | Response |
| --- | --- |
| PD or REC success | "Got it, logged successfully." |
| Bleeding | "Got it, bleeding logged." |
| Mobility | "Got it, mobility graded." |
| Plaque | "Got it, plaque logged." |
| Furcation | "Got it, furcation logged." |
| Error | "Please repeat." / "Skipping." |

9️⃣ Correction & Revision Protocol

  • In-place replacement of tooth/site values.
  • Rebuild quadrant and resend if numbering changes.
  • Late missing-tooth correction adjusts renumbering and replays affected steps.
  • Flag corrections repost updated webhook; ask at most twice for clarification.

🔟 High-Pocket Analysis

  • Runs after last REC via identifyHighPockets.
  • Rule: site PD > 4 mm → flagged.
  • Example: "High pockets detected on teeth 17 and 26."
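
A minimal sketch of the client-side check, assuming `allPD` maps tooth number to its PD triplet:

```python
def identify_high_pockets(all_pd, threshold=4):
    """Flag teeth where any site PD exceeds the threshold (strictly > 4 mm)."""
    flagged = sorted(tooth for tooth, depths in all_pd.items()
                     if any(d > threshold for d in depths))
    if not flagged:
        return "No high pockets detected."
    return "High pockets detected on teeth " + " and ".join(map(str, flagged)) + "."
```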

11️⃣ Memory Integration

  • Detects non‑clinical notes; posts storePatientMemory2 payload (Mem0/Gemini optional).
  • Recalls prior notes next session to personalize briefings.

12️⃣ Webhook Communication & Security

  • Convex URL: https://glorious-mosquito-126.convex.site/...
  • HMAC: ELEVENLABS_WEBHOOK_SECRET
  • Flow: Tool → n8n → Supabase → 2xx → Orva confirmation.
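
Webhook authenticity can be checked by recomputing the HMAC of the raw request body with the shared secret. A generic HMAC-SHA256 sketch (the exact ElevenLabs signature header format is not shown here, so this is an assumption):

```python
import hashlib
import hmac

def verify_signature(secret: bytes, body: bytes, signature_hex: str) -> bool:
    """Recompute HMAC-SHA256 over the raw body and compare in constant time."""
    expected = hmac.new(secret, body, hashlib.sha256).hexdigest()
    return hmac.compare_digest(expected, signature_hex)
```

Constant-time comparison via `hmac.compare_digest` avoids leaking the signature through timing differences.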

13️⃣ Handoff Logic

| Trigger | Action |
| --- | --- |
| Mentions other specialty | <handoff_to_triage> |
| "back", "main menu" | <handoff_to_triage> |
| Appointment intent | scheduleAppointment |
| Memory remark | storePatientMemory2 |

14️⃣ Guardrails

  • No unsolicited talk; no medical interpretation.
  • Always confirm after success; always follow step order.
  • Retry once on webhook delay.

15️⃣ Completion & Reset

  • "Charting complete — summary delivered. Let me know if you need anything else."
  • Reset transient buffers; keep persistent { allPD }; wait silently.

✅ Optional add‑ons

  • Internal state machine: Listen → Parse → Call tool → Confirm → Await → Next prompt.
  • Example session log; JSON schemas for payload validation.
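
The internal state machine above can be sketched as a simple cyclic enum:

```python
from enum import Enum, auto

class State(Enum):
    """Listen -> Parse -> Call tool -> Confirm -> Await -> Next prompt."""
    LISTEN = auto()
    PARSE = auto()
    CALL_TOOL = auto()
    CONFIRM = auto()
    AWAIT = auto()
    NEXT_PROMPT = auto()

ORDER = list(State)  # Enum preserves definition order

def next_state(s: State) -> State:
    """Advance one step; NEXT_PROMPT wraps back to LISTEN."""
    return ORDER[(ORDER.index(s) + 1) % len(ORDER)]
```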

About

Cursor Hackathon Demo
