-
We propose a scalable, EHR-grounded, multi agent pipeline for synthetic multi-party dialogue generation, ensuring realism and factuality via independent rule-based concept and topic-flow checkers and an iterative critique-and-refine loop.
-
We introduce EMSDialog, an EMS-specific synthetic dataset of 4,414 realistic multi-party conversations, generated based on a real-world ePCR dataset and annotated with 43 diagnoses, turn level speaker roles and topics. Human expert and LLM-based evaluations show strong quality at both utterance level (realism, safety, role accuracy, groundedness) and conversation level (logical flow, factuality, diversity).
-
We demonstrate the downstream utility of EMSDialog by training models of different sizes for conversational diagnosis prediction and evaluating them on real-world EMS conversations. Experiments show that EMSDialog-augmented training improves prediction accuracy, timeliness, and stability, and combining synthetic with real data yields the strongest overall performance.
- 🤗 Synthetic EMSDialog Dataset: Link
- 🩺 Our dataset is also available at github repo data
python generate.py --model_name_or_path=Qwen/Qwen3-32B --enable_concept_check --enable_topicflow_check --enable_style_checkcd ./code/bash
./static_train_4b_ours.shcd ./code/bash
./dynamic_train_4b_ours.shIf you find this work useful, please consider citing our paper.
@misc{ge2026emsdialogsyntheticmultipersonemergency,
title={EMSDialog: Synthetic Multi-person Emergency Medical Service Dialogue Generation from Electronic Patient Care Reports via Multi-LLM Agents},
author={Xueren Ge and Sahil Murtaza and Anthony Cortez and Homa Alemzadeh},
year={2026},
eprint={2604.07549},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2604.07549},
}