| Simulation & Benchmarks |
AgentClinic: A Multimodal Agent Benchmark to Evaluate AI in Simulated Clinical Environments |
https://arxiv.org/abs/2405.07960 arXiv |
|
MedAgentBench: A Virtual EHR Environment to Benchmark Medical LLM Agents |
https://arxiv.org/abs/2501.14654 arXiv |
|
3MDBench: Medical Multimodal Multi-agent Dialogue Benchmark |
https://arxiv.org/abs/2504.13861 arXiv |
|
Agent Hospital: A Simulacrum of Hospital with Evolvable Medical Agents |
https://arxiv.org/abs/2405.02957 arXiv |
|
MedAgentSim: Self-Evolving Multi-Agent Simulations for Realistic Clinical Interactions |
https://arxiv.org/abs/2503.22678 arXiv |
|
LLM-based Agent Simulation for Maternal Health Interventions |
https://arxiv.org/abs/2503.22719 arXiv |
| Frameworks & Architectures |
MMedAgent: Learning to Use Medical Tools with a Multi-modal Agent |
https://arxiv.org/abs/2407.02483 arXiv |
|
Coordinated AI agents for advancing healthcare |
https://www.nature.com/articles/s41551-025-01363-2 |
|
MeNTi: Bridging Medical Calculator and LLM Agent with Nested Tool Calling |
https://arxiv.org/abs/2410.13610 arXiv |
|
M³Builder: A Multi-Agent System for Automated Machine Learning in Medical Imaging |
https://arxiv.org/abs/2502.20301 arXiv |
|
ClinicalAgent: Clinical Trial Multi-Agent System with LLM-based Reasoning |
https://arxiv.org/abs/2404.14777 arXiv |
|
Multi-Agent Medical Assistant (general-purpose, on-device) |
https://github.com/souvikmajumder26/Multi-Agent-Medical-Assistant |
|
IMAS: Agentic System for Rural Healthcare Delivery |
https://github.com/uheal/imas |
| Tool-centric Applications |
TxAgent: An AI Agent for Therapeutic Reasoning Across a Universe of Tools |
https://arxiv.org/abs/2503.10970 arXiv |
|
MedAgent-Pro: Towards Multi-modal Evidence-based Medical Diagnosis via Reasoning Agentic Workflow |
https://arxiv.org/abs/2503.18968 arXiv |
|
TrialGPT: Matching Patients to Clinical Trials with Large Language Models |
https://arxiv.org/abs/2307.15051 arXiv |
|
EHRAgent: Code Empowers Large Language Models for Few-shot Complex Tabular Reasoning on Electronic Health Records |
https://arxiv.org/abs/2401.07128 |
| Decision Support Agents |
Reinforcing Clinical Decision Support through Multi-Agent Systems |
https://arxiv.org/abs/2504.03699 arXiv |
| Considerations |
Emerging Cyber Attack Risks of Medical AI Agents |
https://arxiv.org/abs/2504.03759 |