| Category | Title | Link |
| --- | --- | --- |
| Simulation & Benchmarks | AgentClinic: A Multimodal Agent Benchmark to Evaluate AI in Simulated Clinical Environments | https://arxiv.org/abs/2405.07960 |
| | MedAgentBench: A Virtual EHR Environment to Benchmark Medical LLM Agents | https://arxiv.org/abs/2501.14654 |
| | 3MDBench: Medical Multimodal Multi-agent Dialogue Benchmark | https://arxiv.org/abs/2504.13861 |
| | Agent Hospital: A Simulacrum of Hospital with Evolvable Medical Agents | https://arxiv.org/abs/2405.02957 |
| | MedAgentSim (Self-Evolving Multi-Agent Simulations for Realistic Clinical Interactions) | https://arxiv.org/abs/2503.22678 |
| | LLM-based Agent Simulation for Maternal Health Interventions | https://arxiv.org/abs/2503.22719 |
| Frameworks & Architectures | MMedAgent: Learning to Use Medical Tools with a Multi-modal Agent | https://arxiv.org/abs/2407.02483 |
| | Coordinated AI agents for advancing healthcare | https://www.nature.com/articles/s41551-025-01363-2 |
| | MeNTi: Bridging Medical Calculator and LLM Agent with Nested Tool Calling | https://arxiv.org/abs/2410.13610 |
| | M³Builder: A Multi-Agent System for Automated Machine Learning in Medical Imaging | https://arxiv.org/abs/2502.20301 |
| | ClinicalAgent: Clinical Trial Multi-Agent System with LLM-based Reasoning | https://arxiv.org/abs/2404.14777 |
| | Multi-Agent Medical Assistant (general-purpose, on-device) | https://github.com/souvikmajumder26/Multi-Agent-Medical-Assistant |
| | IMAS: Agentic System for Rural Healthcare Delivery | https://github.com/uheal/imas |
| Tool-centric Applications | TxAgent: An AI Agent for Therapeutic Reasoning Across a Universe of Tools | https://arxiv.org/abs/2503.10970 |
| | MedAgent-Pro: Towards Multi-modal Evidence-based Medical Diagnosis via Reasoning Agentic Workflow | https://arxiv.org/abs/2503.18968 |
| | TrialGPT: Matching Patients to Clinical Trials with Large Language Models | https://arxiv.org/abs/2307.15051 |
| | EHRAgent: Code Empowers Large Language Models for Few-shot Complex Tabular Reasoning on Electronic Health Records | https://arxiv.org/abs/2401.07128 |
| Decision Support Agents | Reinforcing Clinical Decision Support through Multi-Agent Systems | https://arxiv.org/abs/2504.03699 |
| Considerations | Emerging Cyber Attack Risks of Medical AI Agents | https://arxiv.org/abs/2504.03759 |