While large language models (LLMs) have shown promise in diagnostic dialogue1, their capabilities for effective management reasoning—including disease progression, therapeutic response, and safe medication prescription—remain under-explored. We advance the previously demonstrated diagnostic capabilities of the Articulate Medical Intelligence Explorer (AMIE)1−3 through a new LLM-based agentic system optimized for multi-visit clinical management and dialogue. To ground its reasoning in authoritative clinical knowledge, AMIE leverages Gemini’s long-context capabilities4, combining in-context retrieval with structured reasoning to align its output with up-to-date clinical practice guidelines and drug formularies. In a randomized, blinded virtual Objective Structured Clinical Examination (OSCE) study, AMIE was compared to 21 primary care physicians (PCPs) across 100 multi-visit case scenarios designed to reflect UK NICE Guidance and BMJ Best Practice guidelines. AMIE was non-inferior to PCPs in management reasoning as assessed by specialists and scored better in both preciseness of treatments and investigations, and in its alignment with and grounding in clinical guidelines. To benchmark medication reasoning, we developed RxQA, a multiple-choice question benchmark derived from two national drug formularies (US, UK) and validated by board-certified pharmacists. Though AMIE and PCPs both benefited from the ability to access external drug information, AMIE outperformed PCPs on higher difficulty questions. While further research would be needed before real-world translation, AMIE’s strong performance across evaluations marks a significant step towards conversational AI as a tool in disease management.
Towards Conversational AI for Disease Management
Why This Matters
This advancement in conversational AI for disease management highlights its potential to support healthcare providers by offering accurate, guideline-based treatment recommendations. As AI systems like AMIE demonstrate performance comparable to primary care physicians, they could enhance clinical decision-making, improve patient outcomes, and reduce healthcare disparities. This progress signals a transformative shift towards AI-assisted disease management in the healthcare industry.
Key Takeaways
- AMIE performs on par with primary care physicians in management reasoning.
- The system aligns its recommendations with current clinical guidelines and drug formularies.
- AI tools like AMIE could augment healthcare delivery and improve patient outcomes.
Explore topics:
large language models
articular medical intelligence explorer
gemini
uk nice guidance
rxqa
Get alerts for these topics