Research & Papers May 4 theguardian.com

OpenAI's o1 AI diagnoses 67% of ER patients vs doctors' 50-55%

Original: OpenAI's o1 correctly diagnosed 67% of ER patients vs. 50-55% by triage doctors

Why This Matters

Demonstrates AI's potential to improve critical medical decision-making in time-sensitive emergency situations.

Harvard study shows OpenAI's o1 reasoning model correctly diagnosed 67% of emergency room patients compared to 50-55% accuracy by human triage doctors. The AI particularly excelled in high-pressure situations requiring rapid decisions with minimal patient information.

A groundbreaking Harvard study published in Science journal found OpenAI's o1 AI system outperformed human doctors in emergency medicine triage. Testing 76 patients at a Boston hospital, the AI achieved 67% diagnostic accuracy using standard electronic health records, while human doctors reached only 50-55%. The AI's advantage was most pronounced in rapid triage situations with limited information. When more detailed patient data was available, the AI's accuracy increased to 82% compared to doctors' 70-79%, though this difference wasn't statistically significant. The AI also outperformed doctors in providing longer-term treatment plans. Independent experts described the results as 'a genuine step forward' in AI clinical reasoning, with researchers stating large language models 'have eclipsed most benchmarks of clinical reasoning.'

Source

theguardian.com — Read original →

OpenAI's o1 AI diagnoses 67% of ER patients vs doctors' 50-55%

Why This Matters

Source

Related articles

Sign in to listen