Wie gut sind „KI-Ärzte“ – und werden sie die Medizin übernehmen?

Researchers are exploring the potential of AI in medicine, with recent studies highlighting both progress and limitations. In April, a study in *Science* revealed that OpenAI’s advanced large language model, o1, developed in San Francisco, diagnosed emergency department cases more accurately than doctors at a Boston hospital. The AI achieved 67% accuracy, while human physicians scored between 50–55%, using real-world patient data instead of simulated scenarios. Another study, posted on arXiv in March, tested Google’s AI system, AMIE, which communicated with patients via text messages before their clinic visits. AMIE correctly identified diagnoses in 75% of cases, with the top suggestion matching the final diagnosis in 56% of instances, comparable to human doctors. However, human clinicians provided more practical and cost-effective treatment plans than AMIE. Experts caution that AI cannot yet replace physicians, as medicine involves complex, unpredictable patient interactions. Harvard Medical School’s David Wu noted that AI struggles with real-world variability, where patients often present non-textbook symptoms. While AI excels at structured tasks like note-taking and prescription renewals, its ability to handle nuanced medical scenarios remains unproven. The studies mark progress in AI’s role in healthcare, but researchers emphasize the need for further development before widespread adoption. AI tools like o1 and AMIE demonstrate potential in diagnostics, yet their practical integration into clinical workflows requires addressing gaps in patient interaction and treatment planning.

How good are ‘AI doctors’ — and will they take over medicine?

Comments (0)