Emergency departments and other clinical settings across the world are now one step closer to sounding like the cockpit of the Millennium Falcon —with human doctors soliciting advice from, bickering with, and not infrequently trusting the guidance of their opinionated AI colleagues. Researchers at Harvard and Boston’s Beth Israel Deaconess Medical Center have successfully tested an advanced large language model (LLM) AI against two attending physicians (humans) in their performance diagnosing incoming emergency room patients at the triage phase. The LLM, OpenAI’s first so-called “reasoning” model o1-preview , made the correct call in 67.1% of the 76 actual emergency department cases put to it, with what the researchers called “exact or a very close” diagnostic accuracy in the new study , published today in the journal Science.…