/
Navigation
Chronicles
Browse all articles
Explore
Semantic exploration
Research
Entity momentum
Nexus
Correlations & relationships
Story Arc
Topic evolution
Drift Map
Semantic trajectory animation
Posts
Analysis & commentary
Pulse API
Tech news intelligence API
Browse
Entities
Companies, people, products, technologies
Domains
Browse by publication source
Handles
Browse by social media handle
Detection
Concept Search
Semantic similarity search
High Impact Stories
Top coverage by position
Sentiment Analysis
Positive/negative coverage
Anomaly Detection
Unusual coverage patterns
Analysis
Rivalry Report
Compare two entities head-to-head
Semantic Pivots
Narrative discontinuities
Crisis Response
Event recovery patterns
Connected
Search: /
Command: ⌘K
Embeddings: large
TEXXR

Chronicles

The story behind the story

days · browse · Enter similar · o open

Study: OpenAI's o1 correctly diagnosed 67% of emergency room patients using electronic records and a few sentences from nurses, vs. to 50-55% for triage doctors

Researchers say results mark a ‘profound change in technology that will reshape medicine’  —  From George Clooney in ER …

The Guardian Robert Booth

Discussion

  • Vox Dylan Scott on x
    A major new study found AI outperformed doctors in ER diagnosis — but there's a catch
  • @emollick Ethan Mollick on x
    New paper (on an old AI) tests o1 against doctors on medical benchmarks & real ER cases: “across a variety of scenarios and applications, the large language model outperformed both human physicians and older models” The potential suggests an “urgent need for prospective trials.” …
  • Gilles Frydman Gilles Frydman on linkedin
    First of several posts on the new Science paper claiming LLMs have eclipsed the benchmarks of clinical reasoning. …
  • Bill Faruki Bill Faruki on linkedin
    Harvard, in Science today: OpenAI's o1 reasoning model out-diagnosed expert ER physicians on real triage cases — 67% vs. 50-55% — and beat them on management planning 89% to 34%. …
  • @dropthet Mr. Herbert Garrison on bluesky
    FYI, this model is almost 2 years old now.  Studies take at least this long to catch up.  [embedded post]