VOICE ARCHIVE

Silas Alberti

@silasalberti
8 posts
2025-10-31
super excited to finally release SWE-1.5 - a frontier-scale model (~hundreds of billions of params) running at insane speeds (up to 950 tok/s) - outperforms GPT-5-High on SWE-Bench-Pro - 13x faster than Sonnet 4.5, 6x faster than Haiku 4.5 - more than double the benchmark perf
2025-10-31 View on X
Cognition

Cognition releases SWE-1.5, a new coding model in Windsurf, saying it partnered with Cerebras to serve SWE-1.5 at speeds up to 13x faster than Claude Sonnet 4.5

lots of new paradigms/UX to figure out. @cognition : Today we're releasing SWE-1.5, our fast agent model. It achieves near-SOTA coding performance while setting a new standard for ...

2025-07-15
Devin + Windsurf are stronger together! At Cognition we've been focused on building the best cloud agent - but the IDE was always a missing piece. The Windsurf team built the first agentic IDE & scaled it to hundreds of thousands of users - but wanted to add a cloud agent.
2025-07-15 View on X
New York Times

Cognition acquires Windsurf, saying it will waive 100% of Windsurf employees' vesting cliffs and give them “fully accelerated vesting for their work to date”

techcrunch.com/2025/07/14/c...  [embedded post] Dare Obasanjo / @carnage4life : The AI acquisition moves fast.  First, OpenAI ditched its Windsurf deal after clashing with Microsof...

2025-04-15
Have always been a fan of 4o-mini. It's been unbeaten in its price class for many months now. So we've been super excited for 4.1-mini & 4.1-nano! Impressions from early testing: 4.1-mini = first model at that price point that can coherently execute agentic tasks. Much better
2025-04-15 View on X
TechCrunch

OpenAI releases GPT‑4.1, GPT‑4.1 mini, and GPT‑4.1 nano, which excel at coding, instruction following, and long context understanding, available via its API

OpenAI on Monday launched a new family of models called GPT-4.1.  Yes, “4.1” — as if the company's nomenclature wasn't confusing enough already.

2025-03-31
Wow we just ran Gemini 2.5 Pro on our evals and it got a new state of the art. Congrats to the Gemini team! Sharing preliminary results here and working on bringing it into Devin: [image]
2025-03-31 View on X
9to5Google

Google says it is rolling out Gemini 2.5 Pro Experimental to all Gemini users, after initially launching the model for Gemini Advanced subscribers on March 25

No Subscription Needed The Economic Times : Google rolls out experimental version of Gemini 2.5 Pro for free users The Hindu : Google's Gemini 2.5 Pro rolled out to all users days ...

2024-12-22
o3 scores a 2727 ELO on Codeforces which places it 175th in the global ranking. That's better than ~99.9% of humans on the website (who already tend to be far above average). [image]
2024-12-22 View on X
TechCrunch

OpenAI unveils o3 and o3-mini, trained to “think” before responding via what OpenAI calls a “private chain of thought”, and plans to launch them in early 2025

12 Days of OpenAI: Day 12 Naomi Li Gan / Tech in Asia : OpenAI unveils AI model for advanced reasoning Bojan Stojkovski / Interesting Engineering : OpenAI unveils o3 reasoning AI m...

2024-12-21
o3 scores a 2727 ELO on Codeforces which places it 175th in the global ranking. That's better than ~99.9% of humans on the website (who already tend to be far above average). [image]
2024-12-21 View on X
TechCrunch

OpenAI unveils o3 and o3-mini, trained to “think” before responding via what OpenAI calls a “private chain of thought”, and plans to launch them in early 2025

OpenAI announced its new o3 models on Friday.  —  In a tweet ahead of its final livestream for its …

2024-09-13
Excited to finally share our work on cognition-golden and our collaboration with OpenAI building on the new o1 models! The model's intellectual honesty and thorough approach to problem solving deeply fascinated the mathematician in me
2024-09-13 View on X
The Verge

OpenAI releases o1, the first of its rumored reasoning-focused Strawberry models, in preview, alongside a smaller o1-mini, for ChatGPT Plus and Team subscribers

Advancing cost-efficient reasoning.  —  Contributions Sabrina Ortiz / ZDNET : OpenAI trained its new o1 AI models to think before they speak - how to access them Ethan Mollick / On...

Excited to finally share our work on cognition-golden and our collaboration with OpenAI building on the new o1 models! The model's intellectual honesty and thorough approach to problem solving deeply fascinated the mathematician in me
2024-09-13 View on X
Simon Willison's Weblog

OpenAI's o1 models aren't as simple as the next step up from GPT-4o as they introduce major cost and performance trade-offs in exchange for improved “reasoning”

OpenAI released two major new preview models today: o1-preview and o1-mini (that mini one is also a preview …