silasalberti · TEXXR

super excited to finally release SWE-1.5 - a frontier-scale model (~hundreds of billions of params) running at insane speeds (up to 950 tok/s) - outperforms GPT-5-High on SWE-Bench-Pro - 13x faster than Sonnet 4.5, 6x faster than Haiku 4.5 - more than double the benchmark perf

2025-10-31 View on X

Cognition

Cognition releases SWE-1.5, a new coding model in Windsurf, saying it partnered with Cerebras to serve SWE-1.5 at speeds up to 13x faster than Claude Sonnet 4.5

lots of new paradigms/UX to figure out. @cognition : Today we're releasing SWE-1.5, our fast agent model. It achieves near-SOTA coding performance while setting a new standard for ...

View original

Devin + Windsurf are stronger together! At Cognition we've been focused on building the best cloud agent - but the IDE was always a missing piece. The Windsurf team built the first agentic IDE & scaled it to hundreds of thousands of users - but wanted to add a cloud agent.

2025-07-15 View on X

New York Times

Cognition acquires Windsurf, saying it will waive 100% of Windsurf employees' vesting cliffs and give them “fully accelerated vesting for their work to date”

techcrunch.com/2025/07/14/c... [embedded post] Dare Obasanjo / @carnage4life : The AI acquisition moves fast. First, OpenAI ditched its Windsurf deal after clashing with Microsof...

View original

Have always been a fan of 4o-mini. It's been unbeaten in its price class for many months now. So we've been super excited for 4.1-mini & 4.1-nano! Impressions from early testing: 4.1-mini = first model at that price point that can coherently execute agentic tasks. Much better

2025-04-15 View on X

TechCrunch

OpenAI releases GPT‑4.1, GPT‑4.1 mini, and GPT‑4.1 nano, which excel at coding, instruction following, and long context understanding, available via its API

OpenAI on Monday launched a new family of models called GPT-4.1. Yes, “4.1” — as if the company's nomenclature wasn't confusing enough already.

View original

Wow we just ran Gemini 2.5 Pro on our evals and it got a new state of the art. Congrats to the Gemini team! Sharing preliminary results here and working on bringing it into Devin: [image]

2025-03-31 View on X

9to5Google

Google says it is rolling out Gemini 2.5 Pro Experimental to all Gemini users, after initially launching the model for Gemini Advanced subscribers on March 25

No Subscription Needed The Economic Times : Google rolls out experimental version of Gemini 2.5 Pro for free users The Hindu : Google's Gemini 2.5 Pro rolled out to all users days ...

View original

o3 scores a 2727 ELO on Codeforces which places it 175th in the global ranking. That's better than ~99.9% of humans on the website (who already tend to be far above average). [image]

2024-12-22 View on X

TechCrunch

OpenAI unveils o3 and o3-mini, trained to “think” before responding via what OpenAI calls a “private chain of thought”, and plans to launch them in early 2025

12 Days of OpenAI: Day 12 Naomi Li Gan / Tech in Asia : OpenAI unveils AI model for advanced reasoning Bojan Stojkovski / Interesting Engineering : OpenAI unveils o3 reasoning AI m...

View original

o3 scores a 2727 ELO on Codeforces which places it 175th in the global ranking. That's better than ~99.9% of humans on the website (who already tend to be far above average). [image]

2024-12-21 View on X

TechCrunch

OpenAI unveils o3 and o3-mini, trained to “think” before responding via what OpenAI calls a “private chain of thought”, and plans to launch them in early 2025

OpenAI announced its new o3 models on Friday. — In a tweet ahead of its final livestream for its …

View original

Excited to finally share our work on cognition-golden and our collaboration with OpenAI building on the new o1 models! The model's intellectual honesty and thorough approach to problem solving deeply fascinated the mathematician in me

2024-09-13 View on X

The Verge

OpenAI releases o1, the first of its rumored reasoning-focused Strawberry models, in preview, alongside a smaller o1-mini, for ChatGPT Plus and Team subscribers

Advancing cost-efficient reasoning. — Contributions Sabrina Ortiz / ZDNET : OpenAI trained its new o1 AI models to think before they speak - how to access them Ethan Mollick / On...

View original

Excited to finally share our work on cognition-golden and our collaboration with OpenAI building on the new o1 models! The model's intellectual honesty and thorough approach to problem solving deeply fascinated the mathematician in me

2024-09-13 View on X

Simon Willison's Weblog

OpenAI's o1 models aren't as simple as the next step up from GPT-4o as they introduce major cost and performance trade-offs in exchange for improved “reasoning”

OpenAI released two major new preview models today: o1-preview and o1-mini (that mini one is also a preview …

View original