aj_kourabi · TEXXR

HBM as a % total cost is increasing. Solution? Make a cheap, highly performant chip with no HBM for compute bound workflows Hard not to admire Nvidia's ingenuity with this one [image]

2025-09-15 View on X

SemiAnalysis

A deep dive into Nvidia's Rubin CPX chip architecture, which is optimized for the prefill phase of inference, emphasizing compute FLOPS over memory bandwidth

the Rubin CTX is for inference, and it's a beast — sold as a rack as the unit, this one optimizes various LLM phases into silicon — notably: it does prefill in fp4 with low mem...

View original

important piece covering the critical but not impossible to overcome challenges and what a large opportunity this unlocks i helped make this happen and feel very proud of the work we did and the speed at which we did it

2025-05-18 View on X

SemiAnalysis

AI deals that the US struck with the UAE and Saudi Arabia reinforce its AI leadership, but raise security concerns that the US can mitigate through safeguards

5 GW Datacenter, HUMAIN, G42, Diversion and Misuse Risks, Security Requirements, American AI Wins

View original

o3 does 25.2% on Frontier Math. Previous models barely got 2%. Here are some sample questions. It is a hard eval (and unpublished). Progress is not slowing down. [image]

2024-12-22 View on X

TechCrunch

OpenAI unveils o3 and o3-mini, trained to “think” before responding via what OpenAI calls a “private chain of thought”, and plans to launch them in early 2025

12 Days of OpenAI: Day 12 Naomi Li Gan / Tech in Asia : OpenAI unveils AI model for advanced reasoning Bojan Stojkovski / Interesting Engineering : OpenAI unveils o3 reasoning AI m...

View original

o3 results robustly showcase how the piece I helped write with semianalysis is right on so many of the critical topics, take a look if you have not already https://semianalysis.com/...

2024-12-22 View on X

TechCrunch

OpenAI unveils o3 and o3-mini, trained to “think” before responding via what OpenAI calls a “private chain of thought”, and plans to launch them in early 2025

12 Days of OpenAI: Day 12 Naomi Li Gan / Tech in Asia : OpenAI unveils AI model for advanced reasoning Bojan Stojkovski / Interesting Engineering : OpenAI unveils o3 reasoning AI m...

View original

o3 does 25.2% on Frontier Math. Previous models barely got 2%. Here are some sample questions. It is a hard eval (and unpublished). Progress is not slowing down. [image]

2024-12-21 View on X

TechCrunch

OpenAI unveils o3 and o3-mini, trained to “think” before responding via what OpenAI calls a “private chain of thought”, and plans to launch them in early 2025

OpenAI announced its new o3 models on Friday. — In a tweet ahead of its final livestream for its …

View original

o3 results robustly showcase how the piece I helped write with semianalysis is right on so many of the critical topics, take a look if you have not already https://semianalysis.com/...

2024-12-21 View on X

TechCrunch

OpenAI unveils o3 and o3-mini, trained to “think” before responding via what OpenAI calls a “private chain of thought”, and plans to launch them in early 2025

OpenAI announced its new o3 models on Friday. — In a tweet ahead of its final livestream for its …

View original