clementdelangue

The Department of War just learned the golden rule of AI: Not your weights, not your brain 🧠🔒

2026-02-27 View on X

Axios

Anthropic says new DOD “contract language” made “virtually no progress” on preventing Claude's use for mass domestic surveillance or fully autonomous weapons

Anthropic CEO Dario Amodei on Thursday said there has been “virtually no progress” on negotiations with the Pentagon.

View original

The Department of War just learned the golden rule of AI: Not your weights, not your brain 🧠🔒

2026-02-27 View on X

Anthropic

Dario Amodei says Anthropic cannot “in good conscience” accede to DOD's request to remove safeguards and will work to ensure a smooth transition if offboarded

I believe deeply in the existential importance of using AI to defend the United States and other democracies, and to defeat our autocratic adversaries.

View original

The Department of War just learned the golden rule of AI: Not your weights, not your brain 🧠🔒

2026-02-27 View on X

Axios

President Trump calls Anthropic a “radical left, woke company” and says he is directing every federal agency in the US to stop using its products

The Trump administration has decided to blacklist Anthropic in the most consequential and controversial policy decision to date …

View original

All tech companies can and should train their own models, otherwise they'll be left behind!

2025-12-13 View on X

Zoom

Zoom says its “federated AI” model, combining its SLM with open- and closed-source models, got 48.1% on Humanity's Last Exam vs. 45.8% for Gemini 3 Pro w/ tools

outperforming other AI models with a 2.3% jump over the previous SOTA. ✨ HLE is one of the most rigorous tests in AI, built to measure real expert-level knowledge and deep reasonin...

View original

As far as I know, there isn't any chatbot or API that gives you access to an IMO 2025 gold-medalist model. Not only does this change today, but you get to download the weights with the Apache 2.0 open-source release of @deepseek_ai Math-V2 on @huggingface!

2025-11-30 View on X

The Decoder

DeepSeek says its new DeepSeekMath-V2 model got gold-medal level status on the International Mathematical Olympiad 2025 and Chinese Mathematical Olympiad 2024

where models prove formal mathematical theorems—GPT-5 scores 20%. Gemini Deep Think IMO Gold hits 65.7%. DeepSeek Math V2 (Heavy) scores 61.9%. That's second place—but Gemini is...

View original

As far as I know, there isn't any chatbot or API that gives you access to an IMO 2025 gold-medalist model. Not only does this change today, but you get to download the weights with the Apache 2.0 open-source release of @deepseek_ai Math-V2 on @huggingface!

2025-11-29 View on X

The Decoder

DeepSeek says its new DeepSeekMath-V2 model got gold-medal level status on the International Mathematical Olympiad 2025 and Chinese Mathematical Olympiad 2024

Chinese startup Deepseek reports its new DeepseekMath-V2 model has reached gold medal status at the Math Olympiad …

View original

Sounds a bit sensationalized to me, but the core point is right: cyber-security teams need to understand and use AI more, which is exactly why open-source matters (also, shows that APIs aren't safer for these risks than open-weights)

2025-11-16 View on X

Ars Technica

Some experts question Anthropic's claims of cyberattack breakthroughs using its tools, noting that white-hat hackers report modest gains from AI-aided hacking

Researchers from Anthropic said they recently observed the “first reported AI-orchestrated cyber espionage campaign” …

View original

Great interview but am I the only one who feels like @dwarkesh_sp and @dylan522p sound brainwashed by big model labs? Personally loved @satyanadella's points about a lot of value being in the scaffolding, multi-model approaches, and open-source as the basis for the long-tail of

2025-11-13 View on X

Dwarkesh Podcast

Q&A with Satya Nadella on business models for AGI, Copilot, Microsoft AI, the hyperscale business, the OpenAI partnership, capex, sovereign AI efforts, and more

As part of this interview, Satya Nadella gave Dylan Patel (founder of SemiAnalysis) and me an exclusive first-look at their brand-new Fairwater 2 datacenter.

View original

The AI frontier is open-source!

2025-11-07 View on X

CNBC

Chinese startup Moonshot releases Kimi K2 Thinking, an open-weight model it claims beats GPT-5 in agentic capabilities; source: the model cost $4.6M to train

Chinese startup Moonshot on Thursday released its latest generative artificial intelligence model which claims to beat OpenAI's ChatGPT in …

View original

We're finally reaching the era of everyone training their own models based on open-source (versus relying on black box generalist APIs) and it is glorious!

2025-10-31 View on X

Cognition

Cognition releases SWE-1.5, a new coding model in Windsurf, saying it partnered with Cerebras to serve SWE-1.5 at speeds up to 13x faster than Claude Sonnet 4.5

lots of new paradigms/UX to figure out. @cognition : Today we're releasing SWE-1.5, our fast agent model. It achieves near-SOTA coding performance while setting a new standard for ...

View original

Beautiful work from @sundarpichai @demishassabis and team with open weights on HF: https://huggingface.co/... I'm so excited about the application of AI for biology and chemistry, especially in the open like this for all to benefit!

2025-10-16 View on X

The Keyword

Google releases Cell2Sentence-Scale 27B (C2S-Scale), a 27B-parameter foundation model for single-cell analysis built on its Gemma family of open models

We're launching a new 27 billion parameter foundation model for single-cell analysis built on the Gemma family of open models.

View original

It's easier than ever to train, optimize and run your own models thanks to open-source (versus delegating all learning, control, capabilities to black-box APIs). Cool to see @karpathy proving it once more by leveraging @huggingface fineweb ([link])!

2025-10-14 View on X

@karpathy

Andrej Karpathy unveils nanochat, a full-stack training and inference implementation of an LLM in a single, dependency-minimal codebase, deployable in 4 hours

It provides a full ChatGPT-style LLM, including training, inference and a web Ui … X: Clem / @clementdelangue : Am I wrong in sensing a paradigm shift in AI? Feels like we're movin...

View original

Am I wrong in sensing a paradigm shift in AI? Feels like we're moving from a world obsessed with generalist LLM APIs to one where more and more companies are training, optimizing, and running their own models built on open source (especially smaller, specialized ones) Some [image]

2025-10-14 View on X

@karpathy

Andrej Karpathy unveils nanochat, a full-stack training and inference implementation of an LLM in a single, dependency-minimal codebase, deployable in 4 hours

It provides a full ChatGPT-style LLM, including training, inference and a web Ui … X: Clem / @clementdelangue : Am I wrong in sensing a paradigm shift in AI? Feels like we're movin...

View original

Very cool paper! You can discuss with the author here: https://huggingface.co/... [image]

2025-10-09 View on X

VentureBeat

Samsung introduces the Tiny Recursion Model, a 7M-parameter model that can outperform LLMs 10,000x larger, like Gemini 2.5 Pro and o3-mini, on specific problems

The trend of AI researchers developing new, small open source generative models that outperform far larger …

View original

Great to see 🇺🇸 stepping up on open weights since the @WhiteHouse AI action plan! Well done @DavidSacks @mkratsios47 @sriramk @deanwball!

2025-08-24 View on X

@elonmusk

Elon Musk says xAI has open sourced Grok 2.5 and plans to do the same for Grok 3 in about six months, with Grok 2's weights now available on Hugging Face

The @xAI Grok 2.5 model, which was our best model last year, is now open source. Grok 3 will be made open source in about 6 months. https://huggingface.co/...

View original

Tired: building AI for waifus & chatbots Wired: building AI for space exploration! Excited to introduce Surya, the first open-source AI foundation model for heliophysics, released by @NASA & @IBM on @huggingface! It's a 366M-parameter transformer model pretrained on 9 years

2025-08-21 View on X

MIT Technology Review

NASA and IBM release Surya, an open-source machine learning model trained on over a decade's worth of NASA solar data to predict solar flares and solar winds

a new foundation model designed to help researchers protect infrastructure through accessible, accurate modeling of space weather. It's going to totally change how we forecast sola...

View original

Deepseek just released a new model! https://huggingface.co/... [image]

2025-08-19 View on X

Bloomberg

DeepSeek releases V3.1, adding a longer context window, with few other details; Chinese media blames CEO Liang Wenfeng's perfectionism and bugs for R2's delay

DeepSeek announced what appeared to be an update to its older V3 artificial intelligence model on Tuesday, declaring an enhanced version ready for testing.

View original

And just like that, @OpenAI gpt-oss is now the number one trending model on @huggingface , out of almost 2M open models 🚀 People sometimes forget that they've already transformed the field: GPT-2, released back in 2019 is HF's most downloaded text-generation model ever, and Whisper has consistently ranked in the top 5 audio models...

2025-08-06 View on X

Bloomberg

Amazon plans to make OpenAI's new gpt-oss open-weight models available on Bedrock and SageMaker, the first time it has offered OpenAI's models to AWS customers

Takeaways by Bloomberg AI — Hide … Tell us how AI is shaping your news experience. Share your feedback

View original

When @sama told me at the AI summit in Paris that they were serious about releasing open-source models & asked what would be useful, I couldn't believe it. But six months of collaboration later, here it is: Welcome to OSS-GPT on @huggingface ! It comes in two sizes, for both maximum reasoning capabilities & on-device, cheaper, faster option, all apache 2.0...

2025-08-06 View on X

Bloomberg

Amazon plans to make OpenAI's new gpt-oss open-weight models available on Bedrock and SageMaker, the first time it has offered OpenAI's models to AWS customers

Takeaways by Bloomberg AI — Hide … Tell us how AI is shaping your news experience. Share your feedback

View original

And just like that, @OpenAI gpt-oss is now the number one trending model on @huggingface , out of almost 2M open models 🚀 People sometimes forget that they've already transformed the field: GPT-2, released back in 2019 is HF's most downloaded text-generation model ever, and Whisper has consistently ranked in the top 5 audio models...

2025-08-06 View on X

Wired

OpenAI releases gpt-oss-120b and gpt-oss-20b, its first open-weight models since GPT-2; the smaller gpt-oss-20b can run locally on a device with 16GB+ of RAM

gpt-oss-120b and gpt-oss-20b push the frontier of open-weight reasoning models Simon Willison / Simon Willison's Weblog : OpenAI's new open weight (Apache 2) models are really good...

View original