2025-12-07
cracked team and a 🐐 shipped an s-tier open-source model
Essential AI
Essential AI, whose CEO co-wrote Google's Attention Is All You Need paper, unveils Rnj-1, an 8B-parameter open model with SWE-bench performance close to GPT-4o
The long-term advancement and equitable diffusion of AI technologies crucially depend on their development in the Open.
2023-10-09
Linear probes are a classic way of grounding distributed representations! They were famously used as an evaluation protocol for unsupervised representation learning methods like CPC, SimCLR, etc. @gyomalin_ML wrote about it in 2016: https://arxiv.org/...
Anthropic
A research paper details how decomposing groups of neural network neurons into “interpretable features” may improve safety by enabling the monitoring of LLMs
Neural networks are trained on data, not programmed to follow rules. With each step of training …
2023-10-08
Linear probes are a classic way of grounding distributed representations! They were famously used as an evaluation protocol for unsupervised representation learning methods like CPC, SimCLR, etc. @gyomalin_ML wrote about it in 2016: https://arxiv.org/...
Anthropic
A research paper details how decomposing groups of neurons in a neural network into interpretable “features” may improve safety by enabling monitoring of LLMs
Neural networks are trained on data, not programmed to follow rules. With each step of training …