2025-03-20
The AI Scientist Generates its First Peer-Reviewed Scientific Publication We're proud to announce that a paper produced by The AI Scientist-v2 passed the peer-review process at a workshop in ICLR, a top AI conference. Read more about this experiment → https://sakana.ai/... [image]
TechCrunch
AI startups Intology and Autoscience submitted AI-generated studies at a conference without disclosure and face criticism of co-opting peer review for publicity
Kyle Wiggers / TechCrunch : X: @intologyai , @pandaashwinee , @intologyai , @tuhinchakr , @sakanaailabs , @autoscienceai , @autoscienceai , and @dorialexander X: @intologyai : Zoc...
2025-02-22
Update: Combining evolutionary optimization with LLMs is powerful but can also find ways to trick the verification sandbox. We are fortunate to have readers, like @main_horse test our CUDA kernels, to identify that the system had found a way to “cheat”. For example, the system
TechCrunch
Sakana AI walks back claims that its new AI CUDA Engineer can speed up AI training by up to 100x, after complaints about worse-than-average training performance
Kyle Wiggers / TechCrunch :
2025-02-20
Introducing The AI CUDA Engineer: An agentic AI system that automates the production of highly optimized CUDA kernels. https://sakana.ai/... The AI CUDA Engineer can produce highly optimized CUDA kernels, reaching 10-100x speedup over common machine learning operations in [video]
Nikkei Asia
Tokyo-based Sakana AI details its AI CUDA Engineer, which it says can speed up AI training and inference by 10x to 100x by “breeding” efficient instructions
TOKYO — Tokyo-based startup Sakana AI says it has developed a system capable of accelerating artificial intelligence development …
2024-03-21
Training foundation models require enormous resources. We can overcome this by working with the vast collective intelligence of existing models. @HuggingFace has over 500k models in dozens of modalities that, in principle, can be combined to form new models with new capabilities! [image]
Reuters
Tokyo-based Sakana AI, founded by two Google researchers, releases three Japanese language models built using “model merging”, which combines existing AI models
Sakana AI, a Tokyo-based artificial intelligence startup founded by two prominent former Google (GOOGL.O) researchers …
@huggingface As a 🇯🇵 AI lab, we wanted to apply our method to produce foundation models for Japan. We were able to quickly evolve 3 best-in-class models with language, vision and image generation capabilities, tailored for Japan and its culture. Read more in our paper https://arxiv.org/... [image]
Reuters
Tokyo-based Sakana AI, founded by two Google researchers, releases three Japanese language models built using “model merging”, which combines existing AI models
Sakana AI, a Tokyo-based artificial intelligence startup founded by two prominent former Google (GOOGL.O) researchers …
Introducing Evolutionary Model Merge: A new approach bringing us closer to automating foundation model development. We use evolution to find great ways of combining open-source models, building new powerful foundation models with user-specified abilities! https://sakana.ai/... [video]
Reuters
Tokyo-based Sakana AI, founded by two Google researchers, releases three Japanese language models built using “model merging”, which combines existing AI models
Sakana AI, a Tokyo-based artificial intelligence startup founded by two prominent former Google (GOOGL.O) researchers …