sakanaailabs · TEXXR

The AI Scientist Generates its First Peer-Reviewed Scientific Publication We're proud to announce that a paper produced by The AI Scientist-v2 passed the peer-review process at a workshop in ICLR, a top AI conference. Read more about this experiment → https://sakana.ai/... [image]

2025-03-20 View on X

TechCrunch

AI startups Intology and Autoscience submitted AI-generated studies at a conference without disclosure and face criticism of co-opting peer review for publicity

Kyle Wiggers / TechCrunch : X: @intologyai , @pandaashwinee , @intologyai , @tuhinchakr , @sakanaailabs , @autoscienceai , @autoscienceai , and @dorialexander X: @intologyai : Zoc...

View original

Update: Combining evolutionary optimization with LLMs is powerful but can also find ways to trick the verification sandbox. We are fortunate to have readers, like @main_horse test our CUDA kernels, to identify that the system had found a way to “cheat”. For example, the system

2025-02-22 View on X

TechCrunch

Sakana AI walks back claims that its new AI CUDA Engineer can speed up AI training by up to 100x, after complaints about worse-than-average training performance

Kyle Wiggers / TechCrunch :

View original

Introducing The AI CUDA Engineer: An agentic AI system that automates the production of highly optimized CUDA kernels. https://sakana.ai/... The AI CUDA Engineer can produce highly optimized CUDA kernels, reaching 10-100x speedup over common machine learning operations in [video]

2025-02-20 View on X

Nikkei Asia

Tokyo-based Sakana AI details its AI CUDA Engineer, which it says can speed up AI training and inference by 10x to 100x by “breeding” efficient instructions

TOKYO — Tokyo-based startup Sakana AI says it has developed a system capable of accelerating artificial intelligence development …

View original

Training foundation models require enormous resources. We can overcome this by working with the vast collective intelligence of existing models. @HuggingFace has over 500k models in dozens of modalities that, in principle, can be combined to form new models with new capabilities! [image]

2024-03-21 View on X

Reuters

Tokyo-based Sakana AI, founded by two Google researchers, releases three Japanese language models built using “model merging”, which combines existing AI models

Sakana AI, a Tokyo-based artificial intelligence startup founded by two prominent former Google (GOOGL.O) researchers …

View original

@huggingface As a 🇯🇵 AI lab, we wanted to apply our method to produce foundation models for Japan. We were able to quickly evolve 3 best-in-class models with language, vision and image generation capabilities, tailored for Japan and its culture. Read more in our paper https://arxiv.org/... [image]

2024-03-21 View on X

Reuters

Tokyo-based Sakana AI, founded by two Google researchers, releases three Japanese language models built using “model merging”, which combines existing AI models

Sakana AI, a Tokyo-based artificial intelligence startup founded by two prominent former Google (GOOGL.O) researchers …

View original

Introducing Evolutionary Model Merge: A new approach bringing us closer to automating foundation model development. We use evolution to find great ways of combining open-source models, building new powerful foundation models with user-specified abilities! https://sakana.ai/... [video]

2024-03-21 View on X

Reuters

Tokyo-based Sakana AI, founded by two Google researchers, releases three Japanese language models built using “model merging”, which combines existing AI models

Sakana AI, a Tokyo-based artificial intelligence startup founded by two prominent former Google (GOOGL.O) researchers …

View original