Meta launches Llama 4 Maverick with 400B parameters and Scout with 109B parameters and a 10M context window, and previews Behemoth with 2T total parameters
Takeaways — We're sharing the first models in the Llama 4 herd, which will enable people to build more personalized multimodal experiences.
Meta launches Llama 4 Maverick with 400B parameters and Scout with 109B parameters and a 10M context window, and previews Behemoth with 2T total parameters
Takeaways — We're sharing the first models in the Llama 4 herd, which will enable people to build more personalized multimodal experiences.
Meta launches Llama 4 Maverick with 400B parameters and Scout with 109B parameters and a 10M context window, and previews Behemoth with 2T total parameters
Takeaways — We're sharing the first models in the Llama 4 herd, which will enable people to build more personalized multimodal experiences.
MIT spinoff Liquid AI debuts its non-transformer AI models LFM-1B, LFM-3B, and LFM-40B MoE, claiming they achieve “state-of-the-art performance at every scale”
Liquid AI, a startup co-founded by former researchers from the Massachusetts Institute of Technology (MIT) …
HyperWrite CEO unveils Reflection 70B, based on Llama 3.1 70B Instruct and trained using reflection-tuning, and says it beats GPT-4o in all benchmarks tested
There's a new king in town: Matt Shumer, co-founder and CEO of AI writing startup HyperWrite, today unveiled Reflection 70B …
Mark Zuckerberg argues that “open source AI” is the path forward, closed models are vulnerable to vendor lock-in and state-backed espionage, and more
RE: https://www.threads.net/... Dare Obasanjo / @carnage4life : You can find @zuck's full post here https://www.facebook.com/... Dare Obasanjo / @carnage4life : Mark Zuckerberg has...
Meta details Llama 3: 8B- and 70B-parameter models, a focus on reducing false refusals, and an upcoming model trained on 15T+ tokens that has 400B+ parameters
What To Know About ‘Llama 3’ Model Marcus Gopolang Moloko / Memeburn : Meta AI with built in Llama 3 is on WhatsApp in South Africa Hamsat Abdurasheed / News.ng : Meta releases Lla...
AI21 Labs launches Jamba, an AI model that integrates two architectures: transformer and Mamba, which is based on the Structured State Space model
Increasingly, the AI industry is moving toward generative AI models with longer contexts. But models with large context windows tend to be compute-intensive.