Google releases VaultGemma, a 1B-parameter model it says is the largest open LLM trained from scratch with differential privacy, on Hugging Face and Kaggle
Amer Sinha, Software Engineer, and Ryan McKenna, Research Scientist, Google Research — We introduce VaultGemma …
OpenAI releases gpt-oss-120b and gpt-oss-20b, its first open-weight models since GPT-2; the smaller gpt-oss-20b can run locally on a device with 16GB+ of RAM
gpt-oss-120b and gpt-oss-20b push the frontier of open-weight reasoning models Simon Willison / Simon Willison's Weblog : OpenAI's new open weight (Apache 2) models are really good OpenAI on GitHub : ...
Google unveils benchmarking platform Kaggle Game Arena, where LLMs compete head-to-head in strategic games, starting with a chess tournament from August 5 to 7
Watch models compete in complex games providing a verifiable and dynamic measure of their capabilities. Kaggle : Chess Text Input Leaderboard Nick Bild / Hackster : Shall We Play a Game? Maximilian Sc...
The Wikimedia Foundation partners with Kaggle to release a dataset of “structured Wikipedia content in English and French” optimized for AI model training
Data science platform Kaggle is hosting a Wikipedia dataset that's specifically optimized for machine learning applications.
The head scientist for Alexa argues that the Turing Test, designed as a thought experiment, is no longer relevant for building AIs that are designed to help humans
and that the concept of machines being indistinguishable from a human is out of touch.” https://www.fastcompany.com/ ... Gavin Baker / @gavinsbaker : Superb from Alexa's head scientist. I had not hear...
This year marks 70 years since Alan Turing published his paper introducing the concept of the Turing Test in response to the question, “Can machines think?” Tweets: @gavinsbaker, @edbott, and @carna...
Google unveils Dataset Search, a search engine that will cover datasets from environmental and social sciences, government, and ProPublica-style news orgs
covers a variety of public and commercial domains, e.g. “police shootings” results include Kaggle, city portals, @datadotworld, and ICPSR: http://toolbox.google.com/... http://twitter.com/... Paul Ked...
Developer scraped 40K Tinder selfies from Bay Area and uploaded dataset to machine-learning platform Kaggle; Tinder says terms of service violated
Natasha Lomas / TechCrunch :