2025-01-26
(Yet) another tale of Rise and Fall: DeepSeek R1 is claimed to match o1/o1-preview on olympiad-level math & coding problems. Can it handle versions of AIW problems that reveal generalization & basic reasoning deficits in SOTA LLMs? (https://arxiv.org/...) 🧵1/n
MIT Technology Review
Rather than weakening China's AI capabilities, US sanctions appear to be driving startups like DeepSeek to innovate by prioritizing efficiency and collaboration
The AI community is abuzz over DeepSeek R1, a new open-source reasoning model. — The model was developed by the Chinese AI startup DeepSeek …
Financial Times
Industry insiders say DeepSeek's focus on research makes it a dangerous competitor as it's willing to share breakthroughs rather than protect them for profits
China is pulling the same trick. — www.ft.com/content/747a... Mastodon: Brian Kung / @briankung@hachyderm.io : “There's a pretty delicious, or maybe disconcerting irony to this, ...
2024-08-31
LAION-5B is an important reference research dataset for reproducible studies of language-vision foundation models. We release Re-LAION-5B as a transparent safety iteration on LAION-5B that fixes issues and allows the broad research community to continue using open datasets as a reference 🧵
TechCrunch
LAION, a research org whose dataset was used to train Stable Diffusion and other models, releases a new dataset it claims has been “thoroughly cleaned” of CSAM
LAION, the German research org that created the data used to train Stable Diffusion, among other generative AI models …
Re-LAION-5B fixes the issues that the Stanford Internet Observatory (SIO) reported in December 2023 for the original LAION-5B. In cooperation with IWF, C3P and David Thiel (SIO), all 1008 links to suspected CSAM in the report have been removed from the LAION-5B metadata using safe hash lists.
For the Re-LAION release we remove a much larger pool of ca. 65 million neutral links of various kinds, in addition to the 2236 suspected links. This way, none of the suspected links can be identified by a diff between Re-LAION and any older version of LAION-5B.