jjitsev · TEXXR

(Yet) another tale of Rise and Fall: DeepSeek R1 is claimed to match o1/o1-preview on olympiad level math & coding problems. Can it handle versions of AIW problems that reveal generalization & basic reasoning deficits in SOTA LLMs? ( https://arxiv.org/...) 🧵1/n [image]

2025-01-26 View on X

MIT Technology Review

Rather than weakening China's AI capabilities, US sanctions appear to be driving startups like DeepSeek to innovate by prioritizing efficiency and collaboration

The AI community is abuzz over DeepSeek R1, a new open-source reasoning model. — The model was developed by the Chinese AI startup DeepSeek …

View original

(Yet) another tale of Rise and Fall: DeepSeek R1 is claimed to match o1/o1-preview on olympiad level math & coding problems. Can it handle versions of AIW problems that reveal generalization & basic reasoning deficits in SOTA LLMs? ( https://arxiv.org/...) 🧵1/n [image]

2025-01-26 View on X

Financial Times

Industry insiders say DeepSeek's focus on research makes it a dangerous competitor as it's willing to share breakthroughs rather than protect them for profits

China is pulling the same trick. — www.ft.com/content/747a... Mastodon: Brian Kung / @briankung@hachyderm.io : “There's a pretty delicious, or maybe disconcerting irony to this, ...

View original

LAION-5B is important reference research dataset for reproducible language-vision foundation models studies. We release Re-LAION-5B as a transparent safety iteration on LAION-5B which fixes issues and allows broad research community to continue using open datasets as reference🧵

2024-08-31 View on X

TechCrunch

LAION, a research org whose dataset was used to train Stable Diffusion and other models, releases a new dataset it claims has been “thoroughly cleaned” of CSAM

LAION, the German research org that created the data used to train Stable Diffusion, among other generative AI models …

View original

Re-LAION-5B fixes the issues reported by Stanford Internet Observatory (SIO) in December 2023 for original LAION-5B. In cooperation with IWF, C3P and David Thiel (SIO), all 1008 links to suspected CSAM in the report are removed from LAION-5B metadata, using safe hash lists.

2024-08-31 View on X

TechCrunch

LAION, a research org whose dataset was used to train Stable Diffusion and other models, releases a new dataset it claims has been “thoroughly cleaned” of CSAM

LAION, the German research org that created the data used to train Stable Diffusion, among other generative AI models …

View original

For the Re-LAION release we remove a much larger pool of ca. 65 million of various neutral links, in addition to 2236 suspected links. This way, none of the suspected links can be identified by a diff between Re-LAION and any older versions of LAION-5B.

2024-08-31 View on X

TechCrunch

LAION, a research org whose dataset was used to train Stable Diffusion and other models, releases a new dataset it claims has been “thoroughly cleaned” of CSAM

LAION, the German research org that created the data used to train Stable Diffusion, among other generative AI models …

View original