OpenAI calls DeepSeek “state-controlled” and recommends that the US ban “PRC-produced equipment and models that violate user privacy and create security risks”
https://techcrunch.com/... Threads: Vishvanand Subramanian / @vishvanands : trying hard to steelman this position from openai but unless it's possible to hide malware in the model weights, what exactl...
In a deposition for the Kadrey v. Meta copyright case, Mark Zuckerberg seemingly cited YouTube to justify Meta's use of copyrighted data for AI training
Meta CEO Mark Zuckerberg appears to have used YouTube and its battle to take down pirated content to defend his own company's use …
Sources: OpenAI's GPT-5, codenamed Orion, is behind schedule and faces technical hurdles, including high computing costs and limited high-quality training data
OpenAI has run into problem after problem on its new artificial-intelligence project, code-named Orion Bluesky: @allytibbitt.me , @tomashirstecon , @columnist , @madamehardy , @dawnnafus , @seed-corn-...
Suchir Balaji, who spent four years at OpenAI, says OpenAI's use of copyrighted data violated the law and ChatGPT damages the internet; he left in August 2024
Suchir Balaji spent nearly four years as an artificial intelligence researcher at OpenAI. Among other projects …
Fairly Trained certifies KL3M, an LLM legal tech consultancy startup 273 Ventures claims to have built without the permissionless use of copyrighted materials
OpenAI claimed it's “impossible” to build good AI models without using copyrighted data. An “ethically created” …
OpenAI responds to The New York Times' lawsuit: training is fair use and there is an opt-out, “regurgitation” is a rare bug, and NYT “manipulated” its models
written evidence (LLM0113) Dan Milmo / The Guardian : ‘Impossible’ to create AI tools like ChatGPT without copyrighted material, OpenAI says Ryan Daws / AI News : Copyrighted data ‘impossible’ to avo...
Brave appears to be selling copyrighted data for AI training and giving third parties the “rights” to that data, while not disclosing its own robot crawler
And even though there are some concerns about the type of data that was used [...] — I'm fairly certain … Mastodon: @the_turtle@mastodon.sdf.org , @derekmceachern@infosec … , and @carnage4life@mas.t...
Experts say it is not clear whether generative art made by AI systems trained on copyrighted data, like OpenAI's DALL-E 2, can be considered as fair use
but not the most important ones relating to the use of existing © works to train AI. https://www.wired.com/... @maasgad : “Is it right that the AIs of the future are able to produce something magical ...