Anthropic updates its Responsible Scaling Policy, including separating the safety commitments it'll make unilaterally and its recommendations for the industry
Anthropic's initial Risk Report under its new RSP: “We believe that AI models could, in the next few years, have a broad range of capabilities that exceed human capabilities. In particular, most or all of the work needed to advance research and development in key domains - from […
We're now separating the safety commitments we'll make unilaterally and our recommendations for the industry. We're also committing to publish new Frontier Safety Roadmaps with detailed safety goals, and Risk Reports that quantify risk across all our deployed models.
We're updating our Responsible Scaling Policy to its third version. Since it came into effect in 2023, we've learned a lot about the RSP's benefits and its shortcomings. This update improves the policy, reinforcing what worked and committing us to even greater transparency.
I can't completely fault the “unilateral disarmament won't fix the industry” line of reasoning, but that's quite a walk-back for the Amodeis. #AI “safety” was supposed to be their raison d'etre... [embedded post]