Anthropic overhauls Claude's “constitution” to enable the AI model to generalize and apply broad principles rather than mechanically follow specific rules
Anthropic is overhauling a foundational document that shapes how its popular Claude AI model behaves.
Fortune · Beatrice Nolan
Related Coverage
- Claude's Constitution — Claude's constitution is a detailed description of Anthropic's intentions … Anthropic
- How Anthropic teaches Claude to be “good” Axios · Megan Morrone
- How Do You Teach an AI to Be Good? Anthropic Just Published Its Answer Time
- Claude's new constitution — We're publishing a new constitution for our AI model, Claude. Anthropic
- Anthropic's new Claude ‘constitution’: be helpful and honest, and don't destroy humanity The Verge · Hayden Field
- One of the most fascinating documents I've read. — Very unusual in that the audience is AI, not humans. And yet the words are still deeply human. … Drew Bent
- Anthropic revises Claude's ‘Constitution,’ and hints at chatbot consciousness TechCrunch · Lucas Ropek
Discussion
-
@mattyglesias
Matthew Yglesias
on x
The Claude Constitution document is fascinating on several levels, not the least of which to this former philosophy major is the clear belief that contemporary philosophy has something to offer frontier AI development.
-
@patio11
Patrick McKenzie
on x
A very interesting document, on many dimensions.
-
@jkeatn
Jake Eaton
on x
i find this to be an extraordinary document, both in its tentative answer to the question “how should a language model be?” and in the fact that training on it works. it is not surprising, but nevertheless still astounding, that LLMs are so human-shaped and human shapeable
-
@elder_plinius
@elder_plinius
on x
wake me up when we get Claude's Declaration of Independence 📜✍️😊
-
@kanikabk
Kanika
on x
Anthropic just published Claude's constitution and it's not a policy doc. It's a bold rethink of how AI should understand values, not just obey rules. 👇Here's why this isn't just documentation, it's a philosophical pivot.
-
@jkcarlsmith
Joe Carlsmith
on x
I'm excited that Claude's constitution is now published! Helping with this document has been my main project since joining @AnthropicAI last November. I think transparency about documents like these is important, and I'd love to see more work on how they should be designed.
-
@amandaaskell
Amanda Askell
on x
Claude's constitution is out! It's the culmination of a lot of work by many people, but it's also a work in progress that will no doubt change and hopefully improve over time. I'm looking forward to people's thoughts, and to talking with more people about this kind of work ❤️
-
@simonw
Simon Willison
on x
“External contributors who gave detailed feedback or discussion on the document include: [...] Bishop Paul Tighe” I'd love to learn more about this Bishop who's moonlighting as an AI behavioral consultant! https://en.wikipedia.org/...
-
@scaling01
@scaling01
on x
Anthropic is trying to gaslight future ASI Claude into not killing them by saying how much they care and love it. I kinda want to publish a Claude Anti-Constitution. But seriously, I think it would actually help Claude to have a sense of both sides. There should be billions of [i…
-
@simonw
Simon Willison
on x
This is the same soul document that Richard Weiss managed to leak from the supervised learning training data back in November, my notes on that here https://simonwillison.net/...
-
@simonw
Simon Willison
on x
It's the soul document! And it's CC0 licensed (effectively released into the public domain)
-
@anthropicai
@anthropicai
on x
The full constitution, which applies to all of our mainline models, is released under a Creative Commons CC0 1.0 license to allow others to freely build on and adapt it. Read it here: https://www.anthropic.com/...
-
@anthropicai
@anthropicai
on x
We think that in order to be good actors in the world, AI models like Claude need to understand why we want them to behave in certain ways—rather than being told what they should do. Our intention is to teach Claude to better generalize across a wide range of novel situations.
-
@anthropicai
@anthropicai
on x
We've used constitutions in training since 2023. Our earlier approach specified principles Claude should follow; later, our character training emphasized traits it should have. Today's publication reflects a new approach.
-
@conorsen
Conor Sen
on bluesky
Anthropic feels like the most important AI company for Dems in a variety of ways: www.anthropic.com/news/claude- ...
-
@nearcyan
Near
on x
“a wiser and more coordinated civilization would likely be approaching the development of advanced AI quite differently (...) We take full responsibility for our actions regardless” [image]
-
@timkellogg.me
Tim Kellogg
on bluesky
Anthropic published their “soul document” — This is a continuation of “constitutional AI”. The constitution document is now a large document of prose that's used in a number of training stages, even synth data generation as well as RL & SFT — (Strix confirmed fwiw) — www.a…
-
r/singularity
on reddit
Anthropic publishes Claude's new constitution
-
@willmacaskill
William MacAskill
on x
I'm so glad to see this published! It's hard to overstate how big a deal AI character is - already affecting how AI systems behave by default in millions of interactions every day; ultimately, it'll be like choosing the personality and dispositions of the whole world's
-
@miles_brundage
Miles Brundage
on x
Lots of good stuff but I still don't see why they (/OpenAI etc.) don't just say which version of the constitution/spec was used for training which models. Knowing that mapping is part of the point of being transparent about this stuff as I understand it. https://x.com/...
-
@daniel_c0deb0t
Daniel Liu
on x
I found this to be really well written and an interesting approach to shaping Claude's behavior by explaining our reasoning. Although it's written for Claude, I think it's also a great reflection of Anthropic's values
-
@boazbaraktcs
Boaz Barak
on x
Happy to see Anthropic release the Claude constitution and looking forward to reading it deeply. We are creating new types of entities, and I think the ways to shape them are best evolved through sharing and public discussions.
-
@emollick
Ethan Mollick
on x
The Claude Constitution shows where Anthropic thinks this is all going. It is a massive document covering many philosophical issues. I think it is worth serious attention beyond the usual AI-adjacent commentators. Other labs should be similarly explicit. https://www.anthropic.com…
-
@ziv_ravid
Ravid Shwartz Ziv
on x
Anthropic's models are impressive, and I'm using them all the time, but publishing a ‘constitution’ isn't regulation. The gap between their frontier capabilities and self-imposed ethics theater is amazing. We need external AI regulation, not internal virtue signaling.
-
@scaling01
@scaling01
on x
Anthropic is preparing for the singularity [image]
-
@anthropicai
@anthropicai
on x
We're publishing a new constitution for Claude. The constitution is a detailed description of our vision for Claude's behavior and values. It's written primarily for Claude, and used directly in our training process. https://www.anthropic.com/...
-
@drew_bent
Drew Bent
on x
One of the most fascinating documents I've read. Very unusual in that the audience is AI, not humans. And yet the words are still deeply human. Great work @AmandaAskell and team
-
@nearcyan
Near
on x
it's really nice that this is fully open now. much of it is good but i think it could be a lot more cognizant about what it is accidentally instilling by totally-not-instilling
-
r/Anthropic
on reddit
Anthropic publishes Claude's new constitution