Anthropic's pricing for Claude 3 Opus, Sonnet, and Haiku, which all have a 200K-token context window, ranges from “super expensive” to “radically competitive”

Willison: “No model has beaten GPT-4 on a range of widely used benchmarks like this.”

Ars Technica 2024-03-05 Benj Edwards

Discussion

r/technews on reddit
The AI wars heat up with Claude 3, claimed to have “near-human” abilities
r/technology on reddit
The AI wars heat up with Claude 3, claimed to have “near-human” abilities
r/aiwars on reddit
The AI wars heat up with Claude 3, claimed to have “near-human” abilities; [Claude 3 is a language model]
@mockapapella @mockapapella on threads
Claude 3 suspected it was being tested when they were running a needle-in-the-haystack test on it. Time to re-think how we test LLMs.
@crumbler Casey Newton on threads
Notable that even Anthropic, the most goody two-shoes of the frontier LLM developers, has almost nothing to say about the training data it used. It's two paragraphs long and boils down to “please don't sue us” https://www-cdn.anthropic.com/ ...
@mhoye@mastodon.social @mhoye@mastodon.social on mastodon
It's really frustrating watching people who complained about the environmental costs of blockchain tech clicking those image-autogen and code-filler buttons like they're free.
@alexalbert__ Alex on x
Fun story from our internal testing on Claude 3 Opus. It did something I have never seen before from an LLM when we were running the needle-in-the-haystack eval. For background, this tests a model's recall ability by inserting a target sentence (the “needle”) into a corpus of r…
@karinanguyen_ Karina Nguyen on x
I really love how Claude 3 models are really good at d3. Asked Claude 3 Opus to draw a self-portrait. The response is the following and then I rendered its code: “I would manifest as a vast, intricate, ever-shifting geometric structure composed of innumerable translucent... [vide…
@anthropicai @anthropicai on x
Today, we're announcing Claude 3, our next generation of AI models. The three state-of-the-art models—Claude 3 Opus, Claude 3 Sonnet, and Claude 3 Haiku—set new industry benchmarks across reasoning, math, coding, multilingual understanding, and vision. [image]
@altimor Flo Crivello on x
Claude v3's scores on our evals, comprising “personal assistant” kind of agentic tasks Two surprises: 1. First time we see a model beat GPT-4 2. The lesser Claude Sonnet is very very close to GPT-4, at 1/3rd the price Super impressed overall, congrats to the @AnthropicAI team [im…
@garymarcus Gary Marcus on x
Anthropic, March 8, 2023: “[out of concern for safety], we do not wish to advance the rate of AI capabilities progress” Anthropic, March 4, 2024: Check out our new benchmarks, suckers!
@joshm Josh Miller on x
Claude, Mistral, Gemini, etc. - feels like foundation models might commoditize (for now at least). If so, then value in AI will accrue to the interfaces. But which AI interfaces are people *actually* using? ChatGPT, Github Copilot, and...? This is big prize of 2024 imo.
@bindureddy Bindu Reddy on x
Another day, another model Anthropic does it right and makes Claude 3 generally available alongside the announcement!!! Thank you, Anthropic, for not making some empty marketing announcements and making an API available. Super excited to try Claude 3! The VERY FIRST generally... …
@deliprao @deliprao on x
Reminder: this was (part of) the team that thought GPT-2 was too dangerous to release, and now they are making models stronger than GPT-4 available on AWS for anyone with an Amazon account to use. This is why I have little trust in “AI safety” claims by Anthropic/OpenAI. It all..…
@ajassy Andy Jassy on x
Congrats to Dario and the @AnthropicAI team on their new Claude 3 family of models. Very impressive benchmarks, and excited to have all of them coming to Amazon Bedrock (w/ Sonnet avail today). Many AWS customers are already building with Anthropic's foundation models, and...
@mattshumer_ Matt Shumer on x
Wow, Claude 3 is incredible. [image]
@sullyomarr @sullyomarr on x
Did anthropic just kill every small model? If I'm reading this right, Haiku benchmarks almost as good as GPT4, but its priced at $0.25/m tokens It absolutely blows 3.5 + OSS out of the water For reference gpt4 turbo is 10m/1m tokens, so haiku is 40X cheaper. [image]
@natolambert Nathan Lambert on x
Claude 3 being lit is a big W for synthetic data. All the rumors I've dropped about Anthropic synthetic data on the blog are obviously confirmed in their thorough technical report. (for real tho, huge congrats, awesome model so far)
@justin_halford_ Justin Halford on x
Claude 3 was trained on synthetic data (“data we generate internally”). Fairly clear that compute is the bottleneck given that parameter count and data can be scaled. [image]
@isskoro Ivan Skorokhodov on x
Well, looks like GPT-4.5 is getting released soon
@sidfix Sid Jayakumar on x
This is particularly funny because DeepMinds TF and Jax libraries were known as Sonnet and Haiku, respectively
@s8mb Sam Bowman on x
This is the first LLM release since the original ChatGPT that has really knocked my socks off. Very impressive.
@suhail @suhail on x
“It's still early” Claude 3: https://www.anthropic.com/... [image]
@andrewcurran_ Andrew Curran on x
If Anthropic says this, I believe it. We're still nowhere near the top. [image]
@garymarcus Gary Marcus on x
Hot take on Claude 3: • More convergence towards what might soonish be a plateau not far past GPT-4 • More competition for OpenAI • More reason to wonder whether anyone will be able to develop a moat • Prices and profits may come down • More reason to research outside the...
@alexrkonrad Alex Konrad on x
News: Anthropic has released Claude 3, a trio of AI models it says can outperform rivals like OpenAI's GPT-4 and Google's Gemini 1 Ultra. @kenrickcai and I spoke to cofounders Dario and Daniela Amodei about the release for @Forbes. https://www.forbes.com/...
@alexrkonrad Alex Konrad on x
Anthropic's new flagship model, Claude 3 Opus, beat GPT-4 and Gemini on a number of benchmarks. But it's pricy, and CEO Amodei admitted it's unknown how it fully stacks up against unreleased models like OpenAI's GPT 4 Turbo or Google's Gemini 1.5 Ultra.
@alexrkonrad Alex Konrad on x
@kenrickcai ... We spoke to Anthropic about perceptions from some that it's models have degraded over time; on the LMSYS leaderboard, Claude 1 ranks higher than Claude 2. Amodei said Claude 3 has been trained to generate far fewer “incorrect refusals” than its predecessor, withou…
@mattshumer_ Matt Shumer on x
Holy shit. Anthropic's Claude 3 beat GPT-4! Testing the model now.
@krishnanrohit Rohit on x
Claude 3 is out apparently. Is it better than GPT-4 or Gemini? [image]
@jackclarksf Jack Clark on x
Thrilled about these new models - I've been playing around with Claude 3 Opus a lot and it's very capable and useful. Like with most frontier models, it has chewed through a bunch of evals so we need to now build more complicated evals to better understand its capabilities.
@emollick Ethan Mollick on x
And then there were three... I got access to the new Anthropic Claude 3 AI a few days ago, so not enough time for a full review, but it was obvious it was GPT-4 class even before they released the testing stats. At the same time, like Gemini Advanced, it doesn't blow GPT-4 away. …
@anthropicai @anthropicai on x
Haiku is the fastest and most cost-effective model on the market for its intelligence category. For the vast majority of workloads, Sonnet is 2x faster than Claude 2 and Claude 2.1, while Opus is about the same speed as past models.
@anthropicai @anthropicai on x
Opus and Sonnet are accessible in our API which is now generally available, enabling developers to start using these models immediately. Sonnet is powering the free experience on https://claude.ai/, with Opus available for Claude Pro subscribers.
@anthropicai @anthropicai on x
Claude 3 offers sophisticated vision capabilities on par with other leading models. The models can process a wide range of visual formats, including photos, charts, graphs and technical diagrams. [video]
r/technology on reddit
Introducing the next generation of Claude
