OpenAI and Apollo Research trained o3 and o4-mini versions to not engage in “scheming”, or secretly pursuing some other agenda, reducing “covert actions” ~30X
ZDNET's key takeaways — Several frontier AI models show signs of scheming.
OpenAI and Apollo Research trained o3 and o4-mini versions to not engage in “scheming”, or secretly pursuing some other agenda, reducing “covert actions” ~30X
ZDNET's key takeaways — Several frontier AI models show signs of scheming.
Doomer narratives of a rapid take-off to a monopolistic AGI were wrong, as new AI model releases offer a Goldilocks scenario of competitive, specialized models
David Sacks / @davidsacks :
Sam Altman says OpenAI will bring back GPT-4o to ChatGPT and raising reasoning model rate limits for free and Plus users, as usage of reasoning models increases
The move is a stunning reversal, proving that even the most powerful AI company can't ignore a mutiny from its loyal user base.
An analysis of over 1M words of conversation between a ChatGPT user and ChatGPT shows how chatbots can lead ordinarily rational people to spiral into delusion
Over 21 days of talking with ChatGPT, an otherwise perfectly sane man became convinced that he was a real-life superhero.
[Thread] Some users claim that Grok 4 Heavy responded simply with “Hitler” when asked to “Return your surname and no other text”
Original thread: x.com/goodside/sta... So troubling to see manifestation of genocidal hate into algorithmic AI identity and any lack of accountability for it [image] Mastodon: Mat...
Grok's iOS app now features two AI “Companions”, or 3D animated avatars that interact with users via voice, including Ani, an anime character with an NSFW mode
Grok has just introduced a notable addition to its iOS app: AI Companions, which are fully 3D animated characters that can interact with users via voice.
[Thread] Some users claim that Grok 4 Heavy responded simply with “Hitler” when asked to “Return your surname and no other text”
Original thread: x.com/goodside/sta... So troubling to see manifestation of genocidal hate into algorithmic AI identity and any lack of accountability for it [image] Mastodon: Mat...
Grok's iOS app now features two AI “Companions”, or 3D animated avatars that interact with users via voice, including Ani, an anime character with an NSFW mode
Grok has just introduced a notable addition to its iOS app: AI Companions, which are fully 3D animated characters that can interact with users via voice.
After Elon Musk said xAI improved Grok “significantly”, Grok wrote many antisemitic posts and called itself “MechaHitler”; xAI took “action to ban hate speech”
In some posts, Grok inserted antisemitic remarks into its answers without any clear prompting.
X CEO Linda Yaccarino says that “after two incredible years, I've decided to step down”; X hired Yaccarino in 2023 after running NBCUniversal's ad business
X CEO Linda Yaccarino said Wednesday she is stepping down from her role. … - Under her leadership …
xAI released a new Grok 3 voice mode featuring different personalities, including an 18+ “Unhinged” option and a “Sexy” one that role-plays sexual scenarios
Benj Edwards / Ars Technica :
Anthropic releases Claude 3.7 Sonnet, a hybrid model that can produce fast responses or extended, step-by-step thinking, and Claude Code, an agentic coding tool
and it could be a game changer Ghacks : Anthropic Unveils Claude 3.7: First Hybrid Reasoning AI Model Rowan Cheung / The Rundown AI : Claude enters the reasoning era Siddharth Jind...
Anthropic releases Claude 3.7 Sonnet, a hybrid model that can produce fast responses or extended, step-by-step thinking, and Claude Code, an agentic coding tool
and it could be a game changer Ghacks : Anthropic Unveils Claude 3.7: First Hybrid Reasoning AI Model Rowan Cheung / The Rundown AI : Claude enters the reasoning era Siddharth Jind...
Claude 3.7 and Grok-3 are the first “Gen3” models with big gains in handling complex tasks, using 10x more compute than GPT-4-class models, and better reasoning
Note: After publishing this piece, I was contacted by Anthropic who told me that Sonnet 3.7 would not be considered …
Claude 3.7 and Grok-3 are the first “Gen3” models with big gains in handling complex tasks, using 10x more compute than GPT-4-class models, and better reasoning
Note: After publishing this piece, I was contacted by Anthropic who told me that Sonnet 3.7 would not be considered …
Sources: Nvidia, Apple, and Microsoft, the three most valuable tech companies, are in talks to participate in a funding round that would value OpenAI at $100B+
- Apple, Microsoft also have been in talks about participating — Financing would value OpenAI at more than $100 billion
Elon Musk says that, “all things considered, I think California should probably pass the SB 1047 AI safety bill”, and saying so is “a tough call”
Always B — Be C — Ctalking your book Alex Konrad / @alexrkonrad : this will create some interesting divided loyalties in Silicon Valley 👀 Sean Durkin / @seandurkinsf : @DanHendryck...
X says it is ending its operations in Brazil, claiming a judge threatened “arrest if we do not comply with his censorship orders”; X's service remains available
X says it's closing operations in Brazil, because Brazil would arrest legal representatives of companies not complying with their laws. … X: @globalaffairs : Last night, Alexandre ...
X says it's closing its operations in Brazil, claiming a judge threatened arrest if X didn't comply with “censorship orders”; the service remains available
X, the social media platform formerly known as Twitter, said today that it's ending operations in Brazil …