/
Navigation
Chronicles
Browse all articles
Explore
Semantic exploration
Research
Entity momentum
Nexus
Correlations & relationships
Story Arc
Topic evolution
Drift Map
Semantic trajectory animation
Posts
Analysis & commentary
Pulse API
Tech news intelligence API
Browse
Entities
Companies, people, products, technologies
Domains
Browse by publication source
Handles
Browse by social media handle
Detection
Concept Search
Semantic similarity search
High Impact Stories
Top coverage by position
Sentiment Analysis
Positive/negative coverage
Anomaly Detection
Unusual coverage patterns
Analysis
Rivalry Report
Compare two entities head-to-head
Semantic Pivots
Narrative discontinuities
Crisis Response
Event recovery patterns
Connected
Search: /
Command: ⌘K
Embeddings: large
TEXXR

Chronicles

The story behind the story

days · browse · Enter similar · o open

OpenAI updates ChatGPT Plus and ChatGPT Enterprise to let users prompt the tool using voice commands or by uploading an image, coming to all users “soon after”

and Look Into Your Life Kyle Wiggers / TechCrunch : OpenAI's GPT-4 with vision still has flaws, paper reveals The Hill : ChatGPT given the ability to talk Laurent Giret / Thurrott : ChatGPT Can Now Talk and Analyze Images Damir Yalalov / Metaverse Post : Rumulations of OpenAI ‘Arrakis’ to Be Even More Powerful Than GPT-4 and Gobi Aman Gupta / Livemint : ChatGPT can now talk to you. Here's how to use the newly released features by OpenAI CNBC : ChatGPT can now ‘speak,’ listen and process images, OpenAI says Lisa Marie Segarra / PetaPixel : ChatGPT Can Now See Your Photos and Respond to Them James Laird / Tech.co : How to Use ChatGPT's New Voice Command and Image Features Benj Edwards / Ars Technica : ChatGPT update enables its AI to “see, hear, and speak,” according to OpenAI Ben Lyons / Gamereactor UK : ChatGPT will soon be able to hear and see you Kamya Pandey / MediaNama : OpenAI introduces its image command feature, says it will refuse requests for some prompts containing human images Evann Gastaldo / Newser : ChatGPT Can Now ‘See, Hear and Speak’ Steve Muchoki / Coinspeaker : OpenAI Announces Conversational and Image Search Features for ChatGPT Plus and Enterprise Users Abdullah / Gizchina : ChatGPT: Introducing Voice and Image Functionalities International Business Times : OpenAI's ChatGPT can now see, hear and speak; Also uses Whisper Christoph Schwaiger / Tom's Guide : You can now talk to ChatGPT — and it'll talk back Siamak Masnavi / Tech News Outlet : OpenAI Unveils Voice and Image Features for ChatGPT, Exclusively for Plus and Enterprise Users Daniel Levi / TechStartups : ChatGPT can now see, hear, speak, and engage in voice conversations like Apple's Siri Reuters : ChatGPT update will give it a voice and allow users to interact using images OpenAI : GPT-4V(ision) System Card Roger Cheng / Cord Cutters News : ChatGPT Can Now See, Hear, and Speak as AI Gets Better Scary Fast GovTech : Can you have a verbal conversation with ChatGPT? Catherine Thorbecke / CNN : Now you can speak to ChatGPT — and it will talk back Tech Xplore : ChatGPT AI getting chatty with voice prompts Andrew Romero / 9to5Google : ChatGPT has gone full virtual assistant with voice and image recognition Stefanie Schappert / Cybernews.com : ChatGPT finds its voice (plus eyes and ears) with major upgrade David Gewirtz / ZDNet : How to write better ChatGPT prompts for the best generative AI results Godfrey Elimian / Technext : Nigerian podcasters can now reach wider audiences with Spotify's new voice translation tool Kris Holt / Engadget : ChatGPT now supports voice chats and image-based queries Josh Norem / ExtremeTech : ChatGPT to Begin Allowing Photo and Voice-Based Queries PYMNTS.com : OpenAI Says ChatGPT Can Now ‘See’ and ‘Speak’ Fionna Agomuoh / Digital Trends : ChatGPT's new upgrade finally breaks the text barrier Matthew Gooding / Tech Monitor : ChatGPT update will help OpenAI's chatbot ‘see, hear and speak’ Kyt Dotson / SiliconANGLE : OpenAI's ChatGPT chatbot now allows users to use voice and pictures to get answers Nicola Agius / Search Engine Land : ChatGPT rolls out voice and image prompts Ryan McNeal / Android Authority : ChatGPT now lets you talk with it or submit pictures for prompts Emily Dreibelbis / PCMag : Tell Me a Story, ChatGPT: OpenAI Targets Families With New Voice, Image Features Ben Wilson / Windows Central : Show ChatGPT what you see: Voice and image features are live (for a price) Rahul Naskar / XDA Developers : OpenAI announces voice support and new image capabilities for ChatGPT Chris Smith / BGR : ChatGPT will support voice and picture prompts for free Abubakar Idris / The Messenger : OpenAI Reveals ChatGPT Can Now See and Hear, Not Just Speak Shubham Sharma / VentureBeat : ChatGPT goes multimodal: now supports voice, image uploads Jonathan Kemper / The Decoder : ChatGPT can now hear, speak, see, and understand multimodal prompts Devesh Beri / OnMSFT.com : ChatGPT can now have back-and-forth conversations with users, can decipher images Surur / BigTechWire : OpenAi announce multi-modal capabilities: ChatGPT can now see, hear and speak Pranav Dixit / Business Today : ChatGPT can now speak, hear, and see: Here's how to use new voice and image capabilities Matt G. Southern / Search Engine Journal : ChatGPT Leaps Forward With New Voice & Image Capabilities Kyle Barr / Gizmodo : ChatGPT Is Growing Eyes and Ears to Better Respond to Your Human Whims Hasan Chowdhury / Insider : ChatGPT can now talk back to you with an eerily human-like voice Cindy Tan / Metaverse Post : OpenAI's ChatGPT Unveils Major Upgrade, Adds Voice Conversation and Image Chat Gavin Phillips / MakeUseOf : OpenAI Gives ChatGPT a Voice to Respond to Prompts and Commands Markus Kasanmascheff / WinBuzzer : ChatGPT Adds Voice and Image Input, Turning into Fully Fledged Voice Assistant New York Times : ChatGPT Can Now Respond With Spoken Words Amrita Khalid / The Verge : Spotify partners with OpenAI to debut an AI translation feature that reproduces podcasts in other languages using a synthesized version of the podcaster's voice X: @openai : ChatGPT can now see, hear, and speak. Rolling out over next two weeks, Plus users will be able to have voice conversations with ChatGPT (iOS & Android) and to include images in conversations (all platforms). https://openai.com/... [video] @swyx : learnings from GPT4V model card: https://cdn.openai.com/... - ramped up to 16k BeMyEyes + 1k developer alpha testers over 6 months - reduced frequency and severity of hallucinations - improved OCR and quality of descriptions - great demand for describing people without affecting... [image] Bonnie Stewart / @bonstewart : i was literally teaching about GenAI & ChatGPT today in class when a student announced this. glad i didn't see the actual tweet or i'd have spent the WHOLE time (instead of just part of it) banging on about how LMMs canNOT actually see, hear, OR speak. whee. this hype cycle. 🫠 @sashamtl : The always and forever PSA: stop treating AI models like humans. No, ChatGPT cannot “see, hear and speak”. It can be integrated with sensors that will feed it information in different modalities. Don't fan the flames of hype, y'all. Will Depue / @willdepue : some people at OpenAI think voice and vision is a bigger deal than GPT-4, from a product standpoint. not sure if I disagree, walking around and talking to ChatGPT like you're on the phone is amazing. Chris Anderson / @tedchris : The rate of progress here is remarkable. So much new usefulness when you can truly iterate with the AI. Peter Welinder / @npew : The hero behind enabling GPT-4 to see is @TheRealRPuri who worked tirelessly across the whole stack, from research to product, for the past year to make this a reality for our ChatGPT users. @_lamaahmad : We've included a system card focused on the vision capabilities, building on the work from the GPT-4 system card. Thank you to all our expert testers and red teamers for helping to inform this work! https://openai.com/... Cesare G. Ardito / @cesaregardito : Multimodal GPT will finally be available to the wider public in a matter of weeks. Remember all those “put visual elements in your assignments, they can't copypaste them in ChatGPT?”. Think again! Greg Brockman / @gdb : Voice mode and image inputs are now in ChatGPT. Starting to feel like the interface to a real AI: @autismcapital : Enjoying these last 4-6 years we have left before nothing makes sense anymore. https://openai.com/... Mira Murati / @miramurati : A more intuitive interface for ChatGPT. Just chat with it using your voice or show it what you're talking about using images. Rolling out over next 2 weeks. Rowan Cheung / @rowancheung : 🚨 BREAKING: Massive breakthrough in the world of AI. ChatGPT can now speak, hear, see, and more. In other words, ChatGPT is officially multimodal and just got 10x easier to use! Here's everything you need to know (thread): [video] Sam Altman / @sama : voice mode and vision for chatgpt! really worth a try. https://openai.com/... Forums: r/audible : ChatGPT can now narrate your book in a realistic human voice and also respond to your questions. r/ChatGPTPro : ChatGPT can now see, hear, and speak r/OpenAI : ChatGPT can now see, hear, and speak Beehaw : OpenAI's ChatGPT chatbot now supports prompting with voice and images See also Mediagazer

The Verge David Pierce

Discussion

  • @openai @openai on x
    ChatGPT can now see, hear, and speak. Rolling out over next two weeks, Plus users will be able to have voice conversations with ChatGPT (iOS & Android) and to include images in conversations (all platforms). https://openai.com/... [video]
  • @swyx @swyx on x
    learnings from GPT4V model card: https://cdn.openai.com/... - ramped up to 16k BeMyEyes + 1k developer alpha testers over 6 months - reduced frequency and severity of hallucinations - improved OCR and quality of descriptions - great demand for describing people without affecting.…
  • @bonstewart Bonnie Stewart on x
    i was literally teaching about GenAI & ChatGPT today in class when a student announced this. glad i didn't see the actual tweet or i'd have spent the WHOLE time (instead of just part of it) banging on about how LMMs canNOT actually see, hear, OR speak. whee. this hype cycle. 🫠
  • @sashamtl @sashamtl on x
    The always and forever PSA: stop treating AI models like humans. No, ChatGPT cannot “see, hear and speak”. It can be integrated with sensors that will feed it information in different modalities. Don't fan the flames of hype, y'all.
  • @willdepue Will Depue on x
    some people at OpenAI think voice and vision is a bigger deal than GPT-4, from a product standpoint. not sure if I disagree, walking around and talking to ChatGPT like you're on the phone is amazing.
  • @tedchris Chris Anderson on x
    The rate of progress here is remarkable. So much new usefulness when you can truly iterate with the AI.
  • @npew Peter Welinder on x
    The hero behind enabling GPT-4 to see is @TheRealRPuri who worked tirelessly across the whole stack, from research to product, for the past year to make this a reality for our ChatGPT users.
  • @_lamaahmad @_lamaahmad on x
    We've included a system card focused on the vision capabilities, building on the work from the GPT-4 system card. Thank you to all our expert testers and red teamers for helping to inform this work! https://openai.com/...
  • @cesaregardito Cesare G. Ardito on x
    Multimodal GPT will finally be available to the wider public in a matter of weeks. Remember all those “put visual elements in your assignments, they can't copypaste them in ChatGPT?”. Think again!
  • @gdb Greg Brockman on x
    Voice mode and image inputs are now in ChatGPT. Starting to feel like the interface to a real AI:
  • @autismcapital @autismcapital on x
    Enjoying these last 4-6 years we have left before nothing makes sense anymore. https://openai.com/...
  • @miramurati Mira Murati on x
    A more intuitive interface for ChatGPT. Just chat with it using your voice or show it what you're talking about using images. Rolling out over next 2 weeks.
  • @rowancheung Rowan Cheung on x
    🚨 BREAKING: Massive breakthrough in the world of AI. ChatGPT can now speak, hear, see, and more. In other words, ChatGPT is officially multimodal and just got 10x easier to use! Here's everything you need to know (thread): [video]
  • @sama Sam Altman on x
    voice mode and vision for chatgpt! really worth a try. https://openai.com/...
  • r/audible r on reddit
    ChatGPT can now narrate your book in a realistic human voice and also respond to your questions.
  • r/ChatGPTPro r on reddit
    ChatGPT can now see, hear, and speak
  • r/OpenAI r on reddit
    ChatGPT can now see, hear, and speak