2025-01-24
With OpenAI's Operator (releasing today) you can take a picture of your grocery list and the Operator Agent will automatically order it from Instacart for you. Your AI may be the primary user of the web now. Screenshot from live demo [image]
The Verge
OpenAI releases a “research preview” of its Operator AI agent that can automate web-based tasks, launching to US subscribers of its $200/month ChatGPT Pro tier
A research preview of an agent that can use its own browser to perform tasks for you. OpenAI on YouTube : Introduction to Operator & Agents David Gewirtz / ZDNET : Operator isn't w...
With OpenAI's Operator (releasing today) you can take a picture of your grocery list and the Operator Agent will automatically order it from Instacart for you. Your AI may be the primary user of the web now. Screenshot from live demo [image]
TechCrunch
OpenAI partners with DoorDash, eBay, Instacart, Priceline, StubHub, Uber, and other companies to ensure that Operator respects their terms of service agreements
OpenAI CEO Sam Altman kicked off this year by saying in a blog post that 2025 would be big for AI agents, tools that can automate tasks and take actions on your behalf.
2023-05-13
Anthropic AI's new 100k token context window will allow for prompts equivalent to 250+ pages of text. This massive jump in token context length could allow students to upload entire textbooks for conversational analysis. https://twitter.com/...
TechCrunch
Anthropic expands Claude's context window from 9K to 100K tokens, or ~75K words it can digest and analyze; OpenAI's GPT-4 has a context window of ~32K tokens
2023-05-12
Anthropic AI's new 100k token context window will allow for prompts equivalent to 250+ pages of text. This massive jump in token context length could allow students to upload entire textbooks for conversational analysis. https://twitter.com/...
TechCrunch
Anthropic expands Claude's context window from 9K to 100K tokens, or ~75K words it can digest and analyze; OpenAI's GPT-4 has a context window of ~32K tokens
Historically and even today, poor memory has been an impediment to the usefulness of text-generating AI.
2023-03-24
GPT-4 connecting to the web 👀 https://twitter.com/...
OpenAI
OpenAI rolls out ChatGPT plugins, including two of its own, a web browser and a code interpreter, and open sources the code for a knowledgebase retrieval plugin
We've implemented initial support for plugins in ChatGPT. Plugins are tools designed specifically for language models with safety …
2023-03-02
The next big leap in useable AI: The ability to understand images (see examples) Microsoft's Kosmos-1, a Multimodal Large Language Model (MLLM) conducts various vision tasks - and suggests MLLMs may be capable of nonverbal reasoning Link to paper: https://arxiv.org/... https://twitter.com/...
Ars Technica
Microsoft researchers unveil Kosmos-1, a multimodal LLM they claim can understand image content, pass visual IQ tests, and accepts a variety of input formats
Microsoft believes a multimodal approach paves the way for human-level AI. — On Monday, researchers from Microsoft introduced Kosmos-1 …