CHANGELOG
Extended thinking in claude-sonnet-4-6 now streams at 2x the previous throughput. Internal reasoning tokens arrive in re...
Anthropic released Claude Sonnet 4.6, the latest in the Claude 4 family, with improved reasoning, faster response times,...
Extended thinking mode in Claude now streams thinking tokens in real-time. Previously the full thinking block was buffer...
Claude Code, Anthropic's agentic coding tool that runs in the terminal, exited beta and is now generally available. It c...
OpenAI flipped the switch on persistent memory for all Plus and Pro subscribers. ChatGPT now automatically stores facts,...
OpenAI enabled persistent memory by default for all ChatGPT Plus and Pro users. The model automatically saves facts, pre...
OpenAI's Realtime API now supports mixing text and audio modalities in the same WebSocket session. Send text, receive au...
Google has made grounding with Google Search free for up to 1,500 queries per day on Gemini 2.5 Pro via the Gemini API. ...
Google released Gemini 2.5 Pro with a 1 million token context window, improved multimodal reasoning, and native code exe...
Google made the Search grounding feature free for Gemini API users up to 1,500 queries per day, letting the model cite r...
Cursor 0.44 ships Background Agent — a fully autonomous coding agent that runs in a remote sandboxed environment. It can...
Cursor's Background Agent mode now supports running test suites, interpreting failures, self-correcting, and committing ...
ElevenLabs now supports Instant Voice Cloning from as little as 10 seconds of audio via their API. Upload a clean audio ...
ElevenLabs reduced the minimum audio required for instant voice cloning from 1 minute down to 10 seconds while maintaini...
Vercel released AI SDK 4.0 with a unified API that works identically across Claude, GPT-4o, Gemini, Mistral, and Llama. ...