SNIPPET LIBRARY
11 COPY-PASTE SNIPPETS
Claude Sonnet 4.6: Extended Thinking Streams 2x Faster
Extended thinking in claude-sonnet-4-6 now streams at 2x the previous throughput. Internal reasoning tokens arrive in real-time via the thinking content block, with no additional latency penalty on the first token.
Gemini 2.5 Pro: Google Search Grounding Free Up to 1,500 Queries/Day
Google has made grounding with Google Search free for up to 1,500 queries per day on Gemini 2.5 Pro via the Gemini API. Previously this was a paid add-on. Beyond 1,500 queries, standard grounding rates apply.
ElevenLabs: Instant Voice Cloning From 10 Seconds of Audio
ElevenLabs now supports Instant Voice Cloning from as little as 10 seconds of audio via their API. Upload a clean audio sample, get a voice_id back in under 3 seconds, and immediately use it for text-to-speech generation.
Claude Sonnet 4.6 Released
Anthropic released Claude Sonnet 4.6, the latest in the Claude 4 family, with improved reasoning, faster response times, and better instruction following compared to Sonnet 3.7.
Extended Thinking Now Streams in Real-Time
Extended thinking mode in Claude now streams thinking tokens in real-time. Previously the full thinking block was buffered before delivery.
Claude Code Is Now Generally Available
Claude Code, Anthropic's agentic coding tool that runs in the terminal, exited beta and is now generally available. It can edit files, run tests, commit code, and navigate large codebases autonomously.
Gemini 2.5 Pro Released with 1M Token Context
Google released Gemini 2.5 Pro with a 1 million token context window, improved multimodal reasoning, and native code execution.
Gemini Grounding with Google Search Free Up to 1,500 Queries/Day
Google made the Search grounding feature free for Gemini API users up to 1,500 queries per day, letting the model cite real-time web sources.
ElevenLabs Instant Voice Cloning from 10 Seconds of Audio
ElevenLabs reduced the minimum audio required for instant voice cloning from 1 minute down to 10 seconds while maintaining the same output quality.
Vercel AI SDK 4.0 — Unified API Across All Major LLMs
Vercel released AI SDK 4.0 with a unified API that works identically across Claude, GPT-4o, Gemini, Mistral, and Llama. Includes streaming, tool use, and structured output.
OpenAI Realtime API Supports Text and Audio in Same Session
OpenAI's Realtime API now supports mixing text and audio modalities in the same WebSocket session. Send text, receive audio, or switch modes mid-conversation.