ELEVENLABS April 3, 2026

ElevenLabs: Instant Voice Cloning From 10 Seconds of Audio

WHAT CHANGED

ElevenLabs now supports Instant Voice Cloning from as little as 10 seconds of audio via their API. Upload a clean audio sample, get a voice_id back in under 3 seconds, and immediately use it for text-to-speech generation.

WHY IT MATTERS

The friction of adding a custom voice to an app just collapsed. 10 seconds of audio is easy to collect from any user, any podcast clip, or any spokesperson recording. This unlocks personalized TTS at scale — onboarding flows, AI assistants, content localization.

HOW TO USE IT

POST an audio file to /v1/voices/add with name and files[] parameters. The API returns a voice_id immediately. Pass that voice_id to /v1/text-to-speech/{voice_id} for synthesis. Minimum sample: 10 seconds of clear speech, no background noise.

ELEVENLABS / PYTHON

import requests
import os

ELEVEN_API_KEY = os.environ["ELEVEN_API_KEY"]

# Step 1: Clone a voice from a short audio sample
def clone_voice(name: str, audio_path: str) -> str:
    with open(audio_path, "rb") as f:
        response = requests.post(
            "https://api.elevenlabs.io/v1/voices/add",
            headers={"xi-api-key": ELEVEN_API_KEY},
            data={"name": name},
            files={"files": (audio_path, f, "audio/mpeg")},
        )
    response.raise_for_status()
    return response.json()["voice_id"]

# Step 2: Generate speech with the cloned voice
def speak(voice_id: str, text: str, output_path: str):
    response = requests.post(
        f"https://api.elevenlabs.io/v1/text-to-speech/{voice_id}",
        headers={
            "xi-api-key": ELEVEN_API_KEY,
            "Content-Type": "application/json",
        },
        json={
            "text": text,
            "model_id": "eleven_multilingual_v2",
            "voice_settings": {"stability": 0.5, "similarity_boost": 0.8},
        },
    )
    response.raise_for_status()
    with open(output_path, "wb") as f:
        f.write(response.content)

voice_id = clone_voice("My Narrator", "sample.mp3")
speak(voice_id, "Welcome to the future of voice AI.", "output.mp3")

voice-cloninginstantapitts

ORIGINAL SOURCE

https://elevenlabs.io/docs/api-reference/voice-lab/create-voice

VIEW ORIGINAL SOURCE →

ElevenLabs instant voice cloning: 10 seconds in, production-ready voice out. The barrier to personalized audio just hit the floor.

← BACK TO UPDATES