NEW! Voice Cloning Now Available in 37 Languages

Text to Speech
One API Call
Simplified

Build voice apps, audiobooks, and accessible content with our text-to-speech API. 600+ natural voices in 142 languages for developers and businesses.

Award-Winning Excellence

See What Our Users Say About Verbatik

API PRICING

Simple, Transparent Pricing

Industry-Leading Value

Get the best value in the industry with our straightforward pricing. Suited to businesses of any size, our API offers premium text-to-speech conversion for just $0.00002 per character (50,000 characters per $1). Scale your voice content without exceeding your budget.

  • 50,000 Characters per $1
  • High-Quality TTS Voices
  • Fast TTS Speed
  • Commercial Rights
  • Simple API Integration
  • 600+ Voices, 142 Languages

API Pay-As-You-Go

50,000 characters/$1

or $20 per 1M characters
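
The pay-as-you-go rate can be turned into a quick budget estimate. The sketch below is illustrative only: `estimateCostUsd` is a hypothetical helper, and it assumes billing is strictly linear at 50,000 characters per $1, with no minimum charge or rounding.

```javascript
// Published pay-as-you-go rate from the pricing section: 50,000 characters per $1.
const CHARS_PER_DOLLAR = 50000;

// Estimate the cost in US dollars for a given number of characters.
// Assumption: billing is linear, with no minimum charge and no rounding.
function estimateCostUsd(characterCount) {
    if (!Number.isInteger(characterCount) || characterCount < 0) {
        throw new RangeError('characterCount must be a non-negative integer');
    }
    return characterCount / CHARS_PER_DOLLAR;
}

console.log(estimateCostUsd(1000000)); // 20
console.log(estimateCostUsd(2500));    // 0.05
```

The first call reproduces the "$20 per 1M characters" figure quoted above.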

For Developers

Why Developers Choose the Verbatik TTS API

Superior AI Voice API

Access 600+ natural-sounding voices built on advanced AI voice technology

Ultra-Fast Processing

Convert millions of characters in seconds with our optimized text-to-speech API

142 Languages and Dialects

Reach a global audience with our comprehensive text-to-audio API

Easy Integration

Implement our AI voice API with just a few lines of code and clear documentation

Enterprise Scalability

Our AI text-to-speech API handles everything from small projects to enterprise workloads

Commercial Usage Rights

Full rights to use generated audio files in your commercial products

How to Get Started with Our API

Step 1

Add Credits

Choose the right amount for your needs. You can get 10,000 free characters for the AI voice generator API by scheduling a meeting with us.

Step 2

Generate an API Key

Go to your dashboard to create your unique secret key for secure voice API authentication.

Step 3

Synthesize Speech

Integrate the text-to-audio API with just a few lines of code and turn your text into natural-sounding speech.

Text-to-Speech API Documentation

Complete guide to integrating the TTS API, with 568 voices across multiple providers

API v1.0

Verbatik Text-to-Speech API Overview

High-quality neural text-to-speech with multiple voices

The Verbatik Text-to-Speech API provides high-quality neural voice synthesis. Convert text to natural-sounding speech with 568 voices across 142 languages.

Key Features

  • 568 neural voices
  • 142 supported languages
  • SSML support for advanced control
  • Multiple provider support

Technical Specs

  • MP3 output format
  • 24kHz sample rate
  • Character-based billing
  • S3 storage integration

Voice Library

  • 568 high-quality neural voices
  • 142 languages supported
  • 24kHz sample rate

API Endpoints

POST /api/v1/tts

Synthesize Speech

Convert text to speech using neural voices. Supports both plain text and SSML input with optional audio storage.

Request Headers

| Header | Required | Description |
| --- | --- | --- |
| Authorization | Yes | Bearer token for authentication |
| Content-Type | Yes | text/plain or application/ssml+xml |
| X-Voice-ID | Optional | Voice to use (default: Matthew) |
| X-Store-Audio | Optional | Store audio in S3 and return URL (true/false) |

Example Request (Plain Text)
curl -X POST "https://api.verbatik.com/api/v1/tts" \
  -H "Authorization: Bearer YOUR_API_TOKEN" \
  -H "Content-Type: text/plain" \
  -H "X-Voice-ID: Sarah" \
  -H "X-Store-Audio: true" \
  -d "Hello, this is a test of our text-to-speech API."
Example Request (SSML)
curl -X POST "https://api.verbatik.com/api/v1/tts" \
  -H "Authorization: Bearer YOUR_API_TOKEN" \
  -H "Content-Type: application/ssml+xml" \
  -H "X-Voice-ID: Emma" \
  -H "X-Store-Audio: true" \
  -d '<speak version="1.0">
    Hello, <break time="500ms"/> this is a <emphasis level="strong">test</emphasis> 
    of our text-to-speech API with <prosody rate="slow">SSML support</prosody>.
</speak>'
Example Response (Stored Audio)
{
  "success": true,
  "audio_url": "https://s3.eu-west-2.amazonaws.com/speak.verbatik.com/audio/123/uuid-filename.mp3"
}
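
The two cURL requests above differ only in their Content-Type and body. As a rough sketch, a small helper can pick the content type from the input itself. `buildTtsRequest` is a hypothetical name, not part of any Verbatik SDK, and the check for a leading `<speak` tag is an assumption about how you would tell SSML from plain text.

```javascript
// Build fetch() options for POST /api/v1/tts, choosing the Content-Type from
// the input: SSML if it is wrapped in <speak>, plain text otherwise.
// buildTtsRequest is a hypothetical helper, not an official client function.
function buildTtsRequest(input, apiToken, voice = 'Matthew', storeAudio = true) {
    const isSsml = /^\s*<speak[\s>]/.test(input);
    return {
        method: 'POST',
        headers: {
            'Authorization': `Bearer ${apiToken}`,
            'Content-Type': isSsml ? 'application/ssml+xml' : 'text/plain',
            'X-Voice-ID': voice,
            'X-Store-Audio': String(storeAudio)
        },
        body: input
    };
}

const plain = buildTtsRequest('Hello there.', 'YOUR_API_TOKEN', 'Sarah');
const ssml = buildTtsRequest(
    '<speak version="1.0">Hello <break time="500ms"/> there.</speak>',
    'YOUR_API_TOKEN',
    'Emma'
);
console.log(plain.headers['Content-Type']); // text/plain
console.log(ssml.headers['Content-Type']);  // application/ssml+xml
```

The returned object can be passed as the second argument to `fetch('https://api.verbatik.com/api/v1/tts', options)`.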
GET /api/v1/voices

Get Available Voices

Retrieve the list of all available voices with their details including name, gender, and language.

Example Request
curl -X GET "https://api.verbatik.com/api/v1/voices" \
  -H "Authorization: Bearer YOUR_API_TOKEN"
Example Response
[
  {
    "name": "Sarah",
    "gender": "Female",
    "language_code": "en-US"
  },
  {
    "name": "Matthew",
    "gender": "Male", 
    "language_code": "en-US"
  },
  {
    "name": "Emma",
    "gender": "Female",
    "language_code": "en-US"
  }
]
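
Once the voice list is fetched, it is usually filtered client-side. The sketch below assumes the response has exactly the shape shown in the example (an array of objects with `name`, `gender`, and `language_code`). `findVoices` is a hypothetical helper.

```javascript
// Sample data copied from the example response above.
const sampleVoices = [
    { name: 'Sarah', gender: 'Female', language_code: 'en-US' },
    { name: 'Matthew', gender: 'Male', language_code: 'en-US' },
    { name: 'Emma', gender: 'Female', language_code: 'en-US' }
];

// Return the names of voices matching the optional language and gender filters.
// Omitted filters match every voice.
function findVoices(voices, { languageCode, gender } = {}) {
    return voices
        .filter(v => !languageCode || v.language_code === languageCode)
        .filter(v => !gender || v.gender === gender)
        .map(v => v.name);
}

console.log(findVoices(sampleVoices, { languageCode: 'en-US', gender: 'Female' }));
// [ 'Sarah', 'Emma' ]
```

A name returned here is what the X-Voice-ID header expects.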

Integration Examples

Ready-to-use code examples for popular programming languages and frameworks.

JavaScript/Node.js Example

async function synthesizeSpeech(text, voice, apiToken, storeAudio = true) {
    try {
        const response = await fetch('https://api.verbatik.com/api/v1/tts', {
            method: 'POST',
            headers: {
                'Authorization': `Bearer ${apiToken}`,
                'Content-Type': 'text/plain',
                'X-Voice-ID': voice,
                'X-Store-Audio': storeAudio.toString()
            },
            body: text
        });

        if (!response.ok) {
            throw new Error(`HTTP error! status: ${response.status}`);
        }

        if (storeAudio) {
            const result = await response.json();
            console.log('Audio URL:', result.audio_url);
            return result;
        } else {
            // Direct audio stream
            const audioBlob = await response.blob();
            const audioUrl = URL.createObjectURL(audioBlob);
            return { audio_url: audioUrl };
        }

    } catch (error) {
        console.error('TTS synthesis failed:', error);
        throw error;
    }
}

// Usage example
synthesizeSpeech(
    'Hello, this is a test of our text-to-speech API.',
    'Sarah',
    'YOUR_API_TOKEN'
).then(result => {
    console.log('Success:', result);
}).catch(error => {
    console.error('Error:', error);
});

Available Voices

Browse our collection of 568 neural voices across 142 languages from multiple providers.

  • 568 total voices (neural quality)
  • 142 languages (global coverage)
  • 24kHz sample rate (high fidelity)

Voice Browser

Use the Voice Library to browse, filter, and test all available voices with audio previews.

  • af-ZA: 2 voices (sample: Adri, Willem)
  • am-ET: 2 voices (sample: Mekdes, Ameha)
  • ar-AE: 2 voices (sample: Fatima, Hamdan)
  • ar-BH: 2 voices (sample: Laila, Ali)
  • ar-DZ: 2 voices (sample: Amina, Ismael)
  • ar-EG: 2 voices (sample: Salma, Shakir)

Troubleshooting & Error Handling

HTTP Error Codes

| Code | Error Type | Description | Solution |
| --- | --- | --- | --- |
| 400 | Bad Request | Invalid SSML or request format | Check SSML syntax and headers |
| 401 | Unauthorized | Missing or invalid API token | Verify Authorization header |
| 402 | Payment Required | Insufficient character balance | Top up account or reduce text length |
| 422 | Unprocessable Entity | Validation errors | Check request parameters |
| 429 | Too Many Requests | Rate limit exceeded | Wait before making more requests |
| 500 | Internal Server Error | Service unavailable | Try again later or contact support |
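
One way to act on these codes in client code is sketched below. Treating only 429 and 500 as retryable, and the backoff base and cap values, are assumptions made for illustration, since the documentation does not state rate limits or a retry policy. `describeStatus` and `backoffDelayMs` are hypothetical helpers.

```javascript
// Statuses worth retrying, per the "wait" and "try again later" advice above.
const RETRYABLE_STATUSES = new Set([429, 500]);

// Advice strings taken from the error table.
const STATUS_ADVICE = {
    400: 'Check SSML syntax and headers',
    401: 'Verify Authorization header',
    402: 'Top up account or reduce text length',
    422: 'Check request parameters',
    429: 'Wait before making more requests',
    500: 'Try again later or contact support'
};

// Map an HTTP status to a retry decision and a human-readable hint.
function describeStatus(status) {
    return {
        retryable: RETRYABLE_STATUSES.has(status),
        advice: STATUS_ADVICE[status] || 'Unexpected status; inspect the response body'
    };
}

// Exponential backoff delay in milliseconds for a 0-based attempt number,
// capped at maxMs. The default values are assumptions, not documented limits.
function backoffDelayMs(attempt, baseMs = 500, maxMs = 8000) {
    return Math.min(maxMs, baseMs * 2 ** attempt);
}

console.log(describeStatus(429)); // { retryable: true, advice: 'Wait before making more requests' }
console.log(backoffDelayMs(3));   // 4000
```

A caller would check `describeStatus(response.status).retryable`, wait `backoffDelayMs(attempt)` milliseconds, and give up after a fixed number of attempts.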

Common Issues & Solutions

"Invalid SSML format"

  • Ensure SSML starts with <speak> and ends with </speak>
  • Check for properly closed tags
  • Validate XML syntax
  • Use proper Content-Type header for SSML
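
The first two checks in this list can be approximated before a request is sent. This is a rough sketch rather than a full XML parser: it ignores comments, CDATA, and attribute values that contain `>` or `/`, and `checkSsml` is a hypothetical helper.

```javascript
// Pre-flight check: the input must be wrapped in <speak>...</speak> and every
// opened tag must be closed in the reverse order it was opened.
function checkSsml(ssml) {
    const text = ssml.trim();
    if (!/^<speak[\s>]/.test(text) || !text.endsWith('</speak>')) {
        return { ok: false, reason: 'must start with <speak> and end with </speak>' };
    }
    const stack = [];
    // Captures: optional leading "/", the tag name, optional trailing "/".
    const tagPattern = /<(\/?)([A-Za-z][\w:-]*)[^>]*?(\/?)>/g;
    let match;
    while ((match = tagPattern.exec(text)) !== null) {
        const [, closing, name, selfClosing] = match;
        if (selfClosing) continue;            // e.g. <break time="500ms"/>
        if (!closing) {
            stack.push(name);                 // opening tag
        } else if (stack.pop() !== name) {
            return { ok: false, reason: `unexpected </${name}>` };
        }
    }
    return stack.length === 0
        ? { ok: true }
        : { ok: false, reason: `unclosed <${stack[stack.length - 1]}>` };
}

console.log(checkSsml('<speak version="1.0">Hi <break time="1s"/> there</speak>').ok); // true
console.log(checkSsml('<speak><emphasis>Hi</speak>').ok);                              // false
```

A server-side 400 is still the authoritative verdict. The check only catches the most common mistakes early.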

"Voice not found"

  • Use the /api/v1/voices endpoint to get valid voice names
  • Check voice name spelling and case sensitivity
  • Ensure the voice supports the target language
  • Try with a default voice like "Matthew" or "Sarah"

"Audio quality issues"

  • Use neural voices for best quality
  • Check text for special characters or formatting
  • Use SSML for pronunciation control
  • Try different voices for comparison

SSML Quick Reference

Common SSML Tags

<break time="500ms"/> - Add pause
<emphasis level="strong"> - Emphasize text
<prosody rate="slow"> - Control speed
<prosody pitch="high"> - Control pitch

Example Usage

<speak version="1.0">
  Hello <break time="1s"/>
  <emphasis level="strong">world</emphasis>
</speak>
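
When SSML is assembled from user-supplied text, characters such as `&` and `<` need escaping or the request will fail XML validation. A minimal sketch, using only the `<break>` tag from the reference above; `escapeXml` and `buildSsml` are hypothetical helper names.

```javascript
// Escape characters that are special in XML so plain text is safe inside SSML.
function escapeXml(text) {
    return text
        .replace(/&/g, '&amp;')
        .replace(/</g, '&lt;')
        .replace(/>/g, '&gt;')
        .replace(/"/g, '&quot;')
        .replace(/'/g, '&apos;');
}

// Join sentences with a pause between them and wrap the result in <speak>.
function buildSsml(sentences, pause = '500ms') {
    const body = sentences.map(s => escapeXml(s)).join(` <break time="${pause}"/> `);
    return `<speak version="1.0">${body}</speak>`;
}

console.log(buildSsml(['Tom & Jerry', 'Back at 5 < 6']));
// <speak version="1.0">Tom &amp; Jerry <break time="500ms"/> Back at 5 &lt; 6</speak>
```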

Need Help?

Our support team is here to help you integrate the TTS API successfully.

Voice Cloning API Documentation

Advanced AI-powered multilingual voice cloning technology

API v1.0

Voice Cloning API Overview

The Voice Cloning API provides state-of-the-art text-to-speech voice cloning capabilities powered by advanced AI technology. Clone any voice using a sample audio file and generate speech in multiple languages with remarkable accuracy and naturalness.

Key Features

  • 14 supported languages
  • High-quality voice cloning
  • Multiple audio formats
  • Voice enhancement options

Technical Specs

  • Max text: 3,000 characters
  • Max audio: 50MB
  • 2x character billing rate
  • S3 storage integration

API Endpoints

POST /api/v1/voice-cloning

Clone Voice

Clone a voice using provided text and speaker audio. Supports both file uploads and URL-based audio inputs.

Request Parameters

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| text | string | Yes | Text to synthesize (max 3000 characters) |
| language | string | Yes | Target language code (see supported languages) |
| speaker_audio | file | Yes* | Audio file for voice cloning (max 50MB) |
| speaker_audio_url | string | Yes* | URL to audio file for voice cloning |
| cleanup_voice | boolean | No | Apply denoising to speaker audio (default: true) |

*Either speaker_audio or speaker_audio_url is required.
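
These rules can be checked locally before uploading anything. The following is a sketch under the constraints stated in the table. `validateCloneRequest` and its camelCase option names are hypothetical and would need mapping to the real form fields (`speaker_audio`, `speaker_audio_url`).

```javascript
// Maximum text length from the parameter table.
const MAX_TEXT_LENGTH = 3000;

// Return a list of problems; an empty list means the request looks well formed.
function validateCloneRequest({ text, language, speakerAudioPath, speakerAudioUrl }) {
    const problems = [];
    if (typeof text !== 'string' || text.length === 0) {
        problems.push('text is required');
    } else if (text.length > MAX_TEXT_LENGTH) {
        problems.push(`text exceeds ${MAX_TEXT_LENGTH} characters`);
    }
    if (!language) {
        problems.push('language is required');
    }
    // Either a local file or a URL must be supplied as the speaker sample.
    if (!speakerAudioPath && !speakerAudioUrl) {
        problems.push('either speaker_audio or speaker_audio_url is required');
    }
    return problems;
}

console.log(validateCloneRequest({ text: 'Hello', language: 'en' }));
// [ 'either speaker_audio or speaker_audio_url is required' ]
```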

Example Request (cURL)
curl -X POST "https://api.verbatik.com/api/v1/voice-cloning" \
  -H "Authorization: Bearer YOUR_API_TOKEN" \
  -H "Content-Type: multipart/form-data" \
  -F "text=Hello, this is a test of voice cloning technology." \
  -F "language=en" \
  -F "speaker_audio=@/path/to/speaker.wav" \
  -F "cleanup_voice=true"
Example Response
{
  "success": true,
  "audio_url": "https://s3.eu-west-2.amazonaws.com/speak.verbatik.com/voice-cloning/uuid-filename.wav",
  "characters_used": 94,
  "remaining_balance": 9906,
  "language": "en"
}
GET /api/v1/voice-cloning/languages

Get Supported Languages

Retrieve the list of supported languages for voice cloning.

Example Request
curl -X GET "https://api.verbatik.com/api/v1/voice-cloning/languages" \
  -H "Authorization: Bearer YOUR_API_TOKEN"
Example Response
{
  "success": true,
  "languages": {
    "en": "English 🇺🇸",
    "fr": "French 🇫🇷",
    "de": "German 🇩🇪",
    "es": "Spanish 🇪🇸",
    "it": "Italian 🇮🇹",
    "pt": "Portuguese 🇵🇹",
    "cs": "Czech 🇨🇿",
    "pl": "Polish 🇵🇱",
    "ru": "Russian 🇷🇺",
    "nl": "Dutch 🇳🇱",
    "tr": "Turkish 🇹🇷",
    "ar": "Arabic 🇦🇪",
    "zh-cn": "Mandarin Chinese 🇨🇳",
    "hi": "Hindi 🇮🇳"
  }
}
GET /api/v1/voice-cloning/info

Get Service Information

Get detailed information about the voice cloning service capabilities and limitations.

Example Request
curl -X GET "https://api.verbatik.com/api/v1/voice-cloning/info" \
  -H "Authorization: Bearer YOUR_API_TOKEN"
Example Response
{
  "success": true,
  "service": "Verbatik Voice Cloning",
  "description": "Advanced AI-powered multilingual voice cloning technology",
  "supported_formats": ["wav", "mp3", "m4a", "ogg", "flv"],
  "max_text_length": 3000,
  "max_audio_size": "50MB",
  "character_rate": "2x (voice cloning uses double character rate)",
  "min_audio_duration": "6 seconds recommended",
  "languages": { ... }
}

Integration Examples

Ready-to-use code examples for popular programming languages and frameworks.

JavaScript/Node.js Example

const FormData = require('form-data');
const fs = require('fs');
// Requires node-fetch v2: v3 is ESM-only and cannot be loaded with require().
const fetch = require('node-fetch');

async function cloneVoice(text, audioFile, language, apiToken) {
    const formData = new FormData();
    formData.append('text', text);
    formData.append('language', language);
    formData.append('speaker_audio', fs.createReadStream(audioFile));
    formData.append('cleanup_voice', 'true');

    try {
        const response = await fetch('https://api.verbatik.com/api/v1/voice-cloning', {
            method: 'POST',
            headers: {
                'Authorization': `Bearer ${apiToken}`,
                ...formData.getHeaders()
            },
            body: formData
        });

        if (!response.ok) {
            throw new Error(`HTTP error! status: ${response.status}`);
        }

        const result = await response.json();
        console.log('Voice cloning successful:', result);
        return result;

    } catch (error) {
        console.error('Voice cloning failed:', error);
        throw error;
    }
}

// Usage example
cloneVoice(
    'Hello, this is a test of voice cloning technology.',
    '/path/to/speaker.wav',
    'en',
    'YOUR_API_TOKEN'
).then(result => {
    console.log('Audio URL:', result.audio_url);
}).catch(error => {
    console.error('Error:', error);
});

Supported Languages

The Voice Cloning API supports 14 languages for high-quality voice synthesis.

  • English 🇺🇸: en
  • French 🇫🇷: fr
  • German 🇩🇪: de
  • Spanish 🇪🇸: es
  • Italian 🇮🇹: it
  • Portuguese 🇵🇹: pt
  • Czech 🇨🇿: cs
  • Polish 🇵🇱: pl
  • Russian 🇷🇺: ru
  • Dutch 🇳🇱: nl
  • Turkish 🇹🇷: tr
  • Arabic 🇦🇪: ar
  • Mandarin Chinese 🇨🇳: zh-cn
  • Hindi 🇮🇳: hi

Troubleshooting & Error Handling

HTTP Error Codes

| Code | Error Type | Description | Solution |
| --- | --- | --- | --- |
| 400 | Bad Request | Invalid request parameters | Check parameter format and values |
| 401 | Unauthorized | Missing or invalid authentication | Verify API token in Authorization header |
| 402 | Payment Required | Insufficient character balance | Top up your account or reduce text length |
| 413 | Payload Too Large | Audio file exceeds size limit | Reduce file size to under 50MB |
| 422 | Unprocessable Entity | Validation errors | Check the errors field in response |
| 429 | Too Many Requests | Rate limit exceeded | Wait before making more requests |
| 500 | Internal Server Error | Service unavailable | Try again later or contact support |

Common Issues & Solutions

"Invalid speaker audio file"

  • Check file format (must be wav, mp3, m4a, ogg, or flv)
  • Ensure file size is under 50MB
  • Verify file is not corrupted
  • Make sure the file contains actual audio data
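
The format and size checks can be run locally before the upload. This sketch only inspects the file name and byte count, so it cannot detect corruption. It also assumes "50MB" means 50 × 1024 × 1024 bytes, which the documentation does not specify. `checkSpeakerAudio` is a hypothetical helper.

```javascript
// Formats listed in the /voice-cloning/info example response.
const SUPPORTED_FORMATS = ['wav', 'mp3', 'm4a', 'ogg', 'flv'];
// Assumption: the 50MB limit is measured in binary megabytes.
const MAX_AUDIO_BYTES = 50 * 1024 * 1024;

// Return null if the file looks acceptable, otherwise a short reason string.
function checkSpeakerAudio(fileName, sizeBytes) {
    const dot = fileName.lastIndexOf('.');
    const extension = dot === -1 ? '' : fileName.slice(dot + 1).toLowerCase();
    if (!SUPPORTED_FORMATS.includes(extension)) {
        return `unsupported format "${extension}"`;
    }
    if (sizeBytes === 0) return 'file is empty';
    if (sizeBytes > MAX_AUDIO_BYTES) return 'file exceeds 50MB';
    return null;
}

console.log(checkSpeakerAudio('speaker.WAV', 1024));  // null
console.log(checkSpeakerAudio('speaker.flac', 1024)); // unsupported format "flac"
```

In Node.js the size could come from `fs.statSync(path).size`.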

"Voice cloning timed out"

  • The service has a 5-minute timeout
  • Try with shorter text or smaller audio files
  • Check if the audio file is too large or complex
  • Retry the request after a few minutes
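
One way to follow the "shorter text" advice is to split long input into several requests, each under the 3,000-character limit. The sketch below breaks at sentence-ending punctuation where possible. `splitText` is a hypothetical helper, and each chunk would produce its own audio file that you would need to join yourself.

```javascript
// Split text into chunks of at most maxLength characters, preferring to break
// after sentence-ending punctuation (., !, ?).
function splitText(text, maxLength = 3000) {
    // Each match is one sentence with its trailing whitespace, or a final
    // fragment that has no closing punctuation.
    const sentences = text.match(/[^.!?]*[.!?]+\s*|[^.!?]+$/g) || [];
    const chunks = [];
    let current = '';
    for (const sentence of sentences) {
        if (current && current.length + sentence.length > maxLength) {
            chunks.push(current.trim());
            current = '';
        }
        // A single sentence longer than maxLength is cut at the limit.
        let rest = sentence;
        while (rest.length > maxLength) {
            chunks.push(rest.slice(0, maxLength));
            rest = rest.slice(maxLength);
        }
        current += rest;
    }
    if (current.trim()) chunks.push(current.trim());
    return chunks;
}

console.log(splitText('One. Two. Three.', 10)); // [ 'One. Two.', 'Three.' ]
```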

"Insufficient character balance"

  • Voice cloning uses 2x character rate
  • Check user's available character balance
  • Consider upgrading user's plan
  • Reduce text length to fit within available credits
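
A quick pre-check against the balance might look like the sketch below. It assumes the charge is exactly twice the JavaScript string length. The documentation states the 2x rate but not how characters are counted, so treat the result as an estimate. Both function names are hypothetical.

```javascript
// Voice cloning is billed at twice the normal character rate.
const VOICE_CLONING_MULTIPLIER = 2;

// Estimated characters billed for a cloning request.
// Assumption: the service counts characters the same way as String.length.
function cloningCharactersBilled(text) {
    return text.length * VOICE_CLONING_MULTIPLIER;
}

// True if the estimated charge fits within the remaining balance.
function canAffordCloning(text, remainingBalance) {
    return cloningCharactersBilled(text) <= remainingBalance;
}

console.log(cloningCharactersBilled('Hello'));           // 10
console.log(canAffordCloning('x'.repeat(3000), 5000));   // false
```

The `remaining_balance` field in the clone response can feed the second argument for the next request.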

"Unsupported language"

  • Use the /voice-cloning/languages endpoint to get supported languages
  • Ensure language code is lowercase (e.g., 'en', not 'EN')
  • Check for typos in the language code
  • Refer to the supported languages section above
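
The lowercase requirement is easy to enforce in client code. In this sketch the code list is copied from the example `/voice-cloning/languages` response. A real client would more likely fetch the list at startup. `normalizeLanguage` is a hypothetical helper.

```javascript
// Language codes from the example /voice-cloning/languages response.
const SUPPORTED_LANGUAGES = [
    'en', 'fr', 'de', 'es', 'it', 'pt', 'cs',
    'pl', 'ru', 'nl', 'tr', 'ar', 'zh-cn', 'hi'
];

// Trim and lowercase a user-supplied code; return null if it is not supported.
function normalizeLanguage(code, supported = SUPPORTED_LANGUAGES) {
    const normalized = String(code).trim().toLowerCase();
    return supported.includes(normalized) ? normalized : null;
}

console.log(normalizeLanguage('EN'));     // en
console.log(normalizeLanguage('ZH-CN'));  // zh-cn
console.log(normalizeLanguage('ja'));     // null
```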

Security Best Practices

API Token Security

Store API tokens securely and never expose them in client-side code

HTTPS Only

Always use HTTPS for API requests to ensure data encryption
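
Both practices can be enforced with a few lines on the server side. This is a sketch: `VERBATIK_API_TOKEN` is a hypothetical environment variable name, and `getApiToken` and `assertHttps` are illustrative helpers rather than part of the API.

```javascript
// Read the token from the environment instead of hard-coding it in source.
// VERBATIK_API_TOKEN is an assumed variable name; use your deployment's own.
function getApiToken(env = process.env) {
    const token = env.VERBATIK_API_TOKEN;
    if (!token) {
        throw new Error('VERBATIK_API_TOKEN is not set');
    }
    return token;
}

// Refuse to send credentials to anything other than an https:// URL.
function assertHttps(url) {
    if (new URL(url).protocol !== 'https:') {
        throw new Error(`Refusing non-HTTPS URL: ${url}`);
    }
    return url;
}

console.log(assertHttps('https://api.verbatik.com/api/v1/tts'));
// https://api.verbatik.com/api/v1/tts
```

Keeping the token on a backend and proxying browser requests through it avoids exposing the key in client-side code.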

Need Help?

Our support team is here to help you integrate the Voice Cloning API successfully.

Integrate with Your Favorite Tools

Connect Verbatik's powerful text-to-speech API with thousands of applications through the Zapier automation platform.

Instant Text-to-Speech Conversion

Turn text into natural-sounding speech effortlessly with our instant conversion tool

600+ Voices

Choose from more than 600 realistic AI voices for a wide range of text-to-speech conversions

Support for 142 Languages and Accents

Broad linguistic diversity, with support for 142 languages and accents

Voice Customization

Customize AI voices, including speed, pitch, volume, and pronunciation

MP3 and WAV Download Options

High-quality audio downloads in MP3 and WAV formats for versatile use

Commercial and Broadcast Rights

Commercial and broadcast rights available for wide audio distribution

How Verbatik Can Help You

AI Voices in Any Language and Accent in the World

Turn Your Articles into Engaging Audio

Testimonials

What Our Customers Say

Ricky G.

Content Creator

I liked that verbatik has some of the most realistic voices on the market. There is a large amount and many ways to customize all of the AI voices.

Juan Carlos V.

Media Production

I've found Verbatik to be a game-changer in the realm of text-to-speech. The platform is user-friendly, and the voice outputs are impressively lifelike. Whether I'm working on a presentation, video, or just experimenting, Verbatik has consistently delivered high-quality audio that adds value to my projects.

Boris S.

Private Tutor

I discovered VERBATIK when I was looking for a way to create voiceovers for multiple dialogues and monologues for personal use and it provided a good quality service for low price.

Zoe D.

Student

Verbatik is relatively easy to use and understand and gives the quick text-to-speech results I wanted. The audio was used to narrate an educational video and was easy to export!

Verified Reviewer

Broadcast Media

The broad range of AI voices and the ability to personalize the voice experience is very valuable to me as a content creator. I look forward to using it on a regular basis and exploring more tools that can optimise my content production process.

Aswin V.

Technical Associate

Verbatik software is a user-friendly software that easily converts text into speech. The accuracy of the transcription is really amazing. There are different customization options available. we can change the speaker. it supports different languages.

FAQ

Answers to All Your Questions

Try AI Voices

Create Your Account Today

Discover the power of realistic text-to-speech technology. Sign up now and transform your content with natural-sounding voices.