NEW! Voice Cloning now available in 37 languages

Text to Speech Simplified: One API Call

Build voice apps, audiobooks, and accessible content with our text-to-speech API. 600+ natural voices in 142 languages for developers and businesses.

Award-Winning Excellence

See what our users say about Verbatik

API PRICING

Simple, Transparent Pricing

Industry-Leading Value

Get the best value in the industry with our straightforward pricing. Suited to businesses of any size, our API offers premium text-to-speech conversion for just $0.00002 per character (50,000 characters per $1). Scale your voice content without blowing your budget.

  • 50,000 Characters per $1
  • High-Quality TTS Voices
  • Fast TTS Speed
  • Commercial Rights
  • Simple API Integration
  • 600+ Voices, 142 Languages

API Pay-As-You-Go

50,000 characters per $1

or $20 per 1M characters
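The pricing arithmetic above can be checked with a few lines of code. This is a minimal sketch, not part of the Verbatik API: it assumes the listed rate of 50,000 characters per $1, and the helper name `estimateCostUsd` and its `rateMultiplier` parameter are illustrative (a multiplier of 2 corresponds to the "2x character billing rate" listed in the Voice Cloning technical specs).

```javascript
// Assumption: the pay-as-you-go rate listed on this page (50,000 characters per $1).
const CHARACTERS_PER_DOLLAR = 50000;

// Hypothetical helper: estimate the cost in USD for a number of characters.
// rateMultiplier is 1 for standard TTS; the docs list a 2x rate for voice cloning.
function estimateCostUsd(characterCount, rateMultiplier = 1) {
    if (!Number.isInteger(characterCount) || characterCount < 0) {
        throw new RangeError('characterCount must be a non-negative integer');
    }
    return (characterCount * rateMultiplier) / CHARACTERS_PER_DOLLAR;
}

console.log(estimateCostUsd(1000000)); // 20
console.log(estimateCostUsd(50000));   // 1
console.log(estimateCostUsd(3000, 2)); // 0.12
```

Treat the result as an estimate only; the balance shown in your dashboard is authoritative.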

For Developers

Why Developers Choose Verbatik's TTS API

Superior AI Voice API

Access 600+ natural-sounding voices built on advanced AI voice technology

Ultra-Fast Processing

Convert millions of characters in seconds with our optimized text-to-speech API

142 Languages and Dialects

Reach a global audience with our comprehensive text-to-audio API

Easy Integration

Implement our AI voice API with just a few lines of code and clear documentation

Enterprise Scalability

Our AI text-to-speech API handles everything from small projects to enterprise workloads

Commercial Usage Rights

Full rights to use generated audio files in your commercial products

How to Get Started with Our API

Step 1

Add Credits

Choose the amount that fits your needs. You can get 10,000 free characters for the AI voice generator API by booking a meeting with us.

Step 2

Generate an API Key

Go to your dashboard to create your unique secret key for secure voice API authentication.

Step 3

Synthesize Speech

Integrate the text-to-audio API with just a few lines of code and turn your text into natural-sounding speech.

Text-to-Speech API Documentation

Complete guide to integrating the TTS API, with 568 voices across multiple providers

API v1.0

Verbatik Text-to-Speech API Overview

High-quality neural text-to-speech with multiple voices

The Verbatik Text-to-Speech API provides high-quality neural voice synthesis. Convert text to natural-sounding speech with 568 voices across 142 languages.

Key Features

  • 568 neural voices
  • 142 supported languages
  • SSML support for advanced control
  • Multiple provider support

Technical Specs

  • MP3 output format
  • 24kHz sample rate
  • Character-based billing
  • S3 storage integration

Voice Library

  • 568 high-quality neural voices
  • 142 languages supported
  • 24kHz sample rate

API Endpoints

POST /api/v1/tts

Synthesize Speech

Convert text to speech using neural voices. Supports both plain text and SSML input with optional audio storage.

Request Headers

| Header | Required | Description |
| --- | --- | --- |
| Authorization | Yes | Bearer token for authentication |
| Content-Type | Yes | text/plain or application/ssml+xml |
| X-Voice-ID | Optional | Voice to use (default: Matthew) |
| X-Store-Audio | Optional | Store audio in S3 and return URL (true/false) |

Example Request (Plain Text)

curl -X POST "https://api.verbatik.com/api/v1/tts" \
  -H "Authorization: Bearer YOUR_API_TOKEN" \
  -H "Content-Type: text/plain" \
  -H "X-Voice-ID: Sarah" \
  -H "X-Store-Audio: true" \
  -d "Hello, this is a test of our text-to-speech API."

Example Request (SSML)

curl -X POST "https://api.verbatik.com/api/v1/tts" \
  -H "Authorization: Bearer YOUR_API_TOKEN" \
  -H "Content-Type: application/ssml+xml" \
  -H "X-Voice-ID: Emma" \
  -H "X-Store-Audio: true" \
  -d '<speak version="1.0">
    Hello, <break time="500ms"/> this is a <emphasis level="strong">test</emphasis> 
    of our text-to-speech API with <prosody rate="slow">SSML support</prosody>.
</speak>'

Example Response (Stored Audio)

{
  "success": true,
  "audio_url": "https://s3.eu-west-2.amazonaws.com/speak.verbatik.com/audio/123/uuid-filename.mp3"
}
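The two request variants above differ only in their headers, so the choice can be made in code. The following is a rough sketch rather than an official client: the helper name `buildTtsRequest` is hypothetical, and treating any input that starts with `<speak` as SSML is an assumption of this example.

```javascript
const TTS_URL = 'https://api.verbatik.com/api/v1/tts';

// Hypothetical helper: build the URL and fetch options for a TTS request.
// Assumption: input that starts with "<speak" is SSML; everything else is plain text.
function buildTtsRequest(text, apiToken, { voice, storeAudio = false } = {}) {
    const isSsml = text.trimStart().startsWith('<speak');
    const headers = {
        'Authorization': `Bearer ${apiToken}`,
        'Content-Type': isSsml ? 'application/ssml+xml' : 'text/plain',
        'X-Store-Audio': String(storeAudio)
    };
    // X-Voice-ID is optional; omit it to fall back to the documented default voice.
    if (voice) {
        headers['X-Voice-ID'] = voice;
    }
    return { url: TTS_URL, options: { method: 'POST', headers, body: text } };
}

const request = buildTtsRequest(
    '<speak version="1.0">Hello</speak>',
    'YOUR_API_TOKEN',
    { voice: 'Emma', storeAudio: true }
);
console.log(request.options.headers['Content-Type']); // application/ssml+xml
```

The returned object can be passed straight to `fetch(request.url, request.options)`.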

GET /api/v1/voices

Get Available Voices

Retrieve the list of all available voices with their details including name, gender, and language.

Example Request

curl -X GET "https://api.verbatik.com/api/v1/voices" \
  -H "Authorization: Bearer YOUR_API_TOKEN"

Example Response

[
  {
    "name": "Sarah",
    "gender": "Female",
    "language_code": "en-US"
  },
  {
    "name": "Matthew",
    "gender": "Male", 
    "language_code": "en-US"
  },
  {
    "name": "Emma",
    "gender": "Female",
    "language_code": "en-US"
  }
]
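Because the response is a flat array, picking a voice for a given language usually means indexing it first. A small sketch, assuming the response shape shown above; `groupVoicesByLanguage` is an illustrative name, and the `fr-FR` entry in the sample data is made up for the example.

```javascript
// Hypothetical helper: index the /api/v1/voices response by language code.
function groupVoicesByLanguage(voices) {
    const byLanguage = new Map();
    for (const voice of voices) {
        const names = byLanguage.get(voice.language_code) || [];
        names.push(voice.name);
        byLanguage.set(voice.language_code, names);
    }
    return byLanguage;
}

const sampleVoices = [
    { name: 'Sarah', gender: 'Female', language_code: 'en-US' },
    { name: 'Matthew', gender: 'Male', language_code: 'en-US' },
    { name: 'Emma', gender: 'Female', language_code: 'en-US' },
    { name: 'ExampleVoice', gender: 'Female', language_code: 'fr-FR' } // made-up entry
];

const grouped = groupVoicesByLanguage(sampleVoices);
console.log(grouped.get('en-US')); // [ 'Sarah', 'Matthew', 'Emma' ]
```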

Integration Examples

Ready-to-use code examples for popular programming languages and frameworks.

JavaScript/Node.js Example

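// Synthesize speech from plain text. When storeAudio is true the API returns
// JSON containing an audio_url; otherwise the response body is the audio itself.
async function synthesizeSpeech(text, voice, apiToken, storeAudio = true) {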
    try {
        const response = await fetch('https://api.verbatik.com/api/v1/tts', {
            method: 'POST',
            headers: {
                'Authorization': `Bearer ${apiToken}`,
                'Content-Type': 'text/plain',
                'X-Voice-ID': voice,
                'X-Store-Audio': storeAudio.toString()
            },
            body: text
        });

        if (!response.ok) {
            throw new Error(`HTTP error! status: ${response.status}`);
        }

        if (storeAudio) {
            const result = await response.json();
            console.log('Audio URL:', result.audio_url);
            return result;
        } else {
            // Direct audio stream
            const audioBlob = await response.blob();
            const audioUrl = URL.createObjectURL(audioBlob);
            return { audio_url: audioUrl };
        }

    } catch (error) {
        console.error('TTS synthesis failed:', error);
        throw error;
    }
}

// Usage example
synthesizeSpeech(
    'Hello, this is a test of our text-to-speech API.',
    'Sarah',
    'YOUR_API_TOKEN'
).then(result => {
    console.log('Success:', result);
}).catch(error => {
    console.error('Error:', error);
});

Available Voices

Browse our collection of 568 neural voices across 142 languages from multiple providers.

Voice Browser

Use the Voice Library to browse, filter, and test all available voices with audio previews.

  • af-ZA (2 voices): Adri, Willem
  • am-ET (2 voices): Mekdes, Ameha
  • ar-AE (2 voices): Fatima, Hamdan
  • ar-BH (2 voices): Laila, Ali
  • ar-DZ (2 voices): Amina, Ismael
  • ar-EG (2 voices): Salma, Shakir

Troubleshooting & Error Handling

HTTP Error Codes

| Code | Error Type | Description | Solution |
| --- | --- | --- | --- |
| 400 | Bad Request | Invalid SSML or request format | Check SSML syntax and headers |
| 401 | Unauthorized | Missing or invalid API token | Verify Authorization header |
| 402 | Payment Required | Insufficient character balance | Top up account or reduce text length |
| 422 | Unprocessable Entity | Validation errors | Check request parameters |
| 429 | Too Many Requests | Rate limit exceeded | Wait before making more requests |
| 500 | Internal Server Error | Service unavailable | Try again later or contact support |
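One way to act on these status codes in client code is sketched below. It is an illustration under assumptions, not documented API behavior: treating 429 and 500 as retryable, the helper names `classifyError` and `backoffDelayMs`, and the backoff base and cap values are all choices made for this example.

```javascript
// Assumption: only rate-limit and server errors are worth retrying.
const RETRYABLE_STATUS = new Set([429, 500]);

// Suggested actions, copied from the "Solution" column of the table above.
const SOLUTIONS = {
    400: 'Check SSML syntax and headers',
    401: 'Verify Authorization header',
    402: 'Top up account or reduce text length',
    422: 'Check request parameters',
    429: 'Wait before making more requests',
    500: 'Try again later or contact support'
};

// Map an HTTP status to a retry decision and a suggested action.
function classifyError(status) {
    return {
        status,
        retryable: RETRYABLE_STATUS.has(status),
        action: SOLUTIONS[status] || 'Unexpected status; contact support'
    };
}

// Exponential backoff delay in milliseconds for a 0-based retry attempt,
// capped at maxMs. The default base and cap are illustrative values.
function backoffDelayMs(attempt, baseMs = 1000, maxMs = 30000) {
    return Math.min(maxMs, baseMs * 2 ** attempt);
}

console.log(classifyError(429).retryable); // true
console.log(backoffDelayMs(3));            // 8000
```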

Common Issues & Solutions

"Invalid SSML format"

  • Ensure SSML starts with <speak> and ends with </speak>
  • Check for properly closed tags
  • Validate XML syntax
  • Use proper Content-Type header for SSML

"Voice not found"

  • Use the /api/v1/voices endpoint to get valid voice names
  • Check voice name spelling and case sensitivity
  • Ensure the voice supports the target language
  • Try with a default voice like "Matthew" or "Sarah"

"Audio quality issues"

  • Use neural voices for best quality
  • Check text for special characters or formatting
  • Use SSML for pronunciation control
  • Try different voices for comparison

SSML Quick Reference

Common SSML Tags

<break time="500ms"/> - Add pause
<emphasis level="strong"> - Emphasize text
<prosody rate="slow"> - Control speed
<prosody pitch="high"> - Control pitch

Example Usage

<speak version="1.0">
  Hello <break time="1s"/>
  <emphasis level="strong">world</emphasis>
</speak>
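When SSML is assembled from user-supplied text, characters such as `&` and `<` have to be escaped or the document stops being valid XML, which is a common cause of the "Invalid SSML format" error. A minimal sketch using only the tags listed above; `escapeXml` and `buildSsml` are illustrative helper names, not part of the API.

```javascript
// Escape characters that are special in XML so plain text cannot break the SSML document.
function escapeXml(text) {
    return text
        .replace(/&/g, '&amp;')
        .replace(/</g, '&lt;')
        .replace(/>/g, '&gt;')
        .replace(/"/g, '&quot;')
        .replace(/'/g, '&apos;');
}

// Hypothetical helper: wrap sentences in <speak>, separated by a pause.
function buildSsml(sentences, pause = '500ms') {
    const body = sentences.map(escapeXml).join(` <break time="${pause}"/> `);
    return `<speak version="1.0">${body}</speak>`;
}

console.log(buildSsml(['Tom & Jerry', 'Back at 5 < 6']));
// <speak version="1.0">Tom &amp; Jerry <break time="500ms"/> Back at 5 &lt; 6</speak>
```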

Need Help?

Our support team is here to help you integrate the TTS API successfully.

Voice Cloning API Documentation

Advanced AI-powered multilingual voice cloning technology

API v1.0

Voice Cloning API Overview

The Voice Cloning API provides state-of-the-art text-to-speech voice cloning capabilities powered by advanced AI technology. Clone any voice using a sample audio file and generate speech in multiple languages with remarkable accuracy and naturalness.

Key Features

  • 14 supported languages
  • High-quality voice cloning
  • Multiple audio formats
  • Voice enhancement options

Technical Specs

  • Max text: 3,000 characters
  • Max audio: 50MB
  • 2x character billing rate
  • S3 storage integration

API Endpoints

POST /api/v1/voice-cloning

Clone Voice

Clone a voice using provided text and speaker audio. Supports both file uploads and URL-based audio inputs.

Request Parameters

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| text | string | Yes | Text to synthesize (max 3,000 characters) |
| language | string | Yes | Target language code (see supported languages) |
| speaker_audio | file | Yes* | Audio file for voice cloning (max 50MB) |
| speaker_audio_url | string | Yes* | URL to audio file for voice cloning |
| cleanup_voice | boolean | No | Apply denoising to speaker audio (default: true) |

*Either speaker_audio or speaker_audio_url is required.

Example Request (cURL)
curl -X POST "https://api.verbatik.com/api/v1/voice-cloning" \
  -H "Authorization: Bearer YOUR_API_TOKEN" \
  -H "Content-Type: multipart/form-data" \
  -F "text=Hello, this is a test of voice cloning technology." \
  -F "language=en" \
  -F "speaker_audio=@/path/to/speaker.wav" \
  -F "cleanup_voice=true"

Example Response

{
  "success": true,
  "audio_url": "https://s3.eu-west-2.amazonaws.com/speak.verbatik.com/voice-cloning/uuid-filename.wav",
  "characters_used": 94,
  "remaining_balance": 9906,
  "language": "en"
}
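Since every request is billed and limited by the constraints in the parameter table, it can help to check inputs locally before uploading audio. The sketch below is a hypothetical pre-flight check, not server-side validation logic: the function name and the camelCase option names are illustrative, and it measures length with JavaScript's `text.length`, which may not match how the service counts characters.

```javascript
const MAX_TEXT_LENGTH = 3000; // limit taken from the parameter table

// Hypothetical pre-flight check; returns a list of problems
// (an empty list means the request looks valid).
function validateCloneRequest({ text, language, speakerAudioPath, speakerAudioUrl }) {
    const problems = [];
    if (typeof text !== 'string' || text.length === 0) {
        problems.push('text is required');
    } else if (text.length > MAX_TEXT_LENGTH) {
        problems.push(`text exceeds ${MAX_TEXT_LENGTH} characters`);
    }
    if (!language) {
        problems.push('language is required');
    }
    // The docs require either speaker_audio (file) or speaker_audio_url.
    if (!speakerAudioPath && !speakerAudioUrl) {
        problems.push('either speaker_audio or speaker_audio_url is required');
    }
    return problems;
}

console.log(validateCloneRequest({ text: 'Hello', language: 'en' }));
// [ 'either speaker_audio or speaker_audio_url is required' ]
```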

GET /api/v1/voice-cloning/languages

Get Supported Languages

Retrieve the list of supported languages for voice cloning.

Example Request

curl -X GET "https://api.verbatik.com/api/v1/voice-cloning/languages" \
  -H "Authorization: Bearer YOUR_API_TOKEN"

Example Response

{
  "success": true,
  "languages": {
    "en": "English 🇺🇸",
    "fr": "French 🇫🇷",
    "de": "German 🇩🇪",
    "es": "Spanish 🇪🇸",
    "it": "Italian 🇮🇹",
    "pt": "Portuguese 🇵🇹",
    "cs": "Czech 🇨🇿",
    "pl": "Polish 🇵🇱",
    "ru": "Russian 🇷🇺",
    "nl": "Dutch 🇳🇱",
    "tr": "Turkish 🇹🇷",
    "ar": "Arabic 🇦🇪",
    "zh-cn": "Mandarin Chinese 🇨🇳",
    "hi": "Hindi 🇮🇳"
  }
}

GET /api/v1/voice-cloning/info

Get Service Information

Get detailed information about the voice cloning service capabilities and limitations.

Example Request

curl -X GET "https://api.verbatik.com/api/v1/voice-cloning/info" \
  -H "Authorization: Bearer YOUR_API_TOKEN"

Example Response

{
  "success": true,
  "service": "Verbatik Voice Cloning",
  "description": "Advanced AI-powered multilingual voice cloning technology",
  "supported_formats": ["wav", "mp3", "m4a", "ogg", "flv"],
  "max_text_length": 3000,
  "max_audio_size": "50MB",
  "character_rate": "2x (voice cloning uses double character rate)",
  "min_audio_duration": "6 seconds recommended",
  "languages": { ... }
}

Integration Examples

Ready-to-use code examples for popular programming languages and frameworks.

JavaScript/Node.js Example

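// Requires the form-data and node-fetch packages. require('node-fetch') works
// with node-fetch v2; v3 is ESM-only and must be loaded with import instead.
const FormData = require('form-data');
const fs = require('fs');
const fetch = require('node-fetch');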

async function cloneVoice(text, audioFile, language, apiToken) {
    const formData = new FormData();
    formData.append('text', text);
    formData.append('language', language);
    formData.append('speaker_audio', fs.createReadStream(audioFile));
    formData.append('cleanup_voice', 'true');

    try {
        const response = await fetch('https://api.verbatik.com/api/v1/voice-cloning', {
            method: 'POST',
            headers: {
                'Authorization': `Bearer ${apiToken}`,
                ...formData.getHeaders()
            },
            body: formData
        });

        if (!response.ok) {
            throw new Error(`HTTP error! status: ${response.status}`);
        }

        const result = await response.json();
        console.log('Voice cloning successful:', result);
        return result;

    } catch (error) {
        console.error('Voice cloning failed:', error);
        throw error;
    }
}

// Usage example
cloneVoice(
    'Hello, this is a test of voice cloning technology.',
    '/path/to/speaker.wav',
    'en',
    'YOUR_API_TOKEN'
).then(result => {
    console.log('Audio URL:', result.audio_url);
}).catch(error => {
    console.error('Error:', error);
});

Supported Languages

The Voice Cloning API supports 14 languages for high-quality voice synthesis.

  • English 🇺🇸 (en)
  • French 🇫🇷 (fr)
  • German 🇩🇪 (de)
  • Spanish 🇪🇸 (es)
  • Italian 🇮🇹 (it)
  • Portuguese 🇵🇹 (pt)
  • Czech 🇨🇿 (cs)
  • Polish 🇵🇱 (pl)
  • Russian 🇷🇺 (ru)
  • Dutch 🇳🇱 (nl)
  • Turkish 🇹🇷 (tr)
  • Arabic 🇦🇪 (ar)
  • Mandarin Chinese 🇨🇳 (zh-cn)
  • Hindi 🇮🇳 (hi)

Troubleshooting & Error Handling

HTTP Error Codes

| Code | Error Type | Description | Solution |
| --- | --- | --- | --- |
| 400 | Bad Request | Invalid request parameters | Check parameter format and values |
| 401 | Unauthorized | Missing or invalid authentication | Verify API token in Authorization header |
| 402 | Payment Required | Insufficient character balance | Top up your account or reduce text length |
| 413 | Payload Too Large | Audio file exceeds size limit | Reduce file size to under 50MB |
| 422 | Unprocessable Entity | Validation errors | Check the errors field in response |
| 429 | Too Many Requests | Rate limit exceeded | Wait before making more requests |
| 500 | Internal Server Error | Service unavailable | Try again later or contact support |

Common Issues & Solutions

"Invalid speaker audio file"

  • Check file format (must be wav, mp3, m4a, ogg, or flv)
  • Ensure file size is under 50MB
  • Verify file is not corrupted
  • Make sure the file contains actual audio data

"Voice cloning timed out"

  • The service has a 5-minute timeout
  • Try with shorter text or smaller audio files
  • Check if the audio file is too large or complex
  • Retry the request after a few minutes

"Insufficient character balance"

  • Voice cloning uses 2x character rate
  • Check user's available character balance
  • Consider upgrading user's plan
  • Reduce text length to fit within available credits
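A rough local check against the 2x rate can avoid a wasted request. This sketch assumes billed characters equal twice the JavaScript string length, which is only an approximation: how the service counts characters is not specified here, so the `characters_used` and `remaining_balance` fields in the API response remain the source of truth. Both function names are illustrative.

```javascript
const CLONING_RATE_MULTIPLIER = 2; // "2x character rate" from the docs

// Assumption: billed characters = text length x 2 (an estimate, not the
// service's exact counting rule).
function charactersNeededForCloning(text) {
    return text.length * CLONING_RATE_MULTIPLIER;
}

// Returns true when the estimated charge fits within the remaining balance.
function canAffordCloning(text, remainingBalance) {
    return charactersNeededForCloning(text) <= remainingBalance;
}

console.log(charactersNeededForCloning('Hello')); // 10
console.log(canAffordCloning('Hello', 9));        // false
```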

"Unsupported language"

  • Use the /api/v1/voice-cloning/languages endpoint to get supported languages
  • Ensure language code is lowercase (e.g., 'en', not 'EN')
  • Check for typos in the language code
  • Refer to the supported languages section above
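The lowercase and typo checks above can be applied before a request is sent. A minimal sketch: the code set is copied from the example response of the languages endpoint and may drift from the live list, so fetching it at runtime is safer. `normalizeLanguageCode` is an illustrative name.

```javascript
// Codes copied from the example /api/v1/voice-cloning/languages response.
const SUPPORTED_CLONING_LANGUAGES = new Set([
    'en', 'fr', 'de', 'es', 'it', 'pt', 'cs', 'pl', 'ru', 'nl', 'tr', 'ar', 'zh-cn', 'hi'
]);

// Trim and lowercase a language code, then reject anything not in the set.
function normalizeLanguageCode(code) {
    const normalized = String(code).trim().toLowerCase();
    if (!SUPPORTED_CLONING_LANGUAGES.has(normalized)) {
        throw new Error(`Unsupported language code: ${code}`);
    }
    return normalized;
}

console.log(normalizeLanguageCode('EN'));     // en
console.log(normalizeLanguageCode(' ZH-CN ')); // zh-cn
```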

Security Best Practices

API Token Security

Store API tokens securely and never expose them in client-side code

HTTPS Only

Always use HTTPS for API requests to ensure data encryption
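For the token-security advice, a common server-side pattern is to read the token from the environment instead of hard-coding it. The sketch below is one possible approach; the variable name `VERBATIK_API_TOKEN` is hypothetical and not defined by the API.

```javascript
// Hypothetical variable name; use whatever your deployment defines.
const TOKEN_ENV_VAR = 'VERBATIK_API_TOKEN';

// Build the Authorization header from the environment, failing fast when
// the token is missing so a request is never sent unauthenticated.
function getAuthHeaders(env = process.env) {
    const token = env[TOKEN_ENV_VAR];
    if (!token) {
        throw new Error(`${TOKEN_ENV_VAR} is not set`);
    }
    return { 'Authorization': `Bearer ${token}` };
}

console.log(getAuthHeaders({ VERBATIK_API_TOKEN: 'example-token' }));
// { Authorization: 'Bearer example-token' }
```

Keeping this code on the server means the token never ships in a browser bundle.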

Need Help?

Our support team is here to help you integrate the Voice Cloning API successfully.

Integrate with Your Favorite Tools

Connect Verbatik's powerful text-to-speech API to thousands of apps through the Zapier automation platform.

Instant Text-to-Speech Conversion

Effortlessly turn text into natural-sounding speech with our instant conversion tool

600+ Voices

Choose from more than 600 realistic AI voices for varied text-to-speech conversions

Support for 142 Languages and Accents

Broad linguistic coverage with support for 142 languages and accents

Voice Customization

Customize AI voices, including speed, pitch, volume, and pronunciation

MP3 and WAV Download Options

High-quality audio downloads in MP3 and WAV formats for versatile use

Commercial and Broadcast Rights

Commercial and broadcast rights available for wide audio distribution

How Verbatik Can Help You

AI Voices in Any Language and Accent in the World

Turn Your Articles into Engaging Audio

Testimonials

What Our Customers Say

Ricky G.

Content Creator

I liked that verbatik has some of the most realistic voices on the market. There is a large amount and many ways to customize all of the AI voices.

Juan Carlos V.

Media Production

I've found Verbatik to be a game-changer in the realm of text-to-speech. The platform is user-friendly, and the voice outputs are impressively lifelike. Whether I'm working on a presentation, video, or just experimenting, Verbatik has consistently delivered high-quality audio that adds value to my projects.

Boris S.

Private Tutor

I discovered VERBATIK when I was looking for a way to create voiceovers for multiple dialogues and monologues for personal use and it provided a good quality service for low price.

Zoe D.

Student

Verbatik is relatively easy to use and understand and gives the quick text-to-speech results I wanted. The audio was used to narrate an educational video and was easy to export!

Verified Reviewer

Broadcast Media

The broad range of AI voices and the ability to personalize the voice experience is very valuable to me as a content creator. I look forward to using it on a regular basis and exploring more tools that can optimise my content production process.

Aswin V.

Technical Associate

Verbatik software is a user-friendly software that easily converts text into speech. The accuracy of the transcription is really amazing. There are different customization options available. we can change the speaker. it supports different languages.

FAQ

Answers to All Your Questions

Try AI Voices

Create Your Account Today

Discover the power of realistic text-to-speech technology. Sign up now and transform your content with natural-sounding voices.