NEU! Stimmklonen jetzt verfügbar in 37 Sprachen
Alle anzeigen

Text-to-Speech
Ein API-Aufruf
Vereinfacht

Erstellen Sie Sprach-Apps, Hörbücher und barrierefreie Inhalte mit unserer Text-to-Speech API. 600+ natürliche Stimmen in 142 Sprachen für Entwickler und Unternehmen.

Ausgezeichnet für Exzellenz

Sehen Sie, was unsere Nutzer über Verbatik sagen

API-PREISE

Einfache, transparente Preise

Branchenführender Wert

Erhalten Sie den besten Wert der Branche mit unserer unkomplizierten Preisgestaltung. Perfekt für Unternehmen jeder Größe bietet unsere API Premium-Text-to-Speech-Konvertierung für nur 0,000025 $ pro Zeichen. Skalieren Sie Ihre Sprachinhalte ohne Ihr Budget zu sprengen.

  • 50,000 Characters per $1
  • High-Quality TTS Voices
  • Fast TTS Speed
  • Commercial Rights
  • Simple API Integration
  • 600 Voices 142 Languages

Pay-As-You-Go API

50,000 Zeichen/$1

oder 20 $ für 1M Zeichen

Für Entwickler

Warum Entwickler Verbatiks TTS API wählen

Überlegene KI-Sprach-API

Zugang zu 600+ natürlich klingenden Stimmen mit fortschrittlicher Sprach-KI-Technologie

Blitzschnelle Verarbeitung

Konvertieren Sie Millionen von Zeichen in Sekunden mit unserer optimierten Text-zu-Sprache-API

142 Sprachen & Dialekte

Erreichen Sie ein globales Publikum mit unserer umfassenden Text-zu-Audio-API

Einfache Integration

Implementieren Sie unsere Sprach-KI-API mit nur wenigen Codezeilen und klarer Dokumentation

Enterprise-Skalierbarkeit

Unsere Text-to-Speech-KI-API bewältigt alles von kleinen Projekten bis zu Enterprise-Anforderungen

Kommerzielle Nutzungsrechte

Volle Rechte zur Nutzung generierter Audiodateien in Ihren kommerziellen Produkten

So starten Sie mit unserer API

Schritt 1

Credits aufladen

Wählen Sie den richtigen Betrag für Ihre Bedürfnisse. Sie können 10.000 kostenlose Zeichen für die KI-Sprachgenerator-API erhalten, wenn Sie ein Meeting mit uns buchen.

Jetzt starten

Schritt 2

API-Schlüssel generieren

Greifen Sie auf Ihr Dashboard zu, um Ihren einzigartigen geheimen Schlüssel für die sichere Sprach-API-Authentifizierung zu erstellen.

Text-to-Speech generieren

Schritt 3

Sprache synthetisieren

Integrieren Sie die Text-zu-Audio-API mit nur wenigen Codezeilen und verwandeln Sie Ihren Text in natürliche Sprache.

Jetzt starten

Text-to-Speech API Documentation

Complete guide to integrating TTS API with 568 voices across multiple providers

API v1.0

VerbatikText-to-Speech API Overview

High-quality neural text-to-speech with multiple voices

The Verbatik Text-to-Speech API provides high-quality neural voice synthesis Convert text to natural-sounding speech with 568 voices across 142 languages.

Key Features

  • 568 neural voices
  • 142 supported languages
  • SSML support for advanced control
  • Multiple provider support

Technical Specs

  • MP3 output format
  • 24kHz sample rate
  • Character-based billing
  • S3 storage integration

Voice Library

568
High-Quality Neural Voices
142
Languages Supported
24kHz
Sample Rate

API Endpoints

POST /api/v1/tts

Synthesize Speech

Convert text to speech using neural voices. Supports both plain text and SSML input with optional audio storage.

Request Headers

HeaderRequiredDescription
AuthorizationYesBearer token for authentication
Content-TypeYestext/plain or application/ssml+xml
X-Voice-IDOptionalVoice to use (default: Matthew)
X-Store-AudioOptionalStore audio in S3 and return URL (true/false)
Example Request (Plain Text)
curl -X POST "https://api.verbatik.com/api/v1/tts" \
  -H "Authorization: Bearer YOUR_API_TOKEN" \
  -H "Content-Type: text/plain" \
  -H "X-Voice-ID: Sarah" \
  -H "X-Store-Audio: true" \
  -d "Hello, this is a test of our text-to-speech API."
Example Request (SSML)
curl -X POST "https://api.verbatik.com/api/v1/tts" \
  -H "Authorization: Bearer YOUR_API_TOKEN" \
  -H "Content-Type: application/ssml+xml" \
  -H "X-Voice-ID: Emma" \
  -H "X-Store-Audio: true" \
  -d '<speak version="1.0">
    Hello, <break time="500ms"/> this is a <emphasis level="strong">test</emphasis> 
    of our text-to-speech API with <prosody rate="slow">SSML support</prosody>.
</speak>'
Example Response (Stored Audio)
{
  "success": true,
  "audio_url": "https://s3.eu-west-2.amazonaws.com/speak.verbatik.com/audio/123/uuid-filename.mp3"
}
GET /api/v1/voices

Get Available Voices

Retrieve the list of all available voices with their details including name, gender, and language.

Example Request
curl -X GET "https://api.verbatik.com/api/v1/voices" \
  -H "Authorization: Bearer YOUR_API_TOKEN"
Example Response
[
  {
    "name": "Sarah",
    "gender": "Female",
    "language_code": "en-US"
  },
  {
    "name": "Matthew",
    "gender": "Male", 
    "language_code": "en-US"
  },
  {
    "name": "Emma",
    "gender": "Female",
    "language_code": "en-US"
  }
]

Integration Examples

Ready-to-use code examples for popular programming languages and frameworks.

JavaScript/Node.js Example

async function synthesizeSpeech(text, voice, apiToken, storeAudio = true) {
    try {
        const response = await fetch('https://api.verbatik.com/api/v1/tts', {
            method: 'POST',
            headers: {
                'Authorization': `Bearer ${apiToken}`,
                'Content-Type': 'text/plain',
                'X-Voice-ID': voice,
                'X-Store-Audio': storeAudio.toString()
            },
            body: text
        });

        if (!response.ok) {
            throw new Error(`HTTP error! status: ${response.status}`);
        }

        if (storeAudio) {
            const result = await response.json();
            console.log('Audio URL:', result.audio_url);
            return result;
        } else {
            // Direct audio stream
            const audioBlob = await response.blob();
            const audioUrl = URL.createObjectURL(audioBlob);
            return { audio_url: audioUrl };
        }

    } catch (error) {
        console.error('TTS synthesis failed:', error);
        throw error;
    }
}

// Usage example
synthesizeSpeech(
    'Hello, this is a test of our text-to-speech API.',
    'Sarah',
    'YOUR_API_TOKEN'
).then(result => {
    console.log('Success:', result);
}).catch(error => {
    console.error('Error:', error);
});

Available Voices

Browse our collection of 568 neural voices across 142 languages from multiple providers.

568
Total Voices
Neural Quality
142
Languages
Global Coverage
24kHz
Sample Rate
High Fidelity

Voice Browser

Use the Voice Library to browse, filter, and test all available voices with audio previews.

af-ZA 2 voices
Sample: Adri, Willem
am-ET 2 voices
Sample: Mekdes, Ameha
ar-AE 2 voices
Sample: Fatima, Hamdan
ar-BH 2 voices
Sample: Laila, Ali
ar-DZ 2 voices
Sample: Amina, Ismael
ar-EG 2 voices
Sample: Salma, Shakir

Troubleshooting & Error Handling

HTTP Error Codes

CodeError TypeDescriptionSolution
400Bad RequestInvalid SSML or request formatCheck SSML syntax and headers
401UnauthorizedMissing or invalid API tokenVerify Authorization header
402Payment RequiredInsufficient character balanceTop up account or reduce text length
422Unprocessable EntityValidation errorsCheck request parameters
429Too Many RequestsRate limit exceededWait before making more requests
500Internal Server ErrorService unavailableTry again later or contact support

Common Issues & Solutions

"Invalid SSML format"

  • Ensure SSML starts with <speak> and ends with </speak>
  • Check for properly closed tags
  • Validate XML syntax
  • Use proper Content-Type header for SSML

"Voice not found"

  • Use the /api/v1/voices endpoint to get valid voice names
  • Check voice name spelling and case sensitivity
  • Ensure the voice supports the target language
  • Try with a default voice like "Matthew" or "Sarah"

"Audio quality issues"

  • Use neural voices for best quality
  • Check text for special characters or formatting
  • Use SSML for pronunciation control
  • Try different voices for comparison

SSML Quick Reference

Common SSML Tags

<break time="500ms"/> - Add pause
<emphasis level="strong"> - Emphasize text
<prosody rate="slow"> - Control speed
<prosody pitch="high"> - Control pitch

Example Usage

<speak version="1.0">
  Hello <break time="1s"/>
  <emphasis level="strong">world</emphasis>
</speak>

Need Help?

Our support team is here to help you integrate the TTS API successfully.

Voice Cloning API Documentation

Advanced AI-powered multilingual voice cloning technology

API v1.0

Voice Cloning API Overview

Advanced AI-powered multilingual voice cloning technology

The Voice Cloning API provides state-of-the-art text-to-speech voice cloning capabilities powered by advanced AI technology. Clone any voice using a sample audio file and generate speech in multiple languages with remarkable accuracy and naturalness.

Key Features

  • 14 supported languages
  • High-quality voice cloning
  • Multiple audio formats
  • Voice enhancement options

Technical Specs

  • Max text: 3,000 characters
  • Max audio: 50MB
  • 2x character billing rate
  • S3 storage integration

API Endpoints

POST /api/v1/voice-cloning

Clone Voice

Clone a voice using provided text and speaker audio. Supports both file uploads and URL-based audio inputs.

Request Parameters

ParameterTypeRequiredDescription
textstringYesText to synthesize (max 3000 characters)
languagestringYesTarget language code (see supported languages)
speaker_audiofileYes*Audio file for voice cloning (max 50MB)
speaker_audio_urlstringYes*URL to audio file for voice cloning
cleanup_voicebooleanNoApply denoising to speaker audio (default: true)

*Either speaker_audio or speaker_audio_url is required.

Example Request (cURL)
curl -X POST "https://api.verbatik.com/api/v1/voice-cloning" \
  -H "Authorization: Bearer YOUR_API_TOKEN" \
  -H "Content-Type: multipart/form-data" \
  -F "text=Hello, this is a test of voice cloning technology." \
  -F "language=en" \
  -F "speaker_audio=@/path/to/speaker.wav" \
  -F "cleanup_voice=true"
Example Response
{
  "success": true,
  "audio_url": "https://s3.eu-west-2.amazonaws.com/speak.verbatik.com/voice-cloning/uuid-filename.wav",
  "characters_used": 94,
  "remaining_balance": 9906,
  "language": "en"
}
GET /api/v1/voice-cloning/languages

Get Supported Languages

Retrieve the list of supported languages for voice cloning.

Example Request
curl -X GET "https://api.verbatik.com/api/v1/voice-cloning/languages" \
  -H "Authorization: Bearer YOUR_API_TOKEN"
Example Response
{
  "success": true,
  "languages": {
    "en": "English 🇺🇸",
    "fr": "French 🇫🇷",
    "de": "German 🇩🇪",
    "es": "Spanish 🇪🇸",
    "it": "Italian 🇮🇹",
    "pt": "Portuguese 🇵🇹",
    "cs": "Czech 🇨🇿",
    "pl": "Polish 🇵🇱",
    "ru": "Russian 🇷🇺",
    "nl": "Dutch 🇳🇱",
    "tr": "Turkish 🇹🇷",
    "ar": "Arabic 🇦🇪",
    "zh-cn": "Mandarin Chinese 🇨🇳",
    "hi": "Hindi 🇮🇳"
  }
}
GET /api/v1/voice-cloning/info

Get Service Information

Get detailed information about the voice cloning service capabilities and limitations.

Example Request
curl -X GET "https://api.verbatik.com/api/v1/voice-cloning/info" \
  -H "Authorization: Bearer YOUR_API_TOKEN"
Example Response
{
  "success": true,
  "service": "Verbatik Voice Cloning",
  "description": "Advanced AI-powered multilingual voice cloning technology",
  "supported_formats": ["wav", "mp3", "m4a", "ogg", "flv"],
  "max_text_length": 3000,
  "max_audio_size": "50MB",
  "character_rate": "2x (voice cloning uses double character rate)",
  "min_audio_duration": "6 seconds recommended",
  "languages": { ... }
}

Integration Examples

Ready-to-use code examples for popular programming languages and frameworks.

JavaScript/Node.js Example

const FormData = require('form-data');
const fs = require('fs');
const fetch = require('node-fetch');

async function cloneVoice(text, audioFile, language, apiToken) {
    const formData = new FormData();
    formData.append('text', text);
    formData.append('language', language);
    formData.append('speaker_audio', fs.createReadStream(audioFile));
    formData.append('cleanup_voice', 'true');

    try {
        const response = await fetch('https://api.verbatik.com/api/v1/voice-cloning', {
            method: 'POST',
            headers: {
                'Authorization': `Bearer ${apiToken}`,
                ...formData.getHeaders()
            },
            body: formData
        });

        if (!response.ok) {
            throw new Error(`HTTP error! status: ${response.status}`);
        }

        const result = await response.json();
        console.log('Voice cloning successful:', result);
        return result;

    } catch (error) {
        console.error('Voice cloning failed:', error);
        throw error;
    }
}

// Usage example
cloneVoice(
    'Hello, this is a test of voice cloning technology.',
    '/path/to/speaker.wav',
    'en',
    'YOUR_API_TOKEN'
).then(result => {
    console.log('Audio URL:', result.audio_url);
}).catch(error => {
    console.error('Error:', error);
});

Supported Languages

The Voice Cloning API supports 14 languages for high-quality voice synthesis.

EN
English 🇺🇸
en
FR
French 🇫🇷
fr
DE
German 🇩🇪
de
ES
Spanish 🇪🇸
es
IT
Italian 🇮🇹
it
PT
Portuguese 🇵🇹
pt
CS
Czech 🇨🇿
cs
PL
Polish 🇵🇱
pl
RU
Russian 🇷🇺
ru
NL
Dutch 🇳🇱
nl
TR
Turkish 🇹🇷
tr
AR
Arabic 🇦🇪
ar
ZH-CN
Mandarin Chinese 🇨🇳
zh-cn
HI
Hindi 🇮🇳
hi

Troubleshooting & Error Handling

HTTP Error Codes

CodeError TypeDescriptionSolution
400Bad RequestInvalid request parametersCheck parameter format and values
401UnauthorizedMissing or invalid authenticationVerify API token in Authorization header
402Payment RequiredInsufficient character balanceTop up your account or reduce text length
413Payload Too LargeAudio file exceeds size limitReduce file size to under 50MB
422Unprocessable EntityValidation errorsCheck the errors field in response
429Too Many RequestsRate limit exceededWait before making more requests
500Internal Server ErrorService unavailableTry again later or contact support

Common Issues & Solutions

"Invalid speaker audio file"

  • Check file format (must be wav, mp3, m4a, ogg, or flv)
  • Ensure file size is under 50MB
  • Verify file is not corrupted
  • Make sure the file contains actual audio data

"Voice cloning timed out"

  • The service has a 5-minute timeout
  • Try with shorter text or smaller audio files
  • Check if the audio file is too large or complex
  • Retry the request after a few minutes

"Insufficient character balance"

  • Voice cloning uses 2x character rate
  • Check user's available character balance
  • Consider upgrading user's plan
  • Reduce text length to fit within available credits

"Unsupported language"

  • Use the /voice-cloning/languages endpoint to get supported languages
  • Ensure language code is lowercase (e.g., 'en', not 'EN')
  • Check for typos in the language code
  • Refer to the supported languages section above

Security Best Practices

API Token Security

Store API tokens securely and never expose them in client-side code

HTTPS Only

Always use HTTPS for API requests to ensure data encryption

Need Help?

Our support team is here to help you integrate the Voice Cloning API successfully.

Integration mit Ihren Lieblings-Tools

Verbinden Sie Verbatiks leistungsstarke Text-to-Speech API mit Tausenden von Apps über Zapiers Automatisierungsplattform.

Sofortige Umwandlung von Text in Sprache

Verwandeln Sie mühelos Text in natürlich klingende Sprache mit unserem innovativen Sofort-Konvertierungstool

Über 600+ Sprachstimmen

Wählen Sie aus über 600 realistischen KI-Stimmen für vielfältige Text-to-Speech-Konvertierungen

Unterstützt 142 Sprachen und Akzente

Breite sprachliche Vielfalt mit Unterstützung für 142 Sprachen und Akzente

Ermöglicht Anpassung der Stimmen

Anpassung von KI-Stimmen einschließlich Geschwindigkeit, Tonhöhe, Lautstärke und Aussprache

Download-Optionen in MP3- und WAV-Audioformaten

Audio-Downloads in hochwertigen MP3- und WAV-Formaten für vielseitige Nutzung

Kommerzielle und Rundfunkrechte

Kommerzielle und Rundfunkrechte für weitreichende Audioverbreitung verfügbar

Wie Verbatik Ihnen helfen kann

KI-Stimmen in jeder Sprache und jedem Akzent der Welt

Verwandeln Sie Ihre Artikel in fesselndes Audio

Kundenstimmen

Das sagen unsere Kunden

Ricky G.

Ricky G.

Content Creator

I liked that verbatik has some of the most realistic voices on the market. There is a large amount and many ways to customize all of the AI voices.

Juan Carlos V.

Juan Carlos V.

Media Production

I've found Verbatik to be a game-changer in the realm of text-to-speech. The platform is user-friendly, and the voice outputs are impressively lifelike. Whether I'm working on a presentation, video, or just experimenting, Verbatik has consistently delivered high-quality audio that adds value to my projects.

Boris S.

Boris S.

Private Tutor

I discovered VERBATIK when I was looking for a way to create voiceovers for multiple dialogues and monologues for personal use and it provided a good quality service for low price.

Zoe D.

Zoe D.

Student

Verbatik is relatively easy to use and understand and gives the quick text-to-speech results I wanted. The audio was used to narrate an educational video and was easy to export!

Verified Reviewer

Verified Reviewer

Broadcast Media

The broad range of AI voices and the ability to personalize the voice experience is very valuable to me as a content creator. I look forward to using it on a regular basis and exploring more tools that can optimise my content production process.

Aswin V.

Aswin V.

Technical Associate

Verbatik software is a user-friendly software that easily converts text into speech. The accuracy of the transcription is really amazing. There are different customization options available. we can change the speaker. it supports different languages.

FAQ

Antworten auf alle Ihre Fragen

Erleben Sie KI-gestützte Stimmen

Erstellen Sie noch heute Ihr Konto

Entdecken Sie die Kraft lebensechter Text-to-Speech-Technologie. Registrieren Sie sich jetzt und verwandeln Sie Ihre Inhalte mit natürlich klingenden Stimmen.