Text-to-Speech
Ein API-Aufruf
Vereinfacht
Erstellen Sie Sprach-Apps, Hörbücher und barrierefreie Inhalte mit unserer Text-to-Speech API. 600+ natürliche Stimmen in 142 Sprachen für Entwickler und Unternehmen.
API-PREISE
Einfache, transparente Preise
Branchenführender Wert
Erhalten Sie den besten Wert der Branche mit unserer unkomplizierten Preisgestaltung. Perfekt für Unternehmen jeder Größe bietet unsere API Premium-Text-to-Speech-Konvertierung für nur 0,000025 $ pro Zeichen. Skalieren Sie Ihre Sprachinhalte ohne Ihr Budget zu sprengen.
- 50,000 Characters per $1
- High-Quality TTS Voices
- Fast TTS Speed
- Commercial Rights
- Simple API Integration
- 600 Voices 142 Languages
Für Entwickler
Warum Entwickler Verbatiks TTS API wählen
Überlegene KI-Sprach-API
Zugang zu 600+ natürlich klingenden Stimmen mit fortschrittlicher Sprach-KI-Technologie
Blitzschnelle Verarbeitung
Konvertieren Sie Millionen von Zeichen in Sekunden mit unserer optimierten Text-zu-Sprache-API
142 Sprachen & Dialekte
Erreichen Sie ein globales Publikum mit unserer umfassenden Text-zu-Audio-API
Einfache Integration
Implementieren Sie unsere Sprach-KI-API mit nur wenigen Codezeilen und klarer Dokumentation
Enterprise-Skalierbarkeit
Unsere Text-to-Speech-KI-API bewältigt alles von kleinen Projekten bis zu Enterprise-Anforderungen
Kommerzielle Nutzungsrechte
Volle Rechte zur Nutzung generierter Audiodateien in Ihren kommerziellen Produkten
So starten Sie mit unserer API
Schritt 1
Credits aufladen
Wählen Sie den richtigen Betrag für Ihre Bedürfnisse. Sie können 10.000 kostenlose Zeichen für die KI-Sprachgenerator-API erhalten, wenn Sie ein Meeting mit uns buchen.
Jetzt startenSchritt 2
API-Schlüssel generieren
Greifen Sie auf Ihr Dashboard zu, um Ihren einzigartigen geheimen Schlüssel für die sichere Sprach-API-Authentifizierung zu erstellen.
Text-to-Speech generierenSchritt 3
Sprache synthetisieren
Integrieren Sie die Text-zu-Audio-API mit nur wenigen Codezeilen und verwandeln Sie Ihren Text in natürliche Sprache.
Jetzt startenText-to-Speech API Documentation
Complete guide to integrating TTS API with 568 voices across multiple providers
Quick Navigation
VerbatikText-to-Speech API Overview
High-quality neural text-to-speech with multiple voices
The Verbatik Text-to-Speech API provides high-quality neural voice synthesis Convert text to natural-sounding speech with 568 voices across 142 languages.
Key Features
- 568 neural voices
- 142 supported languages
- SSML support for advanced control
- Multiple provider support
Technical Specs
- MP3 output format
- 24kHz sample rate
- Character-based billing
- S3 storage integration
Voice Library
API Endpoints
/api/v1/ttsSynthesize Speech
Convert text to speech using neural voices. Supports both plain text and SSML input with optional audio storage.
Request Headers
| Header | Required | Description |
|---|---|---|
| Authorization | Yes | Bearer token for authentication |
| Content-Type | Yes | text/plain or application/ssml+xml |
| X-Voice-ID | Optional | Voice to use (default: Matthew) |
| X-Store-Audio | Optional | Store audio in S3 and return URL (true/false) |
Example Request (Plain Text)
curl -X POST "https://api.verbatik.com/api/v1/tts" \
-H "Authorization: Bearer YOUR_API_TOKEN" \
-H "Content-Type: text/plain" \
-H "X-Voice-ID: Sarah" \
-H "X-Store-Audio: true" \
-d "Hello, this is a test of our text-to-speech API."Example Request (SSML)
curl -X POST "https://api.verbatik.com/api/v1/tts" \
-H "Authorization: Bearer YOUR_API_TOKEN" \
-H "Content-Type: application/ssml+xml" \
-H "X-Voice-ID: Emma" \
-H "X-Store-Audio: true" \
-d '<speak version="1.0">
Hello, <break time="500ms"/> this is a <emphasis level="strong">test</emphasis>
of our text-to-speech API with <prosody rate="slow">SSML support</prosody>.
</speak>'Example Response (Stored Audio)
{
"success": true,
"audio_url": "https://s3.eu-west-2.amazonaws.com/speak.verbatik.com/audio/123/uuid-filename.mp3"
}/api/v1/voicesGet Available Voices
Retrieve the list of all available voices with their details including name, gender, and language.
Example Request
curl -X GET "https://api.verbatik.com/api/v1/voices" \
-H "Authorization: Bearer YOUR_API_TOKEN"Example Response
[
{
"name": "Sarah",
"gender": "Female",
"language_code": "en-US"
},
{
"name": "Matthew",
"gender": "Male",
"language_code": "en-US"
},
{
"name": "Emma",
"gender": "Female",
"language_code": "en-US"
}
]Integration Examples
Ready-to-use code examples for popular programming languages and frameworks.
JavaScript/Node.js Example
async function synthesizeSpeech(text, voice, apiToken, storeAudio = true) {
try {
const response = await fetch('https://api.verbatik.com/api/v1/tts', {
method: 'POST',
headers: {
'Authorization': `Bearer ${apiToken}`,
'Content-Type': 'text/plain',
'X-Voice-ID': voice,
'X-Store-Audio': storeAudio.toString()
},
body: text
});
if (!response.ok) {
throw new Error(`HTTP error! status: ${response.status}`);
}
if (storeAudio) {
const result = await response.json();
console.log('Audio URL:', result.audio_url);
return result;
} else {
// Direct audio stream
const audioBlob = await response.blob();
const audioUrl = URL.createObjectURL(audioBlob);
return { audio_url: audioUrl };
}
} catch (error) {
console.error('TTS synthesis failed:', error);
throw error;
}
}
// Usage example
synthesizeSpeech(
'Hello, this is a test of our text-to-speech API.',
'Sarah',
'YOUR_API_TOKEN'
).then(result => {
console.log('Success:', result);
}).catch(error => {
console.error('Error:', error);
});Python Example
import requests
import json
def synthesize_speech(text, voice="Sarah", api_token="YOUR_API_TOKEN", store_audio=True):
url = "https://api.verbatik.com/api/v1/tts"
headers = {
"Authorization": f"Bearer {api_token}",
"Content-Type": "text/plain",
"X-Voice-ID": voice,
"X-Store-Audio": str(store_audio).lower()
}
try:
response = requests.post(url, headers=headers, data=text)
response.raise_for_status()
if store_audio:
result = response.json()
print(f"Audio URL: {result['audio_url']}")
return result
else:
# Direct audio content
return {"audio_content": response.content}
except requests.exceptions.RequestException as e:
print(f"TTS synthesis failed: {e}")
raise
# Usage example
result = synthesize_speech(
"Hello, this is a test of our text-to-speech API.",
"Sarah",
"YOUR_API_TOKEN"
)
print("Success:", result)PHP Example
<?php
function synthesizeSpeech($text, $voice = 'Sarah', $apiToken = 'YOUR_API_TOKEN', $storeAudio = true) {
$url = 'https://api.verbatik.com/api/v1/tts';
$headers = [
'Authorization: Bearer ' . $apiToken,
'Content-Type: text/plain',
'X-Voice-ID: ' . $voice,
'X-Store-Audio: ' . ($storeAudio ? 'true' : 'false')
];
$ch = curl_init();
curl_setopt($ch, CURLOPT_URL, $url);
curl_setopt($ch, CURLOPT_POST, true);
curl_setopt($ch, CURLOPT_POSTFIELDS, $text);
curl_setopt($ch, CURLOPT_HTTPHEADER, $headers);
curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
$response = curl_exec($ch);
$httpCode = curl_getinfo($ch, CURLINFO_HTTP_CODE);
curl_close($ch);
if ($httpCode !== 200) {
throw new Exception("HTTP error! status: $httpCode");
}
if ($storeAudio) {
$result = json_decode($response, true);
echo "Audio URL: " . $result['audio_url'] . "\n";
return $result;
} else {
return ['audio_content' => $response];
}
}
// Usage example
try {
$result = synthesizeSpeech(
'Hello, this is a test of our text-to-speech API.',
'Sarah',
'YOUR_API_TOKEN'
);
echo "Success: " . json_encode($result) . "\n";
} catch (Exception $e) {
echo "Error: " . $e->getMessage() . "\n";
}
?>cURL Example
# Basic text-to-speech request
curl -X POST "https://api.verbatik.com/api/v1/tts" \
-H "Authorization: Bearer YOUR_API_TOKEN" \
-H "Content-Type: text/plain" \
-H "X-Voice-ID: Sarah" \
-H "X-Store-Audio: true" \
-d "Hello, this is a test of our text-to-speech API."
# SSML request
curl -X POST "https://api.verbatik.com/api/v1/tts" \
-H "Authorization: Bearer YOUR_API_TOKEN" \
-H "Content-Type: application/ssml+xml" \
-H "X-Voice-ID: Emma" \
-H "X-Store-Audio: true" \
-d '<speak version="1.0">
Hello, <break time="500ms"/> this is a <emphasis level="strong">test</emphasis>
of our text-to-speech API with <prosody rate="slow">SSML support</prosody>.
</speak>'
# Get available voices
curl -X GET "https://api.verbatik.com/api/v1/voices" \
-H "Authorization: Bearer YOUR_API_TOKEN"Available Voices
Browse our collection of 568 neural voices across 142 languages from multiple providers.
Voice Browser
Use the Voice Library to browse, filter, and test all available voices with audio previews.
Troubleshooting & Error Handling
HTTP Error Codes
| Code | Error Type | Description | Solution |
|---|---|---|---|
| 400 | Bad Request | Invalid SSML or request format | Check SSML syntax and headers |
| 401 | Unauthorized | Missing or invalid API token | Verify Authorization header |
| 402 | Payment Required | Insufficient character balance | Top up account or reduce text length |
| 422 | Unprocessable Entity | Validation errors | Check request parameters |
| 429 | Too Many Requests | Rate limit exceeded | Wait before making more requests |
| 500 | Internal Server Error | Service unavailable | Try again later or contact support |
Common Issues & Solutions
"Invalid SSML format"
- Ensure SSML starts with <speak> and ends with </speak>
- Check for properly closed tags
- Validate XML syntax
- Use proper Content-Type header for SSML
"Voice not found"
- Use the /api/v1/voices endpoint to get valid voice names
- Check voice name spelling and case sensitivity
- Ensure the voice supports the target language
- Try with a default voice like "Matthew" or "Sarah"
"Audio quality issues"
- Use neural voices for best quality
- Check text for special characters or formatting
- Use SSML for pronunciation control
- Try different voices for comparison
SSML Quick Reference
Common SSML Tags
Example Usage
<speak version="1.0">
Hello <break time="1s"/>
<emphasis level="strong">world</emphasis>
</speak>Need Help?
Our support team is here to help you integrate the TTS API successfully.
Voice Cloning API Documentation
Advanced AI-powered multilingual voice cloning technology
Quick Navigation
Voice Cloning API Overview
Advanced AI-powered multilingual voice cloning technology
The Voice Cloning API provides state-of-the-art text-to-speech voice cloning capabilities powered by advanced AI technology. Clone any voice using a sample audio file and generate speech in multiple languages with remarkable accuracy and naturalness.
Key Features
- 14 supported languages
- High-quality voice cloning
- Multiple audio formats
- Voice enhancement options
Technical Specs
- Max text: 3,000 characters
- Max audio: 50MB
- 2x character billing rate
- S3 storage integration
API Endpoints
/api/v1/voice-cloningClone Voice
Clone a voice using provided text and speaker audio. Supports both file uploads and URL-based audio inputs.
Request Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
| text | string | Yes | Text to synthesize (max 3000 characters) |
| language | string | Yes | Target language code (see supported languages) |
| speaker_audio | file | Yes* | Audio file for voice cloning (max 50MB) |
| speaker_audio_url | string | Yes* | URL to audio file for voice cloning |
| cleanup_voice | boolean | No | Apply denoising to speaker audio (default: true) |
*Either speaker_audio or speaker_audio_url is required.
Example Request (cURL)
curl -X POST "https://api.verbatik.com/api/v1/voice-cloning" \
-H "Authorization: Bearer YOUR_API_TOKEN" \
-H "Content-Type: multipart/form-data" \
-F "text=Hello, this is a test of voice cloning technology." \
-F "language=en" \
-F "speaker_audio=@/path/to/speaker.wav" \
-F "cleanup_voice=true"Example Response
{
"success": true,
"audio_url": "https://s3.eu-west-2.amazonaws.com/speak.verbatik.com/voice-cloning/uuid-filename.wav",
"characters_used": 94,
"remaining_balance": 9906,
"language": "en"
}/api/v1/voice-cloning/languagesGet Supported Languages
Retrieve the list of supported languages for voice cloning.
Example Request
curl -X GET "https://api.verbatik.com/api/v1/voice-cloning/languages" \
-H "Authorization: Bearer YOUR_API_TOKEN"Example Response
{
"success": true,
"languages": {
"en": "English 🇺🇸",
"fr": "French 🇫🇷",
"de": "German 🇩🇪",
"es": "Spanish 🇪🇸",
"it": "Italian 🇮🇹",
"pt": "Portuguese 🇵🇹",
"cs": "Czech 🇨🇿",
"pl": "Polish 🇵🇱",
"ru": "Russian 🇷🇺",
"nl": "Dutch 🇳🇱",
"tr": "Turkish 🇹🇷",
"ar": "Arabic 🇦🇪",
"zh-cn": "Mandarin Chinese 🇨🇳",
"hi": "Hindi 🇮🇳"
}
}/api/v1/voice-cloning/infoGet Service Information
Get detailed information about the voice cloning service capabilities and limitations.
Example Request
curl -X GET "https://api.verbatik.com/api/v1/voice-cloning/info" \
-H "Authorization: Bearer YOUR_API_TOKEN"Example Response
{
"success": true,
"service": "Verbatik Voice Cloning",
"description": "Advanced AI-powered multilingual voice cloning technology",
"supported_formats": ["wav", "mp3", "m4a", "ogg", "flv"],
"max_text_length": 3000,
"max_audio_size": "50MB",
"character_rate": "2x (voice cloning uses double character rate)",
"min_audio_duration": "6 seconds recommended",
"languages": { ... }
}Integration Examples
Ready-to-use code examples for popular programming languages and frameworks.
JavaScript/Node.js Example
const FormData = require('form-data');
const fs = require('fs');
const fetch = require('node-fetch');
async function cloneVoice(text, audioFile, language, apiToken) {
const formData = new FormData();
formData.append('text', text);
formData.append('language', language);
formData.append('speaker_audio', fs.createReadStream(audioFile));
formData.append('cleanup_voice', 'true');
try {
const response = await fetch('https://api.verbatik.com/api/v1/voice-cloning', {
method: 'POST',
headers: {
'Authorization': `Bearer ${apiToken}`,
...formData.getHeaders()
},
body: formData
});
if (!response.ok) {
throw new Error(`HTTP error! status: ${response.status}`);
}
const result = await response.json();
console.log('Voice cloning successful:', result);
return result;
} catch (error) {
console.error('Voice cloning failed:', error);
throw error;
}
}
// Usage example
cloneVoice(
'Hello, this is a test of voice cloning technology.',
'/path/to/speaker.wav',
'en',
'YOUR_API_TOKEN'
).then(result => {
console.log('Audio URL:', result.audio_url);
}).catch(error => {
console.error('Error:', error);
});Python Example
import requests
def clone_voice(text, audio_file_path, language, api_token):
url = "https://api.verbatik.com/api/v1/voice-cloning"
headers = {
"Authorization": f"Bearer {api_token}"
}
files = {
'speaker_audio': open(audio_file_path, 'rb')
}
data = {
'text': text,
'language': language,
'cleanup_voice': 'true'
}
try:
response = requests.post(url, headers=headers, files=files, data=data)
response.raise_for_status()
result = response.json()
print(f"Voice cloning successful: {result}")
return result
except requests.exceptions.RequestException as e:
print(f"Voice cloning failed: {e}")
raise
finally:
files['speaker_audio'].close()
# Usage example
result = clone_voice(
"Hello, this is a test of voice cloning technology.",
"/path/to/speaker.wav",
"en",
"YOUR_API_TOKEN"
)
print(f"Audio URL: {result['audio_url']}")PHP Example
<?php
function cloneVoice($text, $audioFilePath, $language, $apiToken) {
$url = 'https://api.verbatik.com/api/v1/voice-cloning';
$postFields = [
'text' => $text,
'language' => $language,
'cleanup_voice' => 'true',
'speaker_audio' => new CURLFile($audioFilePath)
];
$ch = curl_init();
curl_setopt($ch, CURLOPT_URL, $url);
curl_setopt($ch, CURLOPT_POST, true);
curl_setopt($ch, CURLOPT_POSTFIELDS, $postFields);
curl_setopt($ch, CURLOPT_HTTPHEADER, [
'Authorization: Bearer ' . $apiToken
]);
curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
$response = curl_exec($ch);
$httpCode = curl_getinfo($ch, CURLINFO_HTTP_CODE);
curl_close($ch);
if ($httpCode !== 200) {
throw new Exception("HTTP error! status: $httpCode");
}
$result = json_decode($response, true);
echo "Voice cloning successful: " . json_encode($result) . "\n";
return $result;
}
// Usage example
try {
$result = cloneVoice(
'Hello, this is a test of voice cloning technology.',
'/path/to/speaker.wav',
'en',
'YOUR_API_TOKEN'
);
echo "Audio URL: " . $result['audio_url'] . "\n";
} catch (Exception $e) {
echo "Error: " . $e->getMessage() . "\n";
}
?>cURL Example
# Voice cloning with file upload
curl -X POST "https://api.verbatik.com/api/v1/voice-cloning" \
-H "Authorization: Bearer YOUR_API_TOKEN" \
-H "Content-Type: multipart/form-data" \
-F "text=Hello, this is a test of voice cloning technology." \
-F "language=en" \
-F "speaker_audio=@/path/to/speaker.wav" \
-F "cleanup_voice=true"
# Voice cloning with audio URL
curl -X POST "https://api.verbatik.com/api/v1/voice-cloning" \
-H "Authorization: Bearer YOUR_API_TOKEN" \
-H "Content-Type: multipart/form-data" \
-F "text=Hello, this is a test of voice cloning technology." \
-F "language=en" \
-F "speaker_audio_url=https://example.com/speaker.wav" \
-F "cleanup_voice=true"
# Get supported languages
curl -X GET "https://api.verbatik.com/api/v1/voice-cloning/languages" \
-H "Authorization: Bearer YOUR_API_TOKEN"
# Get service information
curl -X GET "https://api.verbatik.com/api/v1/voice-cloning/info" \
-H "Authorization: Bearer YOUR_API_TOKEN"Supported Languages
The Voice Cloning API supports 14 languages for high-quality voice synthesis.
Troubleshooting & Error Handling
HTTP Error Codes
| Code | Error Type | Description | Solution |
|---|---|---|---|
| 400 | Bad Request | Invalid request parameters | Check parameter format and values |
| 401 | Unauthorized | Missing or invalid authentication | Verify API token in Authorization header |
| 402 | Payment Required | Insufficient character balance | Top up your account or reduce text length |
| 413 | Payload Too Large | Audio file exceeds size limit | Reduce file size to under 50MB |
| 422 | Unprocessable Entity | Validation errors | Check the errors field in response |
| 429 | Too Many Requests | Rate limit exceeded | Wait before making more requests |
| 500 | Internal Server Error | Service unavailable | Try again later or contact support |
Common Issues & Solutions
"Invalid speaker audio file"
- Check file format (must be wav, mp3, m4a, ogg, or flv)
- Ensure file size is under 50MB
- Verify file is not corrupted
- Make sure the file contains actual audio data
"Voice cloning timed out"
- The service has a 5-minute timeout
- Try with shorter text or smaller audio files
- Check if the audio file is too large or complex
- Retry the request after a few minutes
"Insufficient character balance"
- Voice cloning uses 2x character rate
- Check user's available character balance
- Consider upgrading user's plan
- Reduce text length to fit within available credits
"Unsupported language"
- Use the /voice-cloning/languages endpoint to get supported languages
- Ensure language code is lowercase (e.g., 'en', not 'EN')
- Check for typos in the language code
- Refer to the supported languages section above
Security Best Practices
API Token Security
Store API tokens securely and never expose them in client-side code
HTTPS Only
Always use HTTPS for API requests to ensure data encryption
Need Help?
Our support team is here to help you integrate the Voice Cloning API successfully.
Integration mit Ihren Lieblings-Tools
Verbinden Sie Verbatiks leistungsstarke Text-to-Speech API mit Tausenden von Apps über Zapiers Automatisierungsplattform.
Sofortige Umwandlung von Text in Sprache
Verwandeln Sie mühelos Text in natürlich klingende Sprache mit unserem innovativen Sofort-Konvertierungstool
Über 600+ Sprachstimmen
Wählen Sie aus über 600 realistischen KI-Stimmen für vielfältige Text-to-Speech-Konvertierungen
Unterstützt 142 Sprachen und Akzente
Breite sprachliche Vielfalt mit Unterstützung für 142 Sprachen und Akzente
Ermöglicht Anpassung der Stimmen
Anpassung von KI-Stimmen einschließlich Geschwindigkeit, Tonhöhe, Lautstärke und Aussprache
Download-Optionen in MP3- und WAV-Audioformaten
Audio-Downloads in hochwertigen MP3- und WAV-Formaten für vielseitige Nutzung
Kommerzielle und Rundfunkrechte
Kommerzielle und Rundfunkrechte für weitreichende Audioverbreitung verfügbar
Wie Verbatik Ihnen helfen kann
KI-Stimmen in jeder Sprache und jedem Akzent der Welt



Kundenstimmen
Das sagen unsere Kunden

Ricky G.
Content Creator
I liked that verbatik has some of the most realistic voices on the market. There is a large amount and many ways to customize all of the AI voices.

Juan Carlos V.
Media Production
I've found Verbatik to be a game-changer in the realm of text-to-speech. The platform is user-friendly, and the voice outputs are impressively lifelike. Whether I'm working on a presentation, video, or just experimenting, Verbatik has consistently delivered high-quality audio that adds value to my projects.

Boris S.
Private Tutor
I discovered VERBATIK when I was looking for a way to create voiceovers for multiple dialogues and monologues for personal use and it provided a good quality service for low price.

Zoe D.
Student
Verbatik is relatively easy to use and understand and gives the quick text-to-speech results I wanted. The audio was used to narrate an educational video and was easy to export!

Verified Reviewer
Broadcast Media
The broad range of AI voices and the ability to personalize the voice experience is very valuable to me as a content creator. I look forward to using it on a regular basis and exploring more tools that can optimise my content production process.

Aswin V.
Technical Associate
Verbatik software is a user-friendly software that easily converts text into speech. The accuracy of the transcription is really amazing. There are different customization options available. we can change the speaker. it supports different languages.
FAQ
Antworten auf alle Ihre Fragen
Erleben Sie KI-gestützte Stimmen
Erstellen Sie noch heute Ihr Konto
Entdecken Sie die Kraft lebensechter Text-to-Speech-Technologie. Registrieren Sie sich jetzt und verwandeln Sie Ihre Inhalte mit natürlich klingenden Stimmen.


