新機能! 音声クローニングが利用可能 37言語
すべて見る

テキスト読み上げ
1回のAPI呼び出し
シンプル化

テキスト読み上げAPIで音声アプリ、オーディオブック、アクセシブルなコンテンツを作成。開発者とビジネス向けに142言語で600以上の自然な音声。

卓越性が認められています

Verbatikについてのユーザーの声をご覧ください

API料金

シンプルで透明な料金

業界最高のコストパフォーマンス

わかりやすい料金設定で業界最高の価値を提供。あらゆる規模のビジネスに最適なAPIは、1文字あたりわずか$0.000025でプレミアムなテキスト読み上げ変換を提供。予算を気にせず音声コンテンツを拡大できます。

  • 50,000 Characters per $1
  • High-Quality TTS Voices
  • Fast TTS Speed
  • Commercial Rights
  • Simple API Integration
  • 600 Voices 142 Languages

従量課金API

50,000 文字/$1

または100万文字で$20

開発者向け

開発者がVerbatikのTTS APIを選ぶ理由

優れたAI音声API

高度な音声AI技術を搭載した600以上の自然な音声にアクセス

超高速処理

最適化されたテキストから音声APIで数秒で数百万文字を変換

142の言語と方言

包括的なテキストからオーディオAPIでグローバルな視聴者にリーチ

シンプルな統合

明確なドキュメントでわずか数行のコードで音声AI APIを実装

エンタープライズスケーラビリティ

テキスト読み上げAI APIは小規模プロジェクトからエンタープライズニーズまで対応

商用利用権

生成されたオーディオを商用製品で使用する完全な権利

APIの始め方

ステップ 1

クレジットをチャージ

ニーズに合った金額を選択。ミーティングを予約すると、AI音声ジェネレーターAPI用に10,000文字を無料で取得できます。

今すぐ始める

ステップ 2

APIキーを生成

ダッシュボードにアクセスして、安全な音声AI API認証用のユニークなシークレットキーを作成。

テキスト読み上げを生成

ステップ 3

音声を合成

わずか数行のコードでテキストからオーディオAPIを統合し、テキストを自然な音声に変換。

今すぐ始める

Text-to-Speech API Documentation

Complete guide to integrating TTS API with 568 voices across multiple providers

API v1.0

VerbatikText-to-Speech API Overview

High-quality neural text-to-speech with multiple voices

The Verbatik Text-to-Speech API provides high-quality neural voice synthesis Convert text to natural-sounding speech with 568 voices across 142 languages.

Key Features

  • 568 neural voices
  • 142 supported languages
  • SSML support for advanced control
  • Multiple provider support

Technical Specs

  • MP3 output format
  • 24kHz sample rate
  • Character-based billing
  • S3 storage integration

Voice Library

568
High-Quality Neural Voices
142
Languages Supported
24kHz
Sample Rate

API Endpoints

POST /api/v1/tts

Synthesize Speech

Convert text to speech using neural voices. Supports both plain text and SSML input with optional audio storage.

Request Headers

HeaderRequiredDescription
AuthorizationYesBearer token for authentication
Content-TypeYestext/plain or application/ssml+xml
X-Voice-IDOptionalVoice to use (default: Matthew)
X-Store-AudioOptionalStore audio in S3 and return URL (true/false)
Example Request (Plain Text)
curl -X POST "https://api.verbatik.com/api/v1/tts" \
  -H "Authorization: Bearer YOUR_API_TOKEN" \
  -H "Content-Type: text/plain" \
  -H "X-Voice-ID: Sarah" \
  -H "X-Store-Audio: true" \
  -d "Hello, this is a test of our text-to-speech API."
Example Request (SSML)
curl -X POST "https://api.verbatik.com/api/v1/tts" \
  -H "Authorization: Bearer YOUR_API_TOKEN" \
  -H "Content-Type: application/ssml+xml" \
  -H "X-Voice-ID: Emma" \
  -H "X-Store-Audio: true" \
  -d '<speak version="1.0">
    Hello, <break time="500ms"/> this is a <emphasis level="strong">test</emphasis> 
    of our text-to-speech API with <prosody rate="slow">SSML support</prosody>.
</speak>'
Example Response (Stored Audio)
{
  "success": true,
  "audio_url": "https://s3.eu-west-2.amazonaws.com/speak.verbatik.com/audio/123/uuid-filename.mp3"
}
GET /api/v1/voices

Get Available Voices

Retrieve the list of all available voices with their details including name, gender, and language.

Example Request
curl -X GET "https://api.verbatik.com/api/v1/voices" \
  -H "Authorization: Bearer YOUR_API_TOKEN"
Example Response
[
  {
    "name": "Sarah",
    "gender": "Female",
    "language_code": "en-US"
  },
  {
    "name": "Matthew",
    "gender": "Male", 
    "language_code": "en-US"
  },
  {
    "name": "Emma",
    "gender": "Female",
    "language_code": "en-US"
  }
]

Integration Examples

Ready-to-use code examples for popular programming languages and frameworks.

JavaScript/Node.js Example

async function synthesizeSpeech(text, voice, apiToken, storeAudio = true) {
    try {
        const response = await fetch('https://api.verbatik.com/api/v1/tts', {
            method: 'POST',
            headers: {
                'Authorization': `Bearer ${apiToken}`,
                'Content-Type': 'text/plain',
                'X-Voice-ID': voice,
                'X-Store-Audio': storeAudio.toString()
            },
            body: text
        });

        if (!response.ok) {
            throw new Error(`HTTP error! status: ${response.status}`);
        }

        if (storeAudio) {
            const result = await response.json();
            console.log('Audio URL:', result.audio_url);
            return result;
        } else {
            // Direct audio stream
            const audioBlob = await response.blob();
            const audioUrl = URL.createObjectURL(audioBlob);
            return { audio_url: audioUrl };
        }

    } catch (error) {
        console.error('TTS synthesis failed:', error);
        throw error;
    }
}

// Usage example
synthesizeSpeech(
    'Hello, this is a test of our text-to-speech API.',
    'Sarah',
    'YOUR_API_TOKEN'
).then(result => {
    console.log('Success:', result);
}).catch(error => {
    console.error('Error:', error);
});

Available Voices

Browse our collection of 568 neural voices across 142 languages from multiple providers.

568
Total Voices
Neural Quality
142
Languages
Global Coverage
24kHz
Sample Rate
High Fidelity

Voice Browser

Use the Voice Library to browse, filter, and test all available voices with audio previews.

af-ZA 2 voices
Sample: Adri, Willem
am-ET 2 voices
Sample: Mekdes, Ameha
ar-AE 2 voices
Sample: Fatima, Hamdan
ar-BH 2 voices
Sample: Laila, Ali
ar-DZ 2 voices
Sample: Amina, Ismael
ar-EG 2 voices
Sample: Salma, Shakir

Troubleshooting & Error Handling

HTTP Error Codes

CodeError TypeDescriptionSolution
400Bad RequestInvalid SSML or request formatCheck SSML syntax and headers
401UnauthorizedMissing or invalid API tokenVerify Authorization header
402Payment RequiredInsufficient character balanceTop up account or reduce text length
422Unprocessable EntityValidation errorsCheck request parameters
429Too Many RequestsRate limit exceededWait before making more requests
500Internal Server ErrorService unavailableTry again later or contact support

Common Issues & Solutions

"Invalid SSML format"

  • Ensure SSML starts with <speak> and ends with </speak>
  • Check for properly closed tags
  • Validate XML syntax
  • Use proper Content-Type header for SSML

"Voice not found"

  • Use the /api/v1/voices endpoint to get valid voice names
  • Check voice name spelling and case sensitivity
  • Ensure the voice supports the target language
  • Try with a default voice like "Matthew" or "Sarah"

"Audio quality issues"

  • Use neural voices for best quality
  • Check text for special characters or formatting
  • Use SSML for pronunciation control
  • Try different voices for comparison

SSML Quick Reference

Common SSML Tags

<break time="500ms"/> - Add pause
<emphasis level="strong"> - Emphasize text
<prosody rate="slow"> - Control speed
<prosody pitch="high"> - Control pitch

Example Usage

<speak version="1.0">
  Hello <break time="1s"/>
  <emphasis level="strong">world</emphasis>
</speak>

Need Help?

Our support team is here to help you integrate the TTS API successfully.

Voice Cloning API Documentation

Advanced AI-powered multilingual voice cloning technology

API v1.0

Voice Cloning API Overview

Advanced AI-powered multilingual voice cloning technology

The Voice Cloning API provides state-of-the-art text-to-speech voice cloning capabilities powered by advanced AI technology. Clone any voice using a sample audio file and generate speech in multiple languages with remarkable accuracy and naturalness.

Key Features

  • 14 supported languages
  • High-quality voice cloning
  • Multiple audio formats
  • Voice enhancement options

Technical Specs

  • Max text: 3,000 characters
  • Max audio: 50MB
  • 2x character billing rate
  • S3 storage integration

API Endpoints

POST /api/v1/voice-cloning

Clone Voice

Clone a voice using provided text and speaker audio. Supports both file uploads and URL-based audio inputs.

Request Parameters

ParameterTypeRequiredDescription
textstringYesText to synthesize (max 3000 characters)
languagestringYesTarget language code (see supported languages)
speaker_audiofileYes*Audio file for voice cloning (max 50MB)
speaker_audio_urlstringYes*URL to audio file for voice cloning
cleanup_voicebooleanNoApply denoising to speaker audio (default: true)

*Either speaker_audio or speaker_audio_url is required.

Example Request (cURL)
curl -X POST "https://api.verbatik.com/api/v1/voice-cloning" \
  -H "Authorization: Bearer YOUR_API_TOKEN" \
  -H "Content-Type: multipart/form-data" \
  -F "text=Hello, this is a test of voice cloning technology." \
  -F "language=en" \
  -F "speaker_audio=@/path/to/speaker.wav" \
  -F "cleanup_voice=true"
Example Response
{
  "success": true,
  "audio_url": "https://s3.eu-west-2.amazonaws.com/speak.verbatik.com/voice-cloning/uuid-filename.wav",
  "characters_used": 94,
  "remaining_balance": 9906,
  "language": "en"
}
GET /api/v1/voice-cloning/languages

Get Supported Languages

Retrieve the list of supported languages for voice cloning.

Example Request
curl -X GET "https://api.verbatik.com/api/v1/voice-cloning/languages" \
  -H "Authorization: Bearer YOUR_API_TOKEN"
Example Response
{
  "success": true,
  "languages": {
    "en": "English 🇺🇸",
    "fr": "French 🇫🇷",
    "de": "German 🇩🇪",
    "es": "Spanish 🇪🇸",
    "it": "Italian 🇮🇹",
    "pt": "Portuguese 🇵🇹",
    "cs": "Czech 🇨🇿",
    "pl": "Polish 🇵🇱",
    "ru": "Russian 🇷🇺",
    "nl": "Dutch 🇳🇱",
    "tr": "Turkish 🇹🇷",
    "ar": "Arabic 🇦🇪",
    "zh-cn": "Mandarin Chinese 🇨🇳",
    "hi": "Hindi 🇮🇳"
  }
}
GET /api/v1/voice-cloning/info

Get Service Information

Get detailed information about the voice cloning service capabilities and limitations.

Example Request
curl -X GET "https://api.verbatik.com/api/v1/voice-cloning/info" \
  -H "Authorization: Bearer YOUR_API_TOKEN"
Example Response
{
  "success": true,
  "service": "Verbatik Voice Cloning",
  "description": "Advanced AI-powered multilingual voice cloning technology",
  "supported_formats": ["wav", "mp3", "m4a", "ogg", "flv"],
  "max_text_length": 3000,
  "max_audio_size": "50MB",
  "character_rate": "2x (voice cloning uses double character rate)",
  "min_audio_duration": "6 seconds recommended",
  "languages": { ... }
}

Integration Examples

Ready-to-use code examples for popular programming languages and frameworks.

JavaScript/Node.js Example

const FormData = require('form-data');
const fs = require('fs');
const fetch = require('node-fetch');

async function cloneVoice(text, audioFile, language, apiToken) {
    const formData = new FormData();
    formData.append('text', text);
    formData.append('language', language);
    formData.append('speaker_audio', fs.createReadStream(audioFile));
    formData.append('cleanup_voice', 'true');

    try {
        const response = await fetch('https://api.verbatik.com/api/v1/voice-cloning', {
            method: 'POST',
            headers: {
                'Authorization': `Bearer ${apiToken}`,
                ...formData.getHeaders()
            },
            body: formData
        });

        if (!response.ok) {
            throw new Error(`HTTP error! status: ${response.status}`);
        }

        const result = await response.json();
        console.log('Voice cloning successful:', result);
        return result;

    } catch (error) {
        console.error('Voice cloning failed:', error);
        throw error;
    }
}

// Usage example
cloneVoice(
    'Hello, this is a test of voice cloning technology.',
    '/path/to/speaker.wav',
    'en',
    'YOUR_API_TOKEN'
).then(result => {
    console.log('Audio URL:', result.audio_url);
}).catch(error => {
    console.error('Error:', error);
});

Supported Languages

The Voice Cloning API supports 14 languages for high-quality voice synthesis.

EN
English 🇺🇸
en
FR
French 🇫🇷
fr
DE
German 🇩🇪
de
ES
Spanish 🇪🇸
es
IT
Italian 🇮🇹
it
PT
Portuguese 🇵🇹
pt
CS
Czech 🇨🇿
cs
PL
Polish 🇵🇱
pl
RU
Russian 🇷🇺
ru
NL
Dutch 🇳🇱
nl
TR
Turkish 🇹🇷
tr
AR
Arabic 🇦🇪
ar
ZH-CN
Mandarin Chinese 🇨🇳
zh-cn
HI
Hindi 🇮🇳
hi

Troubleshooting & Error Handling

HTTP Error Codes

CodeError TypeDescriptionSolution
400Bad RequestInvalid request parametersCheck parameter format and values
401UnauthorizedMissing or invalid authenticationVerify API token in Authorization header
402Payment RequiredInsufficient character balanceTop up your account or reduce text length
413Payload Too LargeAudio file exceeds size limitReduce file size to under 50MB
422Unprocessable EntityValidation errorsCheck the errors field in response
429Too Many RequestsRate limit exceededWait before making more requests
500Internal Server ErrorService unavailableTry again later or contact support

Common Issues & Solutions

"Invalid speaker audio file"

  • Check file format (must be wav, mp3, m4a, ogg, or flv)
  • Ensure file size is under 50MB
  • Verify file is not corrupted
  • Make sure the file contains actual audio data

"Voice cloning timed out"

  • The service has a 5-minute timeout
  • Try with shorter text or smaller audio files
  • Check if the audio file is too large or complex
  • Retry the request after a few minutes

"Insufficient character balance"

  • Voice cloning uses 2x character rate
  • Check user's available character balance
  • Consider upgrading user's plan
  • Reduce text length to fit within available credits

"Unsupported language"

  • Use the /voice-cloning/languages endpoint to get supported languages
  • Ensure language code is lowercase (e.g., 'en', not 'EN')
  • Check for typos in the language code
  • Refer to the supported languages section above

Security Best Practices

API Token Security

Store API tokens securely and never expose them in client-side code

HTTPS Only

Always use HTTPS for API requests to ensure data encryption

Need Help?

Our support team is here to help you integrate the Voice Cloning API successfully.

お気に入りのツールと統合

Zapierの自動化プラットフォームを通じて、Verbatikの強力なテキスト読み上げAPIを数千のアプリと接続。

テキストを音声に即座に変換

革新的な即時変換ツールで、テキストを自然な音声に簡単に変換

600以上の音声

多様なテキスト読み上げ変換のために600以上のリアルなAI音声から選択

142の言語とアクセントをサポート

142の言語とアクセントをサポートし、幅広い言語の多様性を提供

音声のカスタマイズが可能

速度、ピッチ、音量、発音を含むAI音声のカスタマイズ

MP3とWAVオーディオファイル形式でダウンロード

多目的な使用のために高品質なMP3とWAV形式でオーディオダウンロードを提供

商用および放送権

広範囲なオーディオ配信のための商用および放送権が利用可能

Verbatikがお手伝いできること

世界中のすべての言語とアクセントのAI音声

記事を魅力的なオーディオに変換

お客様の声

お客様からの評価

Ricky G.

Ricky G.

コンテンツクリエイター

Verbatikは市場で最もリアルな音声を持っていることが気に入りました。多くの種類があり、すべてのAI音声をカスタマイズする方法がたくさんあります。

Juan Carlos V.

Juan Carlos V.

メディア制作

Verbatikはテキスト読み上げの分野でゲームチェンジャーだと感じています。プラットフォームは使いやすく、音声出力は驚くほどリアルです。プレゼンテーション、動画、または実験的な作業でも、Verbatikは常に高品質なオーディオを提供し、プロジェクトに価値を加えてくれます。

Boris S.

Boris S.

個人講師

個人使用のために複数の対話やモノローグのナレーションを作成する方法を探していたときにVERBATIKを発見しました。低価格で良質なサービスを提供してくれました。

Zoe D.

Zoe D.

学生

Verbatikは比較的使いやすく理解しやすく、求めていた迅速なテキスト読み上げ結果を提供してくれます。オーディオは教育動画のナレーションに使用され、エクスポートも簡単でした!

認証済みレビュアー

認証済みレビュアー

放送メディア

幅広いAI音声と音声体験をパーソナライズできる機能は、コンテンツクリエイターとして非常に価値があります。定期的に使用し、コンテンツ制作プロセスを最適化できるツールをさらに探求することを楽しみにしています。

Aswin V.

Aswin V.

テクニカルアソシエイト

Verbatikソフトウェアは、テキストを音声に簡単に変換できるユーザーフレンドリーなソフトウェアです。文字起こしの精度は本当に素晴らしいです。さまざまなカスタマイズオプションがあり、スピーカーを変更でき、さまざまな言語をサポートしています。

よくある質問

すべての質問にお答えします

AI音声を体験

今すぐアカウントを作成

リアルなテキスト読み上げ技術の力を解放しましょう。今すぐ登録して、自然な音声でコンテンツを変革しましょう。