Skip to main content

Voice Options Overview

TopCalls offers different voice options depending on the conversation mode you choose:
ModeVoice OptionsVoice CloningLanguagesLatency
RealtimePreset voicesNoAuto-detect~200-500ms
LegacyHundreds of built-in voicesYes36+~300-600ms
Check available voices via the API:
  • All built-in voices: GET /v1/voices/builtin
  • Your account’s voices (including cloned): GET /v1/voices
Voice availability may change over time. Use the API endpoints for the most current list.

Realtime Mode Voices

In Realtime Mode, choose from preset voices optimized for low-latency conversations. The default is alloy.
{
  "mode": "realtime",
  "voice": "alloy"
}
Realtime Mode voices are optimized for ultra-low latency and natural conversation. Check available options via GET /v1/voices/builtin.

Legacy Mode: Full Voice Library

Legacy Mode gives you access to hundreds of voices across different styles, tones, and languages.

Using Built-in Voices

Browse available voices via GET /v1/voices/builtin and use the voice ID in your calls:
{
  "mode": "legacy",
  "voice": "voice-id-from-api"
}

Voice Cloning

Clone any voice with a short audio sample through the TopCalls dashboard: Step 1: Prepare Audio Samples
  • 1-5 audio files (MP3, WAV)
  • At least 1 minute total duration
  • Clear, high-quality recordings
  • Single speaker, minimal background noise
Step 2: Clone via Dashboard Upload your audio samples through the TopCalls dashboard. The platform processes them and creates a custom voice for your account. Step 3: Use Cloned Voice Once cloned, use the voice ID in your API calls:
{
  "mode": "legacy",
  "voice": "your_cloned_voice_id"
}
Your cloned voices appear in the GET /v1/voices endpoint alongside built-in voices.
Voice cloning requires high-quality audio samples. Poor quality samples produce poor voice quality. Use professional recordings when possible.

Choosing the Right Voice

For Customer Support

Choose warm, professional voices that sound patient and helpful. Check available voices via the API and test a few to find the best fit.

For Sales & Outreach

Choose confident, energetic voices that sound engaging. Match the voice to your brand personality.

For Brand Consistency

Clone your brand spokesperson’s voice and use the same voice across all channels.

Multi-Language Voices

Legacy Mode with Language Control

{
  "mode": "legacy",
  "stt_language": "es-ES",
  "voice": "spanish-voice-id"
}
Legacy Mode supports 36+ languages with proper dialect control. Set stt_language for accurate speech recognition and choose a voice that matches the target language. Supported languages include English, Spanish, German, French, Dutch, Italian, Japanese, and many more.

Best Practices

Do This

  • Test voices: Try different voices to find the best fit for your use case
  • Match tone: Choose voices that match your brand personality
  • Consider the use case: Support calls need different voices than sales calls
  • Use cloning for brands: Clone spokesperson voices for consistency
  • Test quality: Always test cloned voices before production

Avoid This

  • Ignore latency: Realtime Mode is faster but has fewer voice options
  • Poor audio samples: Use high-quality recordings for cloning
  • Mismatched languages: Make sure the voice language matches your instructions
  • Too many voices: Stick to 1-2 voices for consistency

Next Steps

AI & Voice Customization

Control your AI’s personality and behavior.

Making Calls

Use custom voices when making calls.