
Models

Lightning

Our fastest model, optimized for low-latency applications. It can generate 10 seconds of audio in just 100 milliseconds, making it ideal for real-time applications such as voicebots and interactive systems.
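The speed claim above can be expressed as a real-time factor (RTF), the figure the model overview table quotes for this model. The sketch below only works through that arithmetic; the function name `real_time_factor` is illustrative and not part of any Waves client library.

```python
def real_time_factor(generation_seconds: float, audio_seconds: float) -> float:
    """RTF = time spent generating / duration of the audio produced.

    Values below 1.0 mean the model generates audio faster than real time.
    """
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return generation_seconds / audio_seconds


# Figures quoted for the Lightning model: 10 s of audio in 100 ms.
rtf = real_time_factor(generation_seconds=0.1, audio_seconds=10.0)
print(f"RTF = {rtf:.2f}")
```

An RTF of 0.01 means generation takes roughly 1% of the playback duration of the audio produced.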

Lightning v2

An upgrade from the Lightning Large model, offering improved performance and quality. It supports 16 languages, making it suitable for a wider range of applications requiring expressive and high-quality speech synthesis.

Lightning Large [⚠️ To be Deprecated]

Offers more emotional depth and expressiveness compared to the Lightning model. It supports voice cloning and has a latency of just under 300 milliseconds, making it suitable for applications requiring high-quality, expressive speech.

Geo-location Based Routing

Waves intelligently routes every request to the nearest server cluster to ensure the lowest possible latency for your applications. We currently operate server clusters in:

  • 🇮🇳 India (Mumbai)
  • 🇺🇸 USA (Oregon)

Our routing system automatically detects the client’s geographical location and connects them to the optimal server based on network proximity and latency. This process is fully automated; no manual configuration is required on your side.

Model Overview

| Model ID | Description | Languages Supported |
|---|---|---|
| lightning | Fastest model with an RTF of 0.01, generating 10 seconds of audio in 100 ms. | English, Hindi |
| lightning-large | More emotional depth and expressiveness; supports voice cloning; latency under 300 ms. | English, Hindi |
| lightning-v2 | 100 ms TTFB; supports 16 languages with voice cloning. | English, Hindi, Tamil, Kannada, Gujarati, Bengali, Marathi, German, French, Spanish, Italian, Polish, Dutch, Russian, Arabic, Hebrew |

Note: The API uses ISO 639-1 (Set 1, two-letter) language codes to specify the language, for example `en` for English and `hi` for Hindi.
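As a rough illustration, the languages in the model overview table map to the ISO 639-1 codes below. The codes themselves come from the ISO standard and the per-model language lists from the table; the helper `language_code` is hypothetical, and this page does not state which request field carries the code, so check the API reference before wiring it into a request.

```python
# ISO 639-1 two-letter codes for the languages in the model overview table.
ISO_639_1 = {
    "English": "en", "Hindi": "hi", "Tamil": "ta", "Kannada": "kn",
    "Gujarati": "gu", "Bengali": "bn", "Marathi": "mr", "German": "de",
    "French": "fr", "Spanish": "es", "Italian": "it", "Polish": "pl",
    "Dutch": "nl", "Russian": "ru", "Arabic": "ar", "Hebrew": "he",
}

# Languages per model ID, copied from the model overview table.
MODEL_LANGUAGES = {
    "lightning": ["English", "Hindi"],
    "lightning-large": ["English", "Hindi"],
    "lightning-v2": list(ISO_639_1),
}


def language_code(model_id: str, language: str) -> str:
    """Return the ISO 639-1 code if the model supports the language.

    Hypothetical helper for illustration; not part of any Waves client library.
    """
    if model_id not in MODEL_LANGUAGES:
        raise ValueError(f"unknown model ID: {model_id!r}")
    if language not in MODEL_LANGUAGES[model_id]:
        raise ValueError(f"{model_id} does not list {language} as supported")
    return ISO_639_1[language]


print(language_code("lightning-v2", "Tamil"))
```

Validating the model and language pair on the client side gives an early, readable error instead of relying on the API to reject an unsupported combination.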

Pricing

Our pricing model is designed to be flexible and scalable, catering to different usage needs. For detailed pricing information, please visit our pricing page or contact our sales team at support@smallest.ai.