---
title: Models
description: >-
  Find detailed descriptions of each model, along with its capabilities and
  supported languages.
icon: cube
---

## Text to Speech (TTS) Models

<CardGroup cols={2}>
  <Card title="Lightning v2" icon="square-2">
    An upgrade from the Lightning Large model, offering improved performance and
    quality. It supports 16 languages, making it suitable for a wider range of
    applications requiring expressive and high-quality speech synthesis.
  </Card>

  <Card title="Lightning v3.1" icon="square-3">
    A 44 kHz model delivering natural, expressive, and realistic speech. Supports voice cloning with ultra-low latency. Supports English, Hindi, Tamil, and Spanish.
  </Card>
</CardGroup>

## Speech to Text (STT) Models

<CardGroup cols={2}>
  <Card title="Pulse STT" icon="square-1">
    High-accuracy, low-latency automatic speech recognition model built for
    real-time transcription. It supports automatic language detection across 32
    languages and delivers fast, reliable results.
  </Card>
</CardGroup>

## Geo-location Based Routing

Waves intelligently routes every request to the nearest server cluster to ensure the lowest possible latency for your applications. We currently operate server clusters in:

* 🇮🇳 India (Mumbai)
* 🇺🇸 USA (Oregon)

Our routing system automatically detects the client's geographical location and connects it to the optimal server based on network proximity and latency. This process is fully automated; no manual configuration is required on your side.

## Model Overview (TTS)

| Model ID           | Description                                                                         | Languages Supported                                                                                                                                                                                                                                                                                                |
| ------------------ | ----------------------------------------------------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ |
| **lightning-v2**   | 100 ms TTFB; supports 16 languages with voice cloning.                              | `English` <br /> `Hindi` <br /> `Tamil` <br /> `Kannada` <br /> `Malayalam` <br /> `Telugu` <br /> `Gujarati` <br /> `Bengali` <br /> `Marathi` <br /> `German` <br /> `French` <br /> `Spanish` <br /> `Italian` <br /> `Polish` <br /> `Dutch` <br /> `Russian` <br /> `Arabic` <br /> `Hebrew` <br /> `Swedish` |
| **lightning-v3.1** | 44 kHz model, natural expressive speech, ultra-low latency, supports voice cloning. | `English` <br /> `Hindi` <br /> `Tamil` <br /> `Spanish`                                                                                                                                                                                                                                                           |

## Model Overview (STT)

| Model ID  | Description                                                                                           | Languages Supported                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |
| --------- | ----------------------------------------------------------------------------------------------------- | ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| **pulse** | Low-latency speech-to-text model supporting automatic language detection and real-time transcription. | `Italian` <br /> `Spanish` <br /> `English` <br /> `Portuguese` <br /> `Hindi` <br /> `German` <br /> `French` <br /> `Ukrainian` <br /> `Russian` <br /> `Kannada` <br /> `Malayalam` <br /> `Polish` <br /> `Marathi` <br /> `Gujarati` <br /> `Czech` <br /> `Slovak` <br /> `Telugu` <br /> `Oriya (Odia)` <br /> `Dutch` <br /> `Bengali` <br /> `Latvian` <br /> `Estonian` <br /> `Romanian` <br /> `Punjabi` <br /> `Finnish` <br /> `Swedish` <br /> `Bulgarian` <br /> `Tamil` <br /> `Hungarian` <br /> `Danish` <br /> `Lithuanian` <br /> `Maltese` |

<Note>
  The API uses two-letter [ISO 639-1 language
  codes](https://en.wikipedia.org/wiki/List_of_ISO_639_language_codes) to
  specify languages.
</Note>
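
As a minimal sketch of how the tables and the language-code note fit together, the snippet below maps the languages listed for `lightning-v3.1` to their ISO 639-1 codes and checks whether a model lists a given language. The names `ISO_639_1`, `SUPPORTED`, and `language_code` are hypothetical and used only for illustration; they are not part of the Waves API, and the data covers only the `lightning-v3.1` row of the TTS table.

```python
# Illustrative only: every name here is hypothetical, not part of the Waves API.

# ISO 639-1 (two-letter) codes for the languages listed for lightning-v3.1.
ISO_639_1 = {
    "English": "en",
    "Hindi": "hi",
    "Tamil": "ta",
    "Spanish": "es",
}

# Languages per model ID, copied from the TTS table (lightning-v3.1 row only).
SUPPORTED = {
    "lightning-v3.1": ["English", "Hindi", "Tamil", "Spanish"],
}


def language_code(model_id: str, language: str) -> str:
    """Return the ISO 639-1 code for `language` if `model_id` lists it."""
    languages = SUPPORTED.get(model_id)
    if languages is None:
        raise ValueError(f"Unknown model ID: {model_id}")
    if language not in languages:
        raise ValueError(f"{model_id} does not list {language}")
    return ISO_639_1[language]


print(language_code("lightning-v3.1", "Tamil"))  # prints "ta"
```

Validating the model and language pair on the client side before sending a request can surface an unsupported combination early; extending the two dictionaries to the other models follows the same pattern.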

## Pricing

Our pricing model is designed to be flexible and scalable, catering to different usage needs. For detailed pricing information, please visit our [pricing page](https://smallest.ai/text-to-speech) or contact our sales team at [support@smallest.ai](mailto:support@smallest.ai).