Models
Text to Speech (TTS) Models
Latest Release Premium 44 kHz pool with improved naturalness and a curated voice catalog across American, British, and Indian accents. English + Hindi with code-switching. Same latency profile as standard Lightning v3.1; select via "model": "lightning_v3.1_pro" on the unified TTS routes.
A 44 kHz model delivering natural, expressive, and realistic speech. Supports voice cloning with ultra-low latency. 12 languages (English, Hindi, Spanish, and 9 Indian languages).
Lightning v2 is deprecated. New integrations should use Lightning v3.1 or Lightning v3.1 Pro. The v2 endpoints remain available for existing callers but are not recommended for new work.
Speech to Text (STT) Models
Latest Release English STT tied for #2 on the public Open ASR Leaderboard (5.42% ESB avg WER). Pre-recorded HTTP only. Select via ?model=pulse-pro on POST /waves/v1/stt/.
Low-latency multilingual speech recognition for real-time and pre-recorded transcription. Automatic language detection across 17 streaming + 26 pre-recorded languages. Select via ?model=pulse on POST /waves/v1/stt/ or WS /waves/v1/stt/live.
LLM Models
Speech to Speech (S2S) Models
API language fields use ISO 639-1 language codes (2-letter).
Pricing
Our pricing model is designed to be flexible and scalable, catering to different usage needs. For detailed pricing information, please visit our pricing page or contact our sales team at support@smallest.ai.

