OpenClaw | Smallest AI Docs

Add voice capabilities to your OpenClaw agent. Generate speech with sub-100ms latency and transcribe audio with the Smallest AI skill.

Installation

$ # Via ClawHub (recommended)
$ clawhub install smallest-ai
$ 
$ # Or manually
$ git clone https://github.com/smallest-inc/smallest-ai-openclaw.git
$ cp -r smallest-ai-openclaw ~/.openclaw/skills/smallest-ai

Setup

Set your API key:

$ export SMALLEST_API_KEY="your_key_here"

Get a free key at waves.smallest.ai.

Restart the gateway:

$ openclaw gateway stop && openclaw gateway start

Usage

The skill triggers automatically when you ask your agent to generate speech or transcribe audio. Just talk naturally:

Text-to-Speech:

“Say good morning in a male voice”
“Read this aloud: The meeting is at 3pm”
“Generate a voice note saying hello in Hindi”

Speech-to-Text:

“Transcribe this audio file”
“What did they say in this recording?”

Multilingual:

“Say ‘namaste, kaise hain aap’ in advika’s voice”
“Say ‘hola buenos dias’ using camilla”

Voices

The skill auto-selects voices based on your request:

Voice	Gender	Accent	Best For
`sophia`	Female	American	General use (default)
`robert`	Male	American	Professional (default male)
`advika`	Female	Indian	Hindi, code-switching
`vivaan`	Male	Indian	Bilingual English/Hindi
`camilla`	Female	Mexican/Latin	Spanish
`ella`	Female	American	Conversational
`mia`	Female	American	Storytelling
`arjun`	Male	Indian	English/Hindi bilingual
`vanessa`	Female	American	Expressive, warm