For AI agents: a documentation index is available at the root level at /llms.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
LogoLogo
DocumentationAPI ReferenceSelf HostModel CardsClient LibrariesIntegrationsDeveloper ToolsChangelog
DocumentationAPI ReferenceSelf HostModel CardsClient LibrariesIntegrationsDeveloper ToolsChangelog
  • Getting Started
    • Models
  • Text to Speech (Lightning)
    • Quickstart
    • Overview
    • Sync & Async
    • Streaming
    • Word Timestamps
    • Pronunciation Dictionaries
    • Voices & Languages
    • HTTP vs Streaming vs WebSockets
  • Speech to Text (Pulse)
    • Quickstart
    • Overview
  • Speech to Speech (Hydra)
    • Overview
    • Quickstart
    • WebSocket connection
    • Managing sessions
    • Audio I/O
    • Turn detection & barge-in
    • Tool calling
    • Prompting voice agents
    • Errors & reconnection
  • LLM (Electron)
    • Quickstart
    • Overview
    • Chat Completions
    • Streaming
    • Tool / Function Calling
    • Prefix Caching
    • Supported Parameters
    • Migrate from OpenAI
    • Best Practices
  • Cookbooks
    • Speech to Text
    • Text to Speech
    • Voice Agent (Electron + Pulse + Lightning)
  • Voice Cloning
    • Instant Clone (UI)
    • Instant Clone (API)
    • Instant Clone (Python SDK)
    • Delete Cloned Voice
  • Best Practices
    • Voice Cloning Best Practices
    • TTS Best Practices
  • Troubleshooting
    • Error reference
  • Models
  • Quickstart
  • Overview
  • Sync & Async
  • Streaming
  • Word Timestamps
  • Pronunciation Dictionaries
  • Voices & Languages
  • HTTP vs Streaming vs WebSockets
  • Performance
  • Metrics Overview
  • Quickstart
  • Overview
  • Quickstart
  • Audio Formats
  • Webhooks
  • Features
  • Troubleshooting
  • Best Practices
  • Code Examples
  • Quickstart
  • Response Format
  • Audio Formats
  • Features
  • Troubleshooting
  • Best Practices
  • Code Examples
  • Word Timestamps
  • Language Detection
  • Utterances
  • Diarization
  • Redaction
  • Gender Detection
  • Emotion Detection
  • Keyword Boosting
  • Punctuation Formatting
  • End-of-Utterance Timeout
  • Inverse Text Normalization
  • Finalize Control
  • VAD Events
  • Performance
  • Metrics Overview
  • Evaluation Walkthrough
  • Measuring Latency
  • Overview
  • Quickstart
  • WebSocket connection
  • Managing sessions
  • Audio I/O
  • Turn detection & barge-in
  • Tool calling
  • Prompting voice agents
  • Errors & reconnection
  • Performance
  • Metrics Overview
  • Quickstart
  • Overview
  • Chat Completions
  • Streaming
  • Tool / Function Calling
  • Prefix Caching
  • Supported Parameters
  • Migrate from OpenAI
  • Best Practices
  • Speech to Text
  • Text to Speech
  • Voice Agent (Electron + Pulse + Lightning)
  • Instant Clone (UI)
  • Instant Clone (API)
  • Instant Clone (Python SDK)
  • Delete Cloned Voice
  • Voice Cloning Best Practices
  • TTS Best Practices
  • Error reference
Cookbooks

Text to Speech Examples

||View as Markdown|

Explore complete, runnable examples from our cookbook repository.

Quickstart

Generate speech in 5 lines of code — the simplest way to start.

Streaming (SSE & WebSocket)

Real-time audio streaming with latency metrics and chunk-by-chunk playback.

Voice Explorer

List, filter, and preview 80+ voices by language, gender, and accent.

Pronunciation Dictionaries

Custom pronunciations for brand names, acronyms, and technical terms.

Podcast Generator

Give it a topic, get a two-host AI podcast with LLM-generated script.

Audiobook Generator

Convert any text file into a narrated, chaptered audiobook.

Voice Gallery App

Web app to browse and preview all voices — deploy to Vercel.

Language Translation App

Translate text between 40+ languages with TTS and STT.

Browse all examples on our GitHub repository.

Was this page helpful?
Previous

Speech to Text Examples

Next

Voice Agent (Electron + Pulse + Lightning)

Built with
Voice AgentsModels
Voice AgentsModels