Speech to Text Examples

Explore complete, runnable examples from our cookbook repository.

Real-time Microphone Transcription

Stream audio from your microphone over WebSocket and get real-time transcriptions.

Online Meeting Notetaker

Automatically transcribe online meetings with speaker identification and generate notes from the transcript.

Podcast Summarizer

Transcribe podcast episodes and generate concise summaries.

Subtitle Generation

Generate SRT/VTT subtitle files from audio and video content.
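
To give a sense of what subtitle generation involves, here is a minimal sketch that turns timed transcript segments into SRT and VTT text. It is not taken from the cookbook repository. The `(start_seconds, end_seconds, text)` segment shape and the function names are assumptions made for illustration, so map them onto whatever timestamps your transcription response actually returns.

```python
from typing import Iterable, Tuple

# Hypothetical segment shape: (start_seconds, end_seconds, text).
# The real transcription response may use different field names.
Segment = Tuple[float, float, str]


def format_timestamp(seconds: float, sep: str = ",") -> str:
    """Render seconds as HH:MM:SS,mmm (SRT) or HH:MM:SS.mmm (VTT)."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}{sep}{ms:03d}"


def to_srt(segments: Iterable[Segment]) -> str:
    """Build SRT text: numbered cues separated by a blank line."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        timing = f"{format_timestamp(start)} --> {format_timestamp(end)}"
        blocks.append(f"{index}\n{timing}\n{text.strip()}\n")
    return "\n".join(blocks)


def to_vtt(segments: Iterable[Segment]) -> str:
    """Build WebVTT text: a WEBVTT header followed by unnumbered cues."""
    cues = []
    for start, end, text in segments:
        timing = f"{format_timestamp(start, '.')} --> {format_timestamp(end, '.')}"
        cues.append(f"{timing}\n{text.strip()}\n")
    return "WEBVTT\n\n" + "\n".join(cues)


if __name__ == "__main__":
    demo = [(0.0, 1.5, "Hello"), (1.5, 3.0, "World")]
    print(to_srt(demo))
    print(to_vtt(demo))
```

The two formats differ mainly in the millisecond separator (comma for SRT, period for VTT) and in the `WEBVTT` header line, which is why a single timestamp helper can serve both.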

Browse all examples on our GitHub repository.
