For AI agents: a documentation index is available at the root level at /llms.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
LogoLogo
DocumentationAPI ReferenceSelf HostModel CardsClient LibrariesIntegrationsDeveloper ToolsChangelog
DocumentationAPI ReferenceSelf HostModel CardsClient LibrariesIntegrationsDeveloper ToolsChangelog
  • Getting Started
    • Models
  • Text to Speech (Lightning)
    • Quickstart
    • Overview
    • Sync & Async
    • Streaming
    • Word Timestamps
    • Pronunciation Dictionaries
    • Voices & Languages
    • HTTP vs Streaming vs WebSockets
  • Speech to Text (Pulse)
    • Quickstart
    • Overview
      • Quickstart
      • Audio Formats
      • Webhooks
      • Features
      • Troubleshooting
      • Best Practices
      • Code Examples
  • Speech to Speech (Hydra)
    • Overview
    • Quickstart
    • WebSocket connection
    • Managing sessions
    • Audio I/O
    • Turn detection & barge-in
    • Tool calling
    • Prompting voice agents
    • Errors & reconnection
  • LLM (Electron)
    • Quickstart
    • Overview
    • Chat Completions
    • Streaming
    • Tool / Function Calling
    • Prefix Caching
    • Supported Parameters
    • Migrate from OpenAI
    • Best Practices
  • Cookbooks
    • Speech to Text
    • Text to Speech
    • Voice Agent (Electron + Pulse + Lightning)
  • Voice Cloning
    • Instant Clone (UI)
    • Instant Clone (API)
    • Instant Clone (Python SDK)
    • Delete Cloned Voice
  • Best Practices
    • Voice Cloning Best Practices
    • TTS Best Practices
  • Troubleshooting
    • Error reference
  • Models
  • Quickstart
  • Overview
  • Sync & Async
  • Streaming
  • Word Timestamps
  • Pronunciation Dictionaries
  • Voices & Languages
  • HTTP vs Streaming vs WebSockets
  • Performance
  • Metrics Overview
  • Quickstart
  • Overview
  • Quickstart
  • Audio Formats
  • Webhooks
  • Features
  • Troubleshooting
  • Best Practices
  • Code Examples
  • Quickstart
  • Response Format
  • Audio Formats
  • Features
  • Troubleshooting
  • Best Practices
  • Code Examples
  • Word Timestamps
  • Language Detection
  • Utterances
  • Diarization
  • Redaction
  • Gender Detection
  • Emotion Detection
  • Keyword Boosting
  • Punctuation Formatting
  • End-of-Utterance Timeout
  • Inverse Text Normalization
  • Finalize Control
  • VAD Events
  • Performance
  • Metrics Overview
  • Evaluation Walkthrough
  • Measuring Latency
  • Overview
  • Quickstart
  • WebSocket connection
  • Managing sessions
  • Audio I/O
  • Turn detection & barge-in
  • Tool calling
  • Prompting voice agents
  • Errors & reconnection
  • Performance
  • Metrics Overview
  • Quickstart
  • Overview
  • Chat Completions
  • Streaming
  • Tool / Function Calling
  • Prefix Caching
  • Supported Parameters
  • Migrate from OpenAI
  • Best Practices
  • Speech to Text
  • Text to Speech
  • Voice Agent (Electron + Pulse + Lightning)
  • Instant Clone (UI)
  • Instant Clone (API)
  • Instant Clone (Python SDK)
  • Delete Cloned Voice
  • Voice Cloning Best Practices
  • TTS Best Practices
  • Error reference
On this page
  • Available Features
Speech to Text (Pulse)Pre-Recorded

Features

||View as Markdown|

The Pre-Recorded Pulse STT API supports the following features:

Available Features

Word Timestamps

Get precise timing information for each word in the transcription

Language Detection

Automatically detect the language of the audio

Diarization

Identify and label different speakers in the audio

Gender Detection

Predict speaker gender alongside transcription

Emotion Detection

Detect emotional tone in the transcribed speech

Redaction

Automatically redact sensitive information from transcriptions

Utterances

Segment transcription into meaningful utterances (requires word_timestamps)

Keyword Boosting is available on the Real-Time WebSocket API only. It is not supported on the pre-recorded HTTP endpoint.

Was this page helpful?
Previous

Webhooks

Next

Troubleshooting

Built with
Voice AgentsModels
Voice AgentsModels