For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
DocumentationAPI ReferenceSelf HostModel CardsClient LibrariesIntegrationsDeveloper ToolsChangelog
DocumentationAPI ReferenceSelf HostModel CardsClient LibrariesIntegrationsDeveloper ToolsChangelog
  • Getting Started
    • Introduction
    • Models
    • Authentication
  • Text to Speech (Lightning)
    • Quickstart
    • Overview
    • Sync & Async
    • Streaming
    • Pronunciation Dictionaries
    • Voices & Languages
    • HTTP vs Streaming vs WebSockets
  • Speech to Text (Pulse)
    • Quickstart
    • Overview
      • Quickstart
      • Audio Formats
      • Webhooks
      • Features
      • Troubleshooting
      • Best Practices
      • Code Examples
  • LLM (Electron)
    • Quickstart
    • Overview
    • Chat Completions
    • Streaming
    • Tool / Function Calling
    • Prefix Caching
    • Supported Parameters
    • Migrate from OpenAI
    • Best Practices
  • Cookbooks
    • Speech to Text
    • Text to Speech
    • Voice Agent (Electron + Pulse + Lightning)
  • Voice Cloning
    • Instant Clone (UI)
    • Instant Clone (API)
    • Instant Clone (Python SDK)
    • Delete Cloned Voice
  • Best Practices
    • Voice Cloning Best Practices
    • TTS Best Practices
  • Troubleshooting
    • Error reference
LogoLogo
Voice AgentsModels
Voice AgentsModels
On this page
  • Available Features
Speech to Text (Pulse)Pre-Recorded

Features

||View as Markdown|
Was this page helpful?
Previous

Webhooks

Next

Troubleshooting

Built with

The Pre-Recorded Pulse STT API supports the following features:

Available Features

Word Timestamps

Get precise timing information for each word in the transcription

Language Detection

Automatically detect the language of the audio

Diarization

Identify and label different speakers in the audio

Gender Detection

Predict speaker gender alongside transcription

Emotion Detection

Detect emotional tone in the transcribed speech

Redaction

Automatically redact sensitive information from transcriptions

Utterances

Segment transcription into meaningful utterances (requires word_timestamps)

Keyword Boosting is available on the Real-Time WebSocket API only. It is not supported on the pre-recorded HTTP endpoint.