DocumentationAPI ReferenceSelf HostModel CardsClient LibrariesDeveloper ToolsChangelog
DocumentationAPI ReferenceSelf HostModel CardsClient LibrariesDeveloper ToolsChangelog
  • Getting Started
    • Introduction
    • Models
    • Authentication
  • Text to Speech (Lightning)
    • Quickstart
    • Overview
    • Sync & Async
    • Streaming
    • Pronunciation Dictionaries
    • Voices & Languages
    • HTTP vs Streaming vs WebSockets
  • Speech to Text (Pulse)
    • Quickstart
    • Overview
      • Quickstart
      • Audio Formats
      • Webhooks
      • Features
      • Troubleshooting
      • Best Practices
      • Code Examples
  • Cookbooks
    • Speech to Text
    • Text to Speech
  • Voice Cloning
    • Instant Clone (UI)
    • How to Voice Clone
    • Delete Cloned Voice
  • Integrations
    • Vercel AI SDK
    • OpenClaw
    • LiveKit
    • Pipecat
    • Plivo
    • Vonage
    • n8n
  • Best Practices
    • Voice Cloning Best Practices
    • TTS Best Practices
On this page
  • Available Features
Speech to Text (Pulse)Pre-Recorded

Features

||View as Markdown|

The Pre-Recorded Pulse STT API supports the following features:

Available Features

Word Timestamps

Get precise timing information for each word in the transcription

Language Detection

Automatically detect the language of the audio

Diarization

Identify and label different speakers in the audio

Age & Gender Detection

Predict demographic attributes alongside transcription

Emotion Detection

Detect emotional tone in the transcribed speech

Full Transcript

Get the complete transcription of the audio

Redaction

Automatically redact sensitive information from transcriptions

Numeric Formatting

Format numbers, dates, and currencies in transcriptions

Utterances

Segment transcription into meaningful utterances (requires word_timestamps)

Keyword Boosting is available on the Real-Time WebSocket API only. It is not supported on the pre-recorded HTTP endpoint.

Was this page helpful?
Edit this page
Previous

Webhooks

Next

Troubleshooting

Built with
LogoLogo
Voice AgentsModels
Voice AgentsModels