Quickstart
This guide shows you how to convert an audio file into text using the unified Speech-to-Text endpoint. Both Pulse (multilingual, 38 languages) and Pulse Pro (leaderboard-ranked English) live behind the same path; you pick the model with ?model=.
Pre-Recorded Audio
Transcribe pre-recorded audio files using synchronous HTTPS POST requests. Perfect for batch processing, archived media, and offline transcription workflows.
The Pre-Recorded API takes an audio file and returns a complete transcript in a single request. Send raw bytes or, for the Pulse model, a URL.
Pick a model
See the Pulse Pro model card and Pulse model card for full benchmarks and feature matrices.
Endpoint
The existing path POST /waves/v1/pulse/get_text continues to work alongside the new unified path.
Authentication
Head over to the smallest console to generate an API key, if not done previously. Also look at the Authentication guide for more information about API keys.
Include your API key in the Authorization header:
Example Request: Pulse Pro (English)
Send raw audio bytes against ?model=pulse-pro. Word timestamps add per-word timing and confidence scores; omit for higher throughput.
Async via webhook (Pulse Pro)
For long audio files where you do not want to hold an HTTP connection open, pass webhook_url. The endpoint returns 200 immediately with {"status": "processing", "request_id": "..."}; the transcription hits your webhook when ready.
Example Request: Pulse (multilingual)
For non-English audio, code-switching, or when you need streaming, use ?model=pulse. Set language explicitly to the known code (en, hi, es, etc.) for best accuracy, or use a multi-* aggregator for unknown audio.
Raw audio bytes
Audio URL (Pulse only)
Pulse also accepts a URL for audio hosted in cloud storage. Pulse Pro does not support audio-by-URL.
For Pulse, set language explicitly to match the audio (en, hi, es, etc.) for the best accuracy. For unknown audio, pick the regional auto-detect scope: multi-eu (de, en, fr, it, nl, pt, ru, es), multi-indic (en, hi, mr, pa, gu, or, ka, ta, te, ml, bn), multi-asian (en, ja, ko, zh, yue), or multi for full multilingual auto-detection.
Example Response
Pulse Pro
Pulse
Full runnable source files: Python | JavaScript | cURL
Next Steps
- Learn about supported audio formats.
- Decide which enrichment options to enable in the features guide.
- Configure asynchronous callbacks with webhooks.
- Review a full code example here.

