The transcription endpoint converts audio files to text using Lightning ASR. Supports both batch processing and streaming.
Requires Bearer token authentication with your license key.
See Authentication for details.
Transcribe audio from a publicly accessible URL:
Upload audio directly:
URL to audio file (mutually exclusive with file upload)
Supported protocols: http://, https://, s3://
Audio file upload (mutually exclusive with URL)
Supported formats: WAV, MP3, FLAC, OGG, M4A
Language code (ISO 639-1)
Examples: en, es, fr, de, zh
Add punctuation to transcript
Enable speaker diarization (identify different speakers)
Expected number of speakers (for diarization)
If not specified, automatically detected
Include word-level timestamps
Webhook URL for async results delivery
If provided, returns immediately with job ID
Response:
Response:
Immediate response:
Later, webhook receives:
For best results:
Default rate limits:
Contact support@smallest.ai to increase limits for your license.
Typical performance metrics:
Performance varies based on:
Implement retry logic with exponential backoff:
For audio longer than 5 minutes, use callback URL:
Cache transcription results to avoid duplicate processing: