Quick Start
Overview
This guide walks you through deploying Smallest Self-Host using Docker Compose. You’ll have a fully functional speech-to-text service running in under 15 minutes.
Ensure you’ve completed all prerequisites before starting this guide.
Step 1: Create Project Directory
Create a directory for your deployment:
Step 2: Login to Container Registry
Authenticate with the Smallest container registry using credentials provided by support:
Enter your username and password when prompted.
Save your credentials securely. You’ll need them if you restart or redeploy the containers.
Step 3: Create Environment File
Create a .env file with your license key:
Replace your-license-key-here with the actual license key provided by Smallest.ai.
Never commit your .env file to version control. Add it to .gitignore if
using git.
Step 4: Create Docker Compose File
Lightning ASR (Standard)
Best for: Fast inference, real-time applications
Create a docker-compose.yml file:
Step 5: Additional Configuration for Lightning ASR
Lightning ASR
Add the model URL to your .env file (required for Lightning ASR):
The MODEL_URL is provided by Smallest.ai support.
Step 6: Start Services
Launch all services with Docker Compose:
Step 7: Monitor Startup
Watch the logs to monitor startup progress:
Look for these success indicators:
Common Startup Issues
GPU Not Found
Error: could not select device driver "nvidia"
Solution:
If this fails, reinstall NVIDIA Container Toolkit.
License Validation Failed
Error: License validation failed
Solution:
- Verify LICENSE_KEY in
.envis correct - Check internet connectivity
- Ensure firewall allows HTTPS to console-api.smallest.ai
Model Download Failed
Error: Failed to download model
Solution:
- Verify MODEL_URL in
.envis correct - Check disk space:
df -h - Check internet connectivity
Port Already in Use
Error: port is already allocated
Solution: Check what’s using the port:
Either stop the conflicting service or change the port in docker-compose.yml
Managing Your Deployment
Stop Services
Restart Services
View Logs
Examples:
Update Images
Pull latest images and restart:
Remove Deployment
Stop and remove all containers:
Remove containers and volumes (including downloaded models):
Using -v flag will delete all data including downloaded models. They will
need to be re-downloaded on next startup.

