The TTS Docker deployment consists of four main services that work together:
The API Server is the main entry point for all client requests.
quay.io/smallestinc/self-hosted-api-server:latest
7100 - Main API endpoint
Key log messages:
The core text-to-speech engine powered by GPU acceleration.
quay.io/smallestinc/lightning-tts:latest
8876 - TTS service endpoint
Lightning TTS requires an NVIDIA GPU with CUDA support:
On first startup, Lightning TTS:
Models are embedded in the container, so no separate download is needed.
Key log messages:
Typical performance metrics:
Validates license keys and reports usage to Smallest servers.
quay.io/smallestinc/license-proxy:latest
3369 - License validation endpoint (internal)
License Proxy requires outbound HTTPS access to:
api.smallest.ai on port 443
Ensure your firewall allows these connections.
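The outbound requirement above can be checked from the Docker host before starting the stack. The following is a minimal sketch, assuming `curl` is installed on the host; the function name `check_license_egress` and the 10-second timeout are illustrative choices, and only the host name and port come from this page.

```shell
# Check outbound HTTPS reachability to the license server.
# Only the host (api.smallest.ai) and port (443) come from this page;
# the function name and the timeout are illustrative assumptions.
check_license_egress() {
  # curl exits 0 when the TLS connection and HTTP exchange complete;
  # the HTTP status code itself is not inspected here.
  if curl --silent --output /dev/null --max-time 10 "https://api.smallest.ai"; then
    echo "reachable"
  else
    echo "unreachable"
    return 1
  fi
}
```

Run `check_license_egress` on the host that will run License Proxy. An `unreachable` result usually points at a firewall or proxy rule rather than at the container itself.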
License Proxy reports only metadata:
No audio or transcript data is transmitted to Smallest servers.
If the connection to the license server fails:
Key log messages:
Provides caching and state management for the system.
redis:latest or redis:7-alpine
6379 - Redis protocol
Default configuration with minimal setup:
Redis stores:
Data in Redis is temporary and can be safely cleared. No persistent state is stored.
Built-in health check:
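Redis can also be probed by hand. The sketch below uses `redis-cli ping`, which returns `PONG` when the server is responding; whether the built-in check uses the same command is not stated here. The container name `redis` is a placeholder, and the function name is illustrative.

```shell
# Probe a Redis container manually.
# "redis" is a placeholder container name (an assumption); pass your own as $1.
redis_is_healthy() {
  container="${1:-redis}"
  # redis-cli prints PONG when the server answers the PING command.
  [ "$(docker exec "$container" redis-cli ping 2>/dev/null)" = "PONG" ]
}
```

For example, `redis_is_healthy my-redis && echo ok` prints `ok` only when the server answers.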
Startup order and dependencies:
For development/testing:
For production workloads:
Check container status:
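As a minimal sketch, assuming the services run as Docker containers on the local host, the status check can be wrapped in a small helper. The function name is illustrative; `docker ps` and its `--format` option are standard Docker CLI features.

```shell
# List running containers with their status and published ports.
# The function name is an illustrative assumption.
show_container_status() {
  docker ps --format 'table {{.Names}}\t{{.Status}}\t{{.Ports}}'
}
```

All four services should appear in the output; a container that is missing or restarting is the first thing to investigate.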
Monitor resource consumption:
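One way to take a single snapshot of CPU and memory use per container is `docker stats` with `--no-stream`. This is a sketch under the assumption that the containers run on the local Docker daemon; the function name is illustrative.

```shell
# Print one snapshot of CPU and memory usage for every running container.
# The function name is an illustrative assumption.
show_resource_usage() {
  docker stats --no-stream --format 'table {{.Name}}\t{{.CPUPerc}}\t{{.MemUsage}}'
}
```

Dropping `--no-stream` turns the snapshot into a continuously refreshing view.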
Monitor GPU utilization:
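GPU load can be read on the host with `nvidia-smi`. The sketch below assumes the NVIDIA driver tools are installed on the host; the function name and the `NVIDIA_SMI` override variable are illustrative and exist only to make the command easy to swap out.

```shell
# Report GPU utilization and memory use in CSV form, one line per GPU.
# NVIDIA_SMI is a hypothetical override for the binary path (an assumption).
gpu_utilization() {
  "${NVIDIA_SMI:-nvidia-smi}" \
    --query-gpu=utilization.gpu,memory.used,memory.total \
    --format=csv,noheader
}
```

Wrapping the call in `watch -n 2` on the host gives a rolling view while Lightning TTS is serving requests.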
View service logs:
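A minimal sketch for reading recent log output from one service, assuming you know the container name from `docker ps`. The function name and the default of 100 lines are illustrative; container names depend on your deployment and are not specified on this page.

```shell
# Show the last N log lines of a container.
# $1: container name (deployment-specific), $2: number of lines (default 100).
show_service_logs() {
  container="$1"
  lines="${2:-100}"
  docker logs --tail "$lines" "$container"
}
```

For example, `show_service_logs my-api-server 50` prints the last 50 lines, where `my-api-server` is a placeholder name. Adding `--follow` to the underlying `docker logs` call streams new entries as they arrive.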