Common Issues
GPU Not Accessible
Symptoms:
Error: could not select device driver "nvidia"
Error: no NVIDIA GPU devices found
Lightning TTS fails to start
Diagnosis:
Solution 1: Restart Docker Solution 2: Reinstall NVIDIA Container Toolkit Solution 3: Update NVIDIA Driver If driver version is below 470, update:
Solution 4: Check Docker Daemon Configuration Verify /etc/docker/daemon.json contains:
Restart Docker after changes:
License Validation Failed
Symptoms:
Error: License validation failed
Error: Invalid license key
Services fail to start
Diagnosis:
Check license-proxy logs:
Solution 1: Verify License Key Check .env file:
Ensure there are no:
Extra spaces
Quotes around the key
Line breaks
Correct format:
Solution 2: Check Network Connectivity Test connection to license server:
If this fails, check:
Firewall rules
Proxy settings
DNS resolution
Solution 3: Contact Support If the key appears correct and network is accessible, your license may be:
Contact support@smallest.ai with:
Your license key
License-proxy logs
Error messages
Model Loading Failed
Symptoms:
Lightning TTS stuck at startup
Error: Failed to load model
Container keeps restarting
Diagnosis:
Check Lightning TTS logs:
Solution 1: Check GPU Memory Verify GPU has enough VRAM:
Lightning TTS requires minimum 16GB VRAM.
Solution 2: Check Disk Space Models require space:
Free up space if needed:
Solution 3: Increase Startup Time Models may need more time to load:
Port Already in Use
Symptoms:
Error: port is already allocated
Error: bind: address already in use
Diagnosis:
Find what’s using the port:
Solution 1: Stop Conflicting Service If another service is using the port:
Or kill the process:
Solution 2: Change Port Solution 3: Remove Old Containers Old containers may still be bound:
Out of Memory
Symptoms:
Container killed unexpectedly
Error: OOMKilled
System becomes unresponsive
Diagnosis:
Check container status:
Solution 1: Increase System Memory Lightning TTS requires minimum 16 GB RAM
Check current memory:
Solution 2: Add Memory Limits Prevent one service from consuming all memory:
Solution 3: Enable Swap Add swap space (temporary solution):
Symptoms:
High latency (>500ms)
Low throughput
GPU underutilized
Diagnosis:
Monitor GPU usage:
Check container resources:
Solution 1: Optimize GPU Usage Ensure GPU is not throttling:
Enable persistence mode:
Solution 2: Increase CPU Allocation Solution 3: Optimize Redis Use Redis with persistence disabled for speed:
Best Practices
1 Enable GPU Persistence Mode Reduces GPU initialization time:
2 Optimize Container Resources Allocate appropriate CPU/memory:
3 Monitor and Tune Use monitoring tools:
Benchmark Your Deployment
Test TTS performance:
Expected performance:
Cold start : First request after container start (5-10 seconds)
Warm requests : Subsequent requests (100-300ms)
Real-time factor : 0.1-0.3x
View All Logs
Follow Specific Service
Last N Lines
Save Logs to File
Execute Commands in Container
Check Container Configuration
Network Debugging
Test connectivity between containers:
Health Checks
API Server
Expected: {"status": "healthy"}
Lightning TTS
Expected: {"status": "ready", "gpu": "NVIDIA A10"}
License Proxy
Expected: {"status": "valid"}
Redis
Expected: PONG
Getting Help
Collect the following information:
4 Configuration Sanitize and include:
docker-compose.yml
.env (remove license key)
Email: support@smallest.ai
Include:
Description of the issue
Steps to reproduce
System information
Logs and configuration
License key (via secure channel)
What’s Next?