Health Check
Overview
The health check endpoint provides a simple way to verify that the API server is running and responsive. Use this for monitoring, load balancer health checks, and readiness probes.
Endpoint
Authentication
No authentication required. This endpoint is publicly accessible.
Request
Simple GET request with no parameters:
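As a minimal sketch of the request, the Python snippet below builds a parameterless GET using only the standard library. The `/health` path is inferred from the example URL in the Uptime Monitoring section; `api.example.com` and the helper names `build_health_request` and `send` are illustrative assumptions, not part of the API.

```python
import urllib.request


def build_health_request(base_url: str) -> urllib.request.Request:
    """Build a GET request for the health endpoint (no params, no auth headers)."""
    # Assumption: the endpoint lives at <base_url>/health.
    url = base_url.rstrip("/") + "/health"
    return urllib.request.Request(url, method="GET")


def send(request: urllib.request.Request, timeout: float = 5.0) -> int:
    """Send the request and return the status code of a successful response."""
    # Note: urlopen raises urllib.error.HTTPError for 4xx/5xx responses.
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.status


# Example (requires a reachable server):
#   status = send(build_health_request("https://api.example.com"))
```

No query string, body, or credentials are attached, which matches the description of the endpoint as unauthenticated and parameter-free.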
Response
Healthy Response
HTTP Status: 200 OK
Unhealthy Response
HTTP Status: 503 Service Unavailable
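The two documented status codes can be mapped to a health state as sketched below. Python's `urllib` raises `HTTPError` for a 503, so the unhealthy case is handled in an exception branch. The function name `probe`, the extra `"unreachable"` state, and the injectable `opener` parameter are assumptions made for illustration.

```python
import urllib.error
import urllib.request
from typing import Callable


def probe(url: str, timeout: float = 5.0,
          opener: Callable = urllib.request.urlopen) -> str:
    """Return 'healthy', 'unhealthy', or 'unreachable' for a health URL."""
    try:
        with opener(url, timeout=timeout) as response:
            # 200 OK is the documented healthy response.
            return "healthy" if response.status == 200 else "unhealthy"
    except urllib.error.HTTPError:
        # urllib raises for 4xx/5xx, including 503 Service Unavailable.
        return "unhealthy"
    except (urllib.error.URLError, OSError):
        # DNS failure, refused connection, or timeout: no response at all.
        return "unreachable"
```

Separating "unreachable" from "unhealthy" is a design choice of this sketch: a 503 means the server answered and reported a problem, while a connection failure usually points at the network or the process itself.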
Use Cases
Load Balancer Health Checks
Configure your load balancer to use the health endpoint:
AWS ALB/NLB
Kubernetes Service
Kubernetes Liveness Probe
Monitor pod health in Kubernetes:
Kubernetes Readiness Probe
Determine when the pod is ready to receive traffic:
Monitoring and Alerting
Monitor service availability:
Prometheus
Python Script
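A monitoring script along these lines could poll the endpoint and report state changes. This is a sketch under stated assumptions: the URL is the example one used elsewhere in this document, while `check`, `monitor`, and the `on_change` callback are hypothetical names, and real alerting would replace the `print` call.

```python
import time
import urllib.request
from typing import Callable, Optional


def check(url: str, timeout: float = 5.0) -> bool:
    """Return True only when the endpoint answers 200 OK within the timeout."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.status == 200
    except OSError:
        # Covers HTTPError (e.g. 503), URLError, and socket timeouts.
        return False


def _print_state(ok: bool) -> None:
    # Placeholder alert hook; swap in email, paging, or a chat webhook.
    print("health:", "OK" if ok else "FAILING")


def monitor(url: str, interval: float = 30.0,
            max_checks: Optional[int] = None,
            check_fn: Callable[[str], bool] = check,
            on_change: Callable[[bool], None] = _print_state,
            sleep: Callable[[float], None] = time.sleep) -> None:
    """Poll the health URL and call on_change whenever the state flips."""
    last: Optional[bool] = None
    done = 0
    while max_checks is None or done < max_checks:
        ok = check_fn(url)
        if ok != last:
            on_change(ok)  # fires on the first check and on every transition
            last = ok
        done += 1
        if max_checks is None or done < max_checks:
            sleep(interval)


# Example: monitor("https://api.example.com/health", interval=30)
```

Reporting only on transitions keeps the output quiet during long healthy periods and avoids repeated alerts during a sustained outage.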
Bash Script
Uptime Monitoring
Integrate with uptime monitoring services:
UptimeRobot
Pingdom
Datadog
- Monitor Type: HTTP(s)
- URL: https://api.example.com/health
- Keyword: healthy
- Interval: 5 minutes
Advanced Health Checks
Detailed Health Status
For more detailed health information, add a query parameter:
Response:
Component-Specific Checks
Check individual components:
Integration Examples
Docker Compose Healthcheck
Kubernetes Deployment
Automated Testing
Include health checks in CI/CD:
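In a pipeline, a common pattern is to wait for the freshly deployed service to report healthy before running further tests, and to fail the job otherwise. The sketch below assumes a hypothetical `wait_until_healthy` helper and takes the probe as a parameter so it can wrap whatever check the pipeline already uses; the default deadline and interval are illustrative values.

```python
import time
from typing import Callable


def wait_until_healthy(probe: Callable[[], bool],
                       deadline_s: float = 120.0,
                       interval_s: float = 5.0,
                       clock: Callable[[], float] = time.monotonic,
                       sleep: Callable[[float], None] = time.sleep) -> bool:
    """Poll probe() until it returns True or the deadline would be exceeded."""
    start = clock()
    while True:
        if probe():
            return True
        if clock() - start + interval_s > deadline_s:
            # The next attempt would land past the deadline: give up.
            return False
        sleep(interval_s)


# In a CI step, turn the result into an exit code, for example:
#   sys.exit(0 if wait_until_healthy(my_probe) else 1)
```

The clock and sleep functions are injectable so the helper can be unit tested without real waiting.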
Best Practices
Set Appropriate Timeouts
Configure reasonable timeouts:
- Timeout: 5 seconds max
- Interval: 10-30 seconds
- Retries: 3-5 attempts
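The recommended values above can be captured in a small policy object. The sketch below treats an instance as unhealthy only when the most recent checks have all failed, which is one common reading of "retries"; the `HealthCheckPolicy` class and its field names are assumptions, not settings exposed by this API.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class HealthCheckPolicy:
    timeout_s: float = 5.0    # per-request timeout (5 seconds max)
    interval_s: float = 10.0  # time between checks (10-30 seconds)
    retries: int = 3          # consecutive failures before marking unhealthy (3-5)

    def __post_init__(self) -> None:
        # Enforce the ranges recommended in this section.
        if not 0 < self.timeout_s <= 5:
            raise ValueError("timeout_s must be in (0, 5]")
        if not 10 <= self.interval_s <= 30:
            raise ValueError("interval_s must be in [10, 30]")
        if not 3 <= self.retries <= 5:
            raise ValueError("retries must be in [3, 5]")

    def is_unhealthy(self, results: Sequence[bool]) -> bool:
        """True when the last `retries` check results are all failures."""
        recent = list(results)[-self.retries:]
        return len(recent) == self.retries and not any(recent)
```

Requiring several consecutive failures avoids removing an instance from rotation because of a single slow or dropped check.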
Use in Load Balancers
Always configure health checks in load balancers:
- Prevents traffic to unhealthy instances
- Enables automatic failover
- Reduces user-facing errors
Monitor Continuously
Set up continuous monitoring:
- External uptime monitoring
- Internal health checks
- Alerting on failures
Test Failure Scenarios
Regularly test health check behavior:
Verify:
- Health check fails
- Load balancer stops routing traffic to the unhealthy instance
- New pod becomes ready
- Health check succeeds
Troubleshooting
Health Check Failing
Check API server logs:
Common causes:
- Lightning ASR not available
- License proxy down
- Redis connection failed
Solutions:
- Verify that all components are running
- Check service connectivity
- Review component logs
False Positives
Symptoms: The health endpoint returns 200 but requests fail
Solutions:
- Use detailed health checks
- Test actual transcription endpoint
- Monitor error rates
Timeout Issues
Symptoms: Health checks timing out
Solutions:
- Increase timeout values
- Check network latency
- Verify that no network policies are blocking health check traffic

