For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
DocumentationAPI ReferenceSelf HostModel CardsClient LibrariesIntegrationsDeveloper ToolsChangelog
DocumentationAPI ReferenceSelf HostModel CardsClient LibrariesIntegrationsDeveloper ToolsChangelog
  • Getting Started
    • Introduction
    • Prerequisites
    • Why Self-Host?
    • Architecture
  • Docker Setup
  • Kubernetes Setup
    • Quick Start
    • Troubleshooting
  • Troubleshooting
    • Common Issues
    • Debugging Guide
    • Logs Analysis
  • API Reference
    • Authentication
    • Examples
LogoLogo
Voice AgentsModels
Voice AgentsModels
On this page
  • Overview
  • Credentials from Smallest.ai
  • Infrastructure Requirements
  • Minimum Resources
  • Network Requirements
  • Next Steps
Getting Started

Prerequisites

||View as Markdown|
Was this page helpful?
Previous

Introduction

Next

Why Self-Host?

Built with

Overview

Before deploying Smallest Self-Host, you’ll need credentials from Smallest.ai and infrastructure with GPU support.

Credentials from Smallest.ai

Contact support@smallest.ai to obtain the following:

License Key

Your unique license key for validation. This is required for all deployments.

You’ll add this to your configuration:

1global:
2 licenseKey: "your-license-key-here"

Or as an environment variable:

$LICENSE_KEY=your-license-key-here
Container Registry Credentials

Credentials to pull Docker images from quay.io:

  • Username
  • Password
  • Email

Login to the registry:

$docker login quay.io

For Kubernetes, you’ll add these to your values.yaml:

1global:
2 imageCredentials:
3 create: true
4 registry: quay.io
5 username: "your-username"
6 password: "your-password"
7 email: "your-email@example.com"
Model Download URLs

Download URLs for the AI models (STT and/or TTS).

For Docker deployments, add to your .env:

$MODEL_URL=your-model-url-here

For Kubernetes, add to values.yaml:

1models:
2 asrModelUrl: "your-asr-model-url"
3 ttsModelUrl: "your-tts-model-url"

Infrastructure Requirements

GPU Requirements
  • NVIDIA GPU with 16+ GB VRAM
  • Recommended: A10, L4, L40s, T4, or A100
  • NVIDIA Driver 525+ (for A10, A100, L4)
  • NVIDIA Driver 470+ (for T4, V100)
Container Runtime
  • Docker 20.10+ or Podman 4.0+
  • NVIDIA Container Toolkit
  • For Kubernetes: GPU Operator or Device Plugin

Minimum Resources

ComponentCPUMemoryGPUStorage
Lightning ASR4-8 cores12-16 GB1x NVIDIA (16+ GB VRAM)50+ GB
Lightning TTS4-8 cores12-16 GB1x NVIDIA (16+ GB VRAM)20+ GB
API Server0.5-2 cores512 MB - 2 GBNone1 GB
License Proxy0.25-1 core256-512 MBNone100 MB
Redis0.5-1 core512 MB - 2 GBNone1 GB

Network Requirements

The License Proxy requires outbound HTTPS access to validate licenses:

EndpointPortPurpose
api.smallest.ai443License validation and usage reporting

Ensure your firewall and network policies allow outbound HTTPS traffic to api.smallest.ai.

Next Steps

Choose your deployment method and follow the specific prerequisites:

Docker Prerequisites

Setup requirements for Docker deployments including NVIDIA Container Toolkit installation.

Kubernetes Prerequisites

Cluster requirements, GPU node setup, and Helm configuration for Kubernetes deployments.