
Hydra

May 20, 2026

Hydra S2S — concurrency limits + per-org rate metering

Hydra S2S sessions now route through a per-org concurrency feature in the billing service. This puts an explicit ceiling on simultaneous live voice sessions per org and meters usage for billing.

What changed:

  • New feature-waves-hydra-concurrency plan feature: caps the number of concurrent open WebSocket sessions to wss://api.smallest.ai/waves/v1/s2s per org.
  • Sessions opened beyond the cap receive an error event with code: "server_full" followed by close code 1013. Back off with jitter and retry.
  • Usage is metered against your billing entitlement; check the usage field on response.done for per-turn token counts.
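One way to handle the server_full path described above is exponential backoff with full jitter. The sketch below is illustrative only and makes several assumptions that this changelog does not confirm: that the error event arrives as a JSON text frame with a "code" key (either top-level or nested under an "error" object), and that your client exposes some function that opens a session. The names ServerFull, is_server_full, backoff_delay, connect_with_retry, and open_session are hypothetical helpers, not part of any Hydra SDK.

```python
import json
import random
import time

SERVER_FULL_CODE = "server_full"  # error code named in the changelog
TRY_AGAIN_LATER = 1013            # WebSocket close code named in the changelog


class ServerFull(Exception):
    """Hypothetical exception your client raises when the cap is hit."""


def is_server_full(raw_event):
    """Return True if a JSON frame looks like the server_full error event.

    Assumption: the event is a JSON object with a "code" key, either at the
    top level or nested under an "error" object. Adjust to the real schema.
    """
    try:
        event = json.loads(raw_event)
    except (TypeError, ValueError):
        return False
    if not isinstance(event, dict):
        return False
    if event.get("code") == SERVER_FULL_CODE:
        return True
    error = event.get("error")
    return isinstance(error, dict) and error.get("code") == SERVER_FULL_CODE


def backoff_delay(attempt, base=0.5, cap=30.0, rng=random):
    """Full-jitter backoff: uniform in [0, min(cap, base * 2**attempt)]."""
    ceiling = min(cap, base * (2 ** attempt))
    return rng.uniform(0.0, ceiling)


def connect_with_retry(open_session, max_attempts=6, sleep=time.sleep, rng=random):
    """Call open_session() until it succeeds or attempts run out.

    open_session is a caller-supplied (hypothetical) callable that returns a
    live session, or raises ServerFull when it sees the server_full event or
    close code 1013.
    """
    for attempt in range(max_attempts):
        try:
            return open_session()
        except ServerFull:
            if attempt == max_attempts - 1:
                raise  # give up and surface the error to the caller
            sleep(backoff_delay(attempt, rng=rng))
```

The base delay, cap, and attempt count here are arbitrary starting points rather than recommended values. Randomizing the full delay window spreads out reconnects from many clients in the same org, so they are less likely to hit the cap again at the same moment.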

Migration: no action required. Existing Hydra integrations keep working. If you hit server_full repeatedly, contact your account manager to raise the concurrency cap.