Skip to main content
Voice SDK · AVML R&D

One SDK. Every voice surface.

A containerized, end-to-end voice platform for building intelligent audio systems anywhere — transcription, synthesis, denoising, biometrics, and conversational AI in one self-hosted stack.

Core pipeline

Audio in. Decisions out. Five stages, one SDK.

Every voice workload follows the same shape. Voice SDK provides production-grade primitives at each stage — and lets you wire only the ones you need.

01
Audio input
mic · file · stream
02
Processing
denoise · VAD
03
Understanding
STT · embed
04
Response
LLM · TTS
05
Monitoring
SNR · latency
By the numbers

Numbers, units, and conditions stated plainly.

Languages (STT)
90+
Whisper Large V3
Real-time latency
150
ms · denoise streaming
Voice agent loop
sub-s
STT · LLM · TTS · WebRTC
Embedding dim
192
ECAPA-TDNN · VoxCeleb2
Capabilities

Production primitives, ready to compose.

01

Containerized everywhere

Every module ships as a Docker container. Deploy on local, cloud, or edge with zero environment friction.

02

Self-hosted by default

No data leaves your infrastructure. All models — Whisper, Kokoro, XTTS, Qwen, ECAPA, pyannote — run on your hardware.

03

GPU-accelerated inference

NVIDIA CUDA 12.x throughout. Sub-second voice agent loop, 14.2 ms file denoising, 150 ms real-time streaming.

04

OpenAI-compatible API

Drop-in replacement for OpenAI audio transcription and speech endpoints. Migrate without rewriting client code.

Quick start

OpenAI-compatible. Drop in. Move on.

Point your existing OpenAI client at voiceai.trouve.works and transcribe, synthesize, denoise, or identify speakers — without rewriting a line of integration code.

quickstart.py
from openai import OpenAI

client = OpenAI(
base_url="https://voiceai.trouve.works/services/v1",
api_key="not-needed",
)

# Speech-to-text — OpenAI-compatible, self-hosted
transcript = client.audio.transcriptions.create(
model="stt-1",
file=open("call.wav", "rb"),
)
print(transcript.text)
terminal
$ pip install openai
$ python quickstart.py