🎙️ Voice Agent Modular Testing Suite

STT Tab: Test speech-to-text models independently
TTS Tab: Test text-to-speech models independently
LLM Tab: Test language models independently
Pipeline Tab: Test the complete voice agent flow (STT → LLM → TTS)

Test individual components or the full voice agent pipeline. Each tab allows you to:

🎤 Speech-to-Text Testing

Select STT Model

Status

🎤 Record or Upload Audio

Transcription

Load models first: Click "Load Model" buttons before testing
Recording audio: Click the microphone icon 🎤 to start recording, click again to stop
Upload audio: Or drag & drop an audio file
GPU acceleration: Models run on GPU if available, otherwise CPU
Pipeline mode: Combines all modules for end-to-end voice interaction
Performance: Use smaller models (Whisper Base, Qwen 0.5B) for faster performance on CPU