๐ŸŽ™๏ธ Voice Agent Modular Testing Suite

Test individual components or the full voice agent pipeline. Each tab allows you to:

  • STT Tab: Test speech-to-text models independently
  • TTS Tab: Test text-to-speech models independently
  • LLM Tab: Test language models independently
  • Pipeline Tab: Test the complete voice agent flow (STT โ†’ LLM โ†’ TTS)

๐ŸŽค Speech-to-Text Testing

Select STT Model

Test Transcription


๐Ÿ“ Usage Tips

  • Load models first: Click "Load Model" buttons before testing
  • Recording audio: Click the microphone icon ๐ŸŽค to start recording, click again to stop
  • Upload audio: Or drag & drop an audio file
  • GPU acceleration: Models run on GPU if available, otherwise CPU
  • Pipeline mode: Combines all modules for end-to-end voice interaction
  • Performance: Use smaller models (Whisper Base, Qwen 0.5B) for faster performance on CPU