Provider
mbailey
Classification
COMMUNITY
Downloads
7.8k(+1.7k this week)
Released On
Jun 9, 2025

About

Facilitate interactive voice dialogues using various transport options such as local audio capture and LiveKit virtual meetings. Customize speech-to-text and text-to-speech features, alongside automatic transport backup, to develop cutting-edge voice applications.


Explore Similar MCP Servers

Official

MiniMax

Unlock top-tier text-to-speech, voice duplication, and video creation functions using MiniMax's API. Benefit from strong error prevention and efficient file control tools.

Official

AllVoiceLab

Unlock advanced voice and audio processing features via seamless integration with AllVoiceLab's API. Leverage cutting-edge functionalities such as text-to-speech, voice duplication, speech enhancement, subtitle extraction, and multilingual video dubbing.

Community

Audio Interface

Facilitates speech interaction with Claude using audio recording and playback functions, enabling personalized device choices and efficient file handling to enhance verbal exchanges.

Community

VOICEVOX

Enhance your AI systems with seamless integration to the VOICEVOX text-to-speech engine, enabling Japanese voice generation. This integration supports various speaker options and is compatible with both default transport and Server-Sent Events technology.

Community

Voice Call (Twilio)

Achieve seamless AI-driven outbound calling using Twilio integration, incorporating live speech analysis for authentic interactions in scenarios like scheduling appointments, delivering customer support, or collecting data, all autonomously.

Community

TTS Say

Enhance your applications with seamless text-to-speech conversion using both OpenAI's API and local sound playback integration. Empower diverse functionalities with voice output capabilities.

Community

Say (Text-to-Speech)

Empower your conversations with cutting-edge text-to-speech features that seamlessly integrate native system voices and ElevenLabs technology. Speak out responses directly within the conversation interface for a dynamic user experience.