AivisSpeech

GitHub Repo: N/A
Classification: Community
Downloads: N/A
Released On: Mar 15, 2025

About

Lets AI platforms convert text input into speech audio through the AivisSpeech API, with customizable speaker parameters for voice applications.
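The listing does not document the API itself, so the following is only a rough sketch of what a text-to-speech call with speaker parameters might look like. It assumes a locally running engine with a VOICEVOX-style HTTP API: the base URL and port, the `/audio_query` and `/synthesis` endpoints, and the `speedScale` and `pitchScale` fields are all assumptions rather than details taken from this page, and the helper names are hypothetical.

```python
import json
import urllib.parse
import urllib.request

# Assumption: a local engine exposing a VOICEVOX-style HTTP API.
# The host, port, and endpoint names are not stated in the listing.
BASE_URL = "http://127.0.0.1:10101"


def build_url(path: str, **params) -> str:
    """Return BASE_URL + path with URL-encoded query parameters."""
    return f"{BASE_URL}{path}?{urllib.parse.urlencode(params)}"


def apply_speaker_params(query: dict, speed: float = 1.0, pitch: float = 0.0) -> dict:
    """Return a copy of the audio query with speed and pitch overridden."""
    tuned = dict(query)
    tuned["speedScale"] = speed  # assumed field name
    tuned["pitchScale"] = pitch  # assumed field name
    return tuned


def synthesize(text: str, speaker: int, speed: float = 1.0, pitch: float = 0.0) -> bytes:
    """Two-step synthesis: fetch an audio query, adjust it, then render audio."""
    req = urllib.request.Request(
        build_url("/audio_query", text=text, speaker=speaker), method="POST"
    )
    with urllib.request.urlopen(req) as resp:
        query = json.load(resp)

    body = json.dumps(apply_speaker_params(query, speed, pitch)).encode("utf-8")
    req = urllib.request.Request(
        build_url("/synthesis", speaker=speaker),
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return resp.read()  # audio bytes returned by the engine
```

Calling `synthesize("Hello", speaker=1, speed=1.2)` would need a running engine and a valid speaker ID; the two helper functions can be exercised without one.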


Explore Similar MCP Servers

Official

ElevenLabs

Provides text-to-speech and voice cloning through ElevenLabs, with customizable voice profiles and audio processing for conversational applications.

Community

ElevenLabs

Integrates with the ElevenLabs API for text-to-speech, generating audio from text with a choice of voices and adjustable speech settings.

Community

Speech Interface (Faster Whisper)

Adds voice interaction to AI models, using faster-whisper and PyAudio for speech recognition and synthesis. Exposes a natural-language voice interface through the Model Context Protocol (MCP).

Official

AllVoiceLab

Integrates with the AllVoiceLab API for voice and audio processing, including text-to-speech, voice cloning, speech enhancement, subtitle extraction, and multilingual video dubbing.

Community

PiAPI Image Generation

Generates images from text prompts through the PiAPI integration, for content creation and visual design tasks.

Community

VOICEVOX

Integrates with the VOICEVOX text-to-speech engine for Japanese voice generation. Supports multiple speakers and works over both the default transport and Server-Sent Events (SSE).

Community

Voice Call (Twilio)

Places AI-driven outbound calls through Twilio, using live speech analysis to keep interactions natural, for tasks such as scheduling appointments, delivering customer support, or collecting data autonomously.

Community

RetellAI Voice Services

Integrates with RetellAI voice services for phone-based interactions. Supports call management, agent settings, and voice options for uses such as customer support, appointment scheduling, and information collection.

Official

Daisys AI Text-to-Speech

Provides access to the Daisys AI text-to-speech API through the Model Context Protocol (MCP). Voice characteristics such as gender, pace, pitch, and expression can be customized for the generated audio.

Community

Audio Transcriber (OpenAI Whisper)

Transcribes speech to text through the Model Context Protocol (MCP) using OpenAI's Whisper API, with configurable language options and optional file saving.

Community

OpenAI TTS

Generates speech with OpenAI's TTS API through the Model Context Protocol (MCP). Voices, styles, and speech settings can be adjusted for interactive dialogue.

Community

KoboldAI

Integrates KoboldAI's text generation for creative writing, chatbot projects, and content creation, working with local language models.

Community

Zonos TTS

Adds multilingual speech synthesis to AI applications through the Zonos TTS API, with playback via PulseAudio.

Community

Blabber (OpenAI TTS)

Synthesizes speech through the OpenAI Text-to-Speech (TTS) API, with a choice of voices, audio file options, and auto-play.

Community

Resemble

Provides access to the Resemble AI text-to-speech API for real-time voice generation across media types, with flexible server connection options and customizable voice models.