OpenAI TTS
Enables high-quality voice generation from text using OpenAI's TTS API with customizable voices, for...
Obtain an OpenAI API Key
- Visit the OpenAI platform at https://platform.openai.com/account/api-keys.
- Log in to your OpenAI account (create one if necessary).
- Click the “Create new secret key” button.
- Copy the generated API key. You will need this key in the next step.
Add the API Key to FastMCP
- In the FastMCP connection interface, locate the "Install Now" button for integrating the TTS-MCP server.
- When prompted, fill in the
OPENAI_API_KEYvalue using the API key you obtained from OpenAI.
Save and Test
- Save your changes within FastMCP.
- Test the integration by initiating a text-to-speech request to confirm that the setup is successful.
Quick Start
Choose Connection Type for
Authentication Required
Please sign in to use FastMCP hosted connections
Run MCP servers without
local setup or downtime
Access to 1,000+ ready-to-use MCP servers
Skip installation, maintenance, and trial-and-error.
No local setup or infra
Run MCP servers without Docker, ports, or tunnels.
Always online
Your MCP keeps working even when your laptop is off.
One secure URL
Use the same MCP from any agent, anywhere.
Secure by default
Encrypted connections. Secrets never stored locally.
Configuration for
Environment Variables
Please provide values for the following environment variables:
HTTP Headers
Please provide values for the following HTTP headers:
started!
The MCP server should open in . If it doesn't open automatically, please check that you have the application installed.
Copy and run this command in your terminal:
Make sure Gemini CLI is installed:
Visit Gemini CLI documentation for installation instructions.
Make sure Claude Code is installed:
Visit Claude Code documentation for installation instructions.
Installation Steps:
Configuration
Installation Failed
More for Entertainment and Media
View All →Video & Audio Text Extraction
Extracts text from videos and audio files across platforms like YouTube, Bilibili, TikTok, Instagram, Twitter/X, Facebook, and Vimeo using Whisper speech recognition for transcription, content analysis, and accessibility improvements.
Video Edit (MoviePy)
MoviePy-based video editing server that provides comprehensive video and audio processing capabilities including trimming, merging, resizing, effects, format conversion, YouTube downloading, and text/image overlays through an in-memory object store for chaining operations efficiently.
Navidrome
Integrates with Navidrome music servers to provide complete music library management including playlist creation, playback control, search functionality, listening history tracking, Last.fm integration for recommendations, Radio Browser for internet radio discovery, and LRCLIB for synchronized lyrics retrieval.
ElevenLabs
Unleash powerful Text-to-Speech and audio processing with the official ElevenLabs MCP server. It enables MCP clients like Claude Desktop, Cursor, and OpenAI Agents to generate speech, clone voices, transcribe audio, and create unique sounds effortlessly. Customize voices, convert recordings, and build immersive audio scenes with easy-to-use APIs designed for creative and practical applications. This server integrates seamlessly, expanding your AI toolkit to bring rich, dynamic audio experiences to life across various platforms and projects.
Ableton Live
Enables control of Ableton Live music production software through a bidirectional communication system that supports track creation, MIDI editing, playback control, instrument loading, and library browsing for music composition and sound design workflows.
More for AI and Machine Learning
View All →Blender
Experience seamless AI-powered 3D modeling by connecting Blender with Claude AI via the Model Context Protocol. BlenderMCP enables two-way communication, allowing you to create, modify, and inspect 3D scenes directly through AI prompts. Control objects, materials, lighting, and execute Python code in Blender effortlessly. Access assets from Poly Haven and generate AI-driven models using Hyper3D Rodin. This integration enhances creative workflows by combining Blender’s robust tools with Claude’s intelligent guidance, making 3D content creation faster, interactive, and more intuitive. Perfect for artists and developers seeking AI-assisted 3D design within Blender’s environment.
Video & Audio Text Extraction
Extracts text from videos and audio files across platforms like YouTube, Bilibili, TikTok, Instagram, Twitter/X, Facebook, and Vimeo using Whisper speech recognition for transcription, content analysis, and accessibility improvements.
Video Edit (MoviePy)
MoviePy-based video editing server that provides comprehensive video and audio processing capabilities including trimming, merging, resizing, effects, format conversion, YouTube downloading, and text/image overlays through an in-memory object store for chaining operations efficiently.
Qwen Code
Bridges Qwen's code analysis capabilities through CLI integration, providing file-referenced queries with @filename syntax, automatic model fallback, and configurable execution modes for code review, codebase exploration, and automated refactoring workflows.
Similar MCP Servers
ElevenLabs
Unleash powerful Text-to-Speech and audio processing with the official ElevenLabs MCP server. It enables MCP clients like Claude Desktop, Cursor, and OpenAI Agents to generate speech, clone voices, transcribe audio, and create unique sounds effortlessly. Customize voices, convert recordings, and build immersive audio scenes with easy-to-use APIs designed for creative and practical applications. This server integrates seamlessly, expanding your AI toolkit to bring rich, dynamic audio experiences to life across various platforms and projects.