Rime Text-to-Speech
Text-to-speech server that converts text into spoken audio through Rime's API, streaming with optimi...
Log in to the Rime Dashboard
Go to https://rime.ai/dashboard/tokens.Generate or Copy Your API Key
If you do not have an API key already, follow the interface to create a new token.
Otherwise, copy the existing API key shown on the Tokens page.Fill In the API Key
In the FastMCP connection interface, locate the "Install Now" button for Rime MCP.
When prompted, enter your copied API key into the field forRIME_API_KEY.(Optional) Customize Additional Options
You can also provide optional values for:RIME_GUIDANCE: (e.g., "Give a brief overview of the answer.")RIME_WHO_TO_ADDRESS: (e.g., "Matt")RIME_WHEN_TO_SPEAK: (e.g., "when asked to speak")RIME_VOICE: (Choose from available voices)
Save and Complete Setup
Click "Save" or complete the setup process in FastMCP.
Your Rime Text-to-Speech integration is now ready to use!
Quick Start
Choose Connection Type for
Authentication Required
Please sign in to use FastMCP hosted connections
Configure Environment Variables for
Please provide values for the following environment variables:
started!
The MCP server should open in . If it doesn't open automatically, please check that you have the application installed.
Copy and run this command in your terminal:
Make sure Gemini CLI is installed:
Visit Gemini CLI documentation for installation instructions.
Make sure Claude Code is installed:
Visit Claude Code documentation for installation instructions.
Installation Steps:
Configuration
Installation Failed
More for Entertainment and Media
View All →Video Edit (MoviePy)
MoviePy-based video editing server that provides comprehensive video and audio processing capabilities including trimming, merging, resizing, effects, format conversion, YouTube downloading, and text/image overlays through an in-memory object store for chaining operations efficiently.
ElevenLabs
Unleash powerful Text-to-Speech and audio processing with the official ElevenLabs MCP server. It enables MCP clients like Claude Desktop, Cursor, and OpenAI Agents to generate speech, clone voices, transcribe audio, and create unique sounds effortlessly. Customize voices, convert recordings, and build immersive audio scenes with easy-to-use APIs designed for creative and practical applications. This server integrates seamlessly, expanding your AI toolkit to bring rich, dynamic audio experiences to life across various platforms and projects.
Video & Audio Text Extraction
Extracts text from videos and audio files across platforms like YouTube, Bilibili, TikTok, Instagram, Twitter/X, Facebook, and Vimeo using Whisper speech recognition for transcription, content analysis, and accessibility improvements.
Openverse
Integrates with Openverse's Creative Commons image collection to search and retrieve openly-licensed images with detailed filtering options, attribution information, and specialized essay illustration features for finding relevant academic content.