Gemini Image Generation MCP Server

Integrates with Google's Gemini API to generate images with built-in Sharp-based resizing, format co...

390 views

1 installs

Updated Nov 22, 2025

Not audited

Integrates with Google's Gemini API to generate images with built-in Sharp-based resizing, format conversion, and imagemin optimization for streamlined visual content creation workflows.

Go to the Google Cloud Console
- Visit https://console.cloud.google.com/ and log in with your Google account.
Create or Select a Google Cloud Project
- In the top bar, click on the project dropdown.
- Select your existing project or click "New Project" to create a new one.
Enable the Gemini API
- In the left sidebar, go to “APIs & Services” > “Library.”
- Use the search bar to find “Gemini API.”
- Click on “Gemini API” and then click “Enable.”
Obtain a Gemini API Key
- Go to “APIs & Services” > “Credentials” in the left sidebar.
- Click “+ CREATE CREDENTIALS” and select “API key.”
- A dialog will appear with your new API key. Copy it and store it safely.
Add the GEMINI_API_KEY via FastMCP
- In the FastMCP connection interface, click your ready-made "Install Now" button for adding ENVs.
- Fill in the key name as GEMINI_API_KEY and paste in your Gemini API key value.

Your environment variable is now configured and ready to be used by the Gemini Image MCP Server.

How to Install Gemini Image Generation

Install Gemini Image Generation MCP server with one click through FastMCP. Choose your preferred AI development tool below:

Claude Desktop

Click "Claude Desktop" in Quick Start

Cursor IDE

Click "Cursor IDE" in Quick Start

VS Code

Click "VS Code" in Quick Start

Alternatives to Gemini Image Generation

Looking for similar MCP servers? Browse other servers in the same categories on FastMCP, or check out the similar servers listed above.

AI and Machine Learning MCP servers →

Compare side by side

Gemini Image Generation vs Gemini Image Generator Gemini Image Generation vs Gemini 2.5 Flash Image Gemini Image Generation vs Gemini Nanobanana (Image Generation) Gemini Image Generation vs Nano Banana (Gemini Image Generator) Gemini Image Generation vs Universal Image Generator

Quick Start

View on GitHub

More for AI and Machine Learning

View All →

Penpot

Integrates with Penpot's API to enable project browsing, file retrieval, object searching, and visual component export with automatic screenshot generation for converting UI designs into functional code.

AI and Machine Learning Automation

3.0k

Blender

Experience seamless AI-powered 3D modeling by connecting Blender with Claude AI via the Model Context Protocol. BlenderMCP enables two-way communication, allowing you to create, modify, and inspect 3D scenes directly through AI prompts. Control objects, materials, lighting, and execute Python code in Blender effortlessly. Access assets from Poly Haven and generate AI-driven models using Hyper3D Rodin. This integration enhances creative workflows by combining Blender’s robust tools with Claude’s intelligent guidance, making 3D content creation faster, interactive, and more intuitive. Perfect for artists and developers seeking AI-assisted 3D design within Blender’s environment.

AI and Machine Learning Automation

2.5k

1-Click Ready

Ollama

Integrates Ollama's local LLM models with MCP-compatible applications, enabling on-premise AI processing and custom model deployment while maintaining data control.

AI and Machine Learning Developer Tools

2.0k

1-Click Ready

Llama.cpp Bridge

Bridges local llama-server instances with MCP clients, providing chat interface, health monitoring, and configurable generation parameters for integrating llama.cpp models with desktop applications

AI and Machine Learning Monitoring

2.2k

1-Click Ready

Video & Audio Text Extraction

Extracts text from videos and audio files across platforms like YouTube, Bilibili, TikTok, Instagram, Twitter/X, Facebook, and Vimeo using Whisper speech recognition for transcription, content analysis, and accessibility improvements.

Entertainment and Media AI and Machine Learning

2.6k

1-Click Ready

Qwen Code

Bridges Qwen's code analysis capabilities through CLI integration, providing file-referenced queries with @filename syntax, automatic model fallback, and configurable execution modes for code review, codebase exploration, and automated refactoring workflows.

AI and Machine Learning Developer Tools

2.0k

1-Click Ready

Video Edit (MoviePy)

MoviePy-based video editing server that provides comprehensive video and audio processing capabilities including trimming, merging, resizing, effects, format conversion, YouTube downloading, and text/image overlays through an in-memory object store for chaining operations efficiently.

Entertainment and Media AI and Machine Learning

1.9k

1-Click Ready

Ollama

Integrates with Ollama for local large language model inference, enabling text generation and model management without relying on cloud APIs.

AI and Machine Learning Developer Tools

1.4k

1-Click Ready

n8n Workflow Builder

Integrates with n8n workflow automation platform to enable natural language workflow creation, management, and deployment with encrypted credential handling, role-based access control, and automated testing capabilities.

AI and Machine Learning Automation

656

Think Tool

Remote

Provides a structured thought process management system for maintaining explicit reasoning steps, policy verification, and tool output analysis through persistent memory storage

Memory Management AI and Machine Learning

962

1-Click Ready

Similar MCP Servers

Gemini Image Generator

Integrates with Google's Gemini 2.5 Flash model to generate images with automatic prompt enhancement and file-based output, featuring character consistency maintenance and multi-image blending capabilities for content creators.

AI and Machine Learning

302

Gemini 2.5 Flash Image

Integrates with Google Gemini 2.5 Flash to provide text-to-image generation, image editing, composition, and style transfer capabilities with support for base64 and file path inputs.

AI and Machine Learning

452

1-Click Ready

Gemini Nanobanana (Image Generation)

Integrates with Google's Gemini 2.5 Flash Image API to provide text-to-image generation, single image editing with prompts, multi-image composition, and style transfer capabilities with automatic file saving and collision handling.

AI and Machine Learning

633

Nano Banana (Gemini Image Generator)

Generates images using Google's Gemini 2.5 Flash model and automatically uploads them to ImgBB, returning publicly accessible URLs for immediate web sharing without local file management.

AI and Machine Learning

298

Universal Image Generator

Provides multi-provider image generation and transformation capabilities across Google Gemini, ZhipuAI, and Alibaba Bailian with automatic prompt translation and optimization for each provider's preferred language, supporting URL-based editing with mask support and flexible input methods including base64 encoding, file paths, and public URLs.

AI and Machine Learning

519

Nano-Banana (Gemini 2.5 Flash Image)

Integrates with Google's Gemini 2.5 Flash to generate and edit images from text prompts, supporting iterative workflows with reference images and automatic cross-platform file management.

AI and Machine Learning

596

Gemini Bridge

Bridges Claude with Google's Gemini AI through the official Gemini CLI, enabling direct queries and file-based context sharing between the two language models.

AI and Machine Learning Developer Tools

292

1-Click Ready

AI Vision

Integrates with Google's Gemini and Vertex AI models to analyze images, compare multiple images, and process video content with intelligent file handling that automatically optimizes upload strategies for different file sizes.

AI and Machine Learning

448

Google AI Studio

Integrates with Google AI Studio/Gemini API to process multimodal content including images, videos, audio, PDFs, and text files for content generation, analysis, and document conversion tasks.

AI and Machine Learning

725

GPT Image Generator

Enables direct image generation and editing through OpenAI's gpt-image-1 model with support for text prompts, file paths, and base64 encoded inputs for creative workflows and visual content creation.

AI and Machine Learning

358

Report Issue

Operating System

Client

Client Name

Issue

Please describe the issue

Thank you! Your issue report has been submitted successfully.

Gemini Image Generation MCP Server

How to Install Gemini Image Generation

Alternatives to Gemini Image Generation

Compare side by side

Quick Start

Claude Desktop

Cursor IDE

VS Code

Claude Code

Gemini

Codex

Manual

Choose Connection Type for

Remote Connection by FastMCP

Local Connection

Authentication Required

Run MCP servers withoutlocal setup or downtime

Configuration for

Environment Variables

HTTP Headers

started!

Configuration

Installation Failed

More for AI and Machine Learning

Penpot

Blender

Ollama

Llama.cpp Bridge

Video & Audio Text Extraction

Qwen Code

Video Edit (MoviePy)

Ollama

n8n Workflow Builder

Think Tool

Similar MCP Servers

Gemini Image Generator

Gemini 2.5 Flash Image

Gemini Nanobanana (Image Generation)

Nano Banana (Gemini Image Generator)

Universal Image Generator

Nano-Banana (Gemini 2.5 Flash Image)

Gemini Bridge

AI Vision

Google AI Studio

GPT Image Generator

Report Issue

Stay ahead of the MCP ecosystem

Run MCP servers without
local setup or downtime