AI Vision MCP Server

Updated Feb 5, 2026

Not audited

Cloud Platforms AI and Machine Learning

Enables AI-powered image and video analysis using Google Gemini and Vertex AI models. Supports analyzing single or multiple images, detecting objects with bounding boxes, and video content analysis through natural language prompts.

Open the FastMCP connection interface (click “Install Now”) and prepare to fill the environment fields there.
- You will enter the environment variable names/values directly into the FastMCP connection form (the same keys shown in the README). (github.com)
If you’re using the Google AI Studio / Gemini provider (recommended):
1. Set IMAGE_PROVIDER and VIDEO_PROVIDER to "google" in the FastMCP form. (github.com)
2. Obtain a Gemini API key:
  - Sign in to Google AI Studio (aistudio.google.com).
  - Open Projects (or Import Projects if your Google Cloud project is not yet listed), then go to the API keys / “Get API key” area.
  - Create a new API key and copy it (the key is shown only once). (ai.google.dev)
3. In the FastMCP form fill:
  - IMAGE_PROVIDER = google
  - VIDEO_PROVIDER = google
  - GEMINI_API_KEY = (the API key you copied in step 2)
4. Keep the API key private (do not commit to source control). (ai.google.dev)
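The Google provider fields from step 3 can be sketched as a small Python helper. This is only an illustration of the three values described above: the function names `google_provider_env` and `mask_secret` are hypothetical and are not part of the MCP server or FastMCP. The key is read from the shell environment rather than written into the script, in line with step 4.

```python
import os


def google_provider_env(api_key: str) -> dict[str, str]:
    """Return the three form fields from step 3 as a name -> value mapping."""
    key = api_key.strip()
    if not key:
        raise ValueError("GEMINI_API_KEY is empty")
    return {
        "IMAGE_PROVIDER": "google",
        "VIDEO_PROVIDER": "google",
        "GEMINI_API_KEY": key,
    }


def mask_secret(value: str, visible: int = 4) -> str:
    """Hide all but the last few characters so the key can be displayed safely."""
    if len(value) <= visible:
        return "*" * len(value)
    return "*" * (len(value) - visible) + value[-visible:]


if __name__ == "__main__":
    # Read the key from the environment instead of hard-coding it.
    key = os.environ.get("GEMINI_API_KEY", "")
    if not key:
        print("GEMINI_API_KEY is not set in this shell")
    else:
        for name, value in google_provider_env(key).items():
            shown = mask_secret(value) if name == "GEMINI_API_KEY" else value
            print(f"{name} = {shown}")
```

Printing the masked form lets you confirm which key is in use without exposing it in a terminal history or screenshot.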
If you’re using the Vertex AI provider (production / alternative):
1. Set IMAGE_PROVIDER and VIDEO_PROVIDER to "vertex_ai" in the FastMCP form. (github.com)
2. Create or choose a Google Cloud project, enable Vertex AI and Cloud Storage APIs, and ensure billing is enabled for that project. (docs.cloud.google.com)
3. Create a service account and download a JSON key:
  - In the Google Cloud Console go to IAM & Admin → Service Accounts → Create Service Account.
  - Grant the needed roles (e.g., Vertex AI User, roles/aiplatform.user, plus Storage Admin or narrower storage permissions) and finish. Then open that service account → Keys → Add Key → Create new key → JSON → Download. You will get a service-account JSON file; keep it secure. (docs.cloud.google.com)
4. Create (or pick) a Google Cloud Storage bucket for model/video/object storage: in Cloud Storage → Create, choose a bucket name (it must be globally unique) and note the exact name. (cloud.google.com)
5. In the FastMCP form fill:
  - IMAGE_PROVIDER = vertex_ai
  - VIDEO_PROVIDER = vertex_ai
  - VERTEX_CREDENTIALS = /absolute/path/to/your-service-account.json (upload or point to the local path the MCP host will have access to)
  - GCS_BUCKET_NAME = your-gcs-bucket-name
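Before pasting the Vertex AI values into the form, it can help to sanity-check them locally. The sketch below is an assumption-laden illustration, not the server's own validation: the function names are hypothetical, the field list covers only a few fields that a downloaded service-account JSON key normally contains, and the bucket-name pattern is a simplified version of the Cloud Storage naming rules (3 to 63 characters, lowercase letters, digits, dashes, underscores, and dots, starting and ending with a letter or digit).

```python
import json
import re
from pathlib import Path

# A few fields expected in a service-account JSON key (not an exhaustive list).
EXPECTED_FIELDS = ("project_id", "private_key", "client_email")

# Simplified bucket-name rule; the full Cloud Storage rules have more cases.
BUCKET_RE = re.compile(r"[a-z0-9][a-z0-9._-]{1,61}[a-z0-9]")


def check_service_account_data(data: object) -> list[str]:
    """Return a list of problems found in a parsed service-account key."""
    if not isinstance(data, dict):
        return ["top-level JSON value is not an object"]
    problems = []
    if data.get("type") != "service_account":
        problems.append('"type" is not "service_account"')
    for field in EXPECTED_FIELDS:
        if not data.get(field):
            problems.append(f'missing field "{field}"')
    return problems


def check_service_account_file(path: str) -> list[str]:
    """Check the VERTEX_CREDENTIALS path and the file it points to."""
    problems = []
    p = Path(path)
    if not p.is_absolute():
        problems.append("VERTEX_CREDENTIALS should be an absolute path")
    if not p.is_file():
        return problems + [f"no file at {path}"]
    try:
        data = json.loads(p.read_text(encoding="utf-8"))
    except json.JSONDecodeError:
        return problems + ["file is not valid JSON"]
    return problems + check_service_account_data(data)


def check_bucket_name(name: str) -> bool:
    """Return True if the name passes the simplified bucket-name rule."""
    return BUCKET_RE.fullmatch(name) is not None
```

Note that a file that passes these checks on your machine must also exist at the same path on the host that runs the MCP server, as step 5 points out.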
After filling in the values in FastMCP, save/confirm the connection and start the MCP server:
- Use the client’s “Install Now” / save button to apply the environment variables and start the MCP integration. The MCP server reads GEMINI_API_KEY, or VERTEX_CREDENTIALS plus GCS_BUCKET_NAME, at startup. (github.com)
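The relationship between the provider setting and the credentials read at startup can be sketched as a lookup table. This is a hypothetical pre-flight check built only from the variable names listed above; the server's actual startup logic is not shown in this guide, and the function name `missing_variables` is an assumption.

```python
# Which credentials each provider value needs, per the steps above.
REQUIRED_BY_PROVIDER = {
    "google": ("GEMINI_API_KEY",),
    "vertex_ai": ("VERTEX_CREDENTIALS", "GCS_BUCKET_NAME"),
}


def missing_variables(env: dict[str, str]) -> list[str]:
    """Return the names of variables that are unset, empty, or unrecognized."""
    problems: list[str] = []
    for selector in ("IMAGE_PROVIDER", "VIDEO_PROVIDER"):
        provider = env.get(selector, "").strip()
        if provider not in REQUIRED_BY_PROVIDER:
            # Selector is missing or holds a value other than google / vertex_ai.
            problems.append(selector)
            continue
        for name in REQUIRED_BY_PROVIDER[provider]:
            if not env.get(name, "").strip() and name not in problems:
                problems.append(name)
    return problems


if __name__ == "__main__":
    form = {
        "IMAGE_PROVIDER": "vertex_ai",
        "VIDEO_PROVIDER": "vertex_ai",
        "GCS_BUCKET_NAME": "your-gcs-bucket-name",
    }
    print(missing_variables(form))  # ['VERTEX_CREDENTIALS']
```

An empty list means every variable the chosen provider needs has a value; it does not prove that the values are valid credentials.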
Quick checks and security reminders:
1. Check the MCP startup logs for authentication success or credential errors.
2. Never commit API keys or service-account JSON to git; restrict access to the JSON file and rotate/delete keys if exposed. (ai.google.dev)
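One way to act on the second reminder is to scan text for credential-shaped strings before committing it. The sketch below is a heuristic only: the `AIza` prefix pattern is a convention used by common secret scanners for Google API keys, not a documented format guarantee, and the function name `find_secret_hints` is hypothetical. A dedicated secret scanner is the better choice for real repositories.

```python
import re

# Heuristic: Google API keys are commonly "AIza" followed by 35 URL-safe characters.
API_KEY_RE = re.compile(r"AIza[0-9A-Za-z_\-]{35}")

# A service-account key file declares its type and carries a private key.
SERVICE_ACCOUNT_RE = re.compile(r'"type"\s*:\s*"service_account"')


def find_secret_hints(text: str) -> list[str]:
    """Return short descriptions of credential-like content found in text."""
    hints = []
    if API_KEY_RE.search(text):
        hints.append("possible Google API key")
    if SERVICE_ACCOUNT_RE.search(text) and '"private_key"' in text:
        hints.append("possible service-account JSON key")
    return hints
```

If a scan like this ever flags a committed file, treat the key as exposed and rotate or delete it, as the reminder above says.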

More for Cloud Platforms

Salesforce

Official

Unlock powerful Salesforce org management with the Salesforce DX MCP Server, designed for seamless interaction between large language models and Salesforce environments. This developer preview offers secure, direct access to Salesforce resources without exposing secrets, using TypeScript libraries and granular org allowlisting. Its modular toolsets cover org administration, data queries, user permissions, metadata deployment, and testing. Easily extendable and compatible with various clients like VS Code, Cursor, and more, it empowers developers to perform complex tasks with natural language commands while maintaining robust security. The MCP Server streamlines Salesforce DX workflows through an efficient, secure, and flexible protocol.

Cloud Platforms Developer Tools

1.4k

1-Click Ready

Hostinger API

Official

Integrates with Hostinger's hosting platform to enable domain registration and DNS management, VPS creation and configuration, firewall setup, backup operations, and billing subscription handling through over 100 specialized tools organized by service category.

Cloud Platforms Security

1.7k

Netlify

Official

Remote

Control your Netlify projects effortlessly using natural language through AI agents with Netlify MCP Server. This server follows the Model Context Protocol to enable code agents to create, deploy, and manage sites, configure access controls, handle environment variables, and more—all via simple prompts. It acts as a bridge between AI clients and Netlify’s API and CLI, empowering seamless automation and resource management. Whether retrieving team data or managing forms and extensions, Netlify MCP Server streamlines your workflow by integrating powerful AI-driven project control in an accessible, standardized way.

Cloud Platforms Automation

891

Railway

Official

Integrates with Railway's platform and CLI to enable deployment, service management, environment configuration, and infrastructure monitoring through conversational workflows.

Cloud Platforms Developer Tools

757

1-Click Ready

Azure All

Official

Supercharge AI agents with seamless access to Azure services using Azure MCP Server. This project enables powerful automation and management of Azure resources with tools for databases, storage, monitoring, security, and best practices. Easily interact with services like Cosmos DB, SQL, Key Vault, Service Bus, and more—all within compatible AI platforms. Azure MCP Server is in Public Preview and rapidly evolving, making it a versatile solution for both developers and enterprise environments looking to integrate Azure functionality.

Cloud Platforms Developer Tools

665

1-Click Ready

Kubernetes

Control and monitor K8s clusters for management and debugging.

Cloud Platforms Developer Tools

585

1-Click Ready

Tencent EdgeOne Pages

Official

Remote

Effortlessly publish HTML, folders, or zip files with instant public URLs using EdgeOne Pages MCP. This service streamlines rapid deployment of your static content via EdgeOne Pages, leveraging serverless edge functions and key-value storage for fast, reliable delivery. Users can quickly deploy content and receive shareable links, making it ideal for web developers and teams needing easy, scalable web hosting.

Cloud Platforms Content Management

782

1-Click Ready

AWS Athena

Integrates with AWS SDK to execute SQL queries against Athena databases, enabling large-scale data analysis and business intelligence for AWS data lakes.

Database Cloud Platforms

630

Kubernetes

Enables direct Kubernetes cluster management through kubectl command execution, providing a bridge for real-time resource administration within conversations.

Cloud Platforms Developer Tools

582

Cloudflare

Integrates with Cloudflare's API to enable management of DNS, CDN, and security configurations for web infrastructure automation.

Cloud Platforms Developer Tools

582

More for AI and Machine Learning

Penpot

Integrates with Penpot's API to enable project browsing, file retrieval, object searching, and visual component export with automatic screenshot generation for converting UI designs into functional code.

AI and Machine Learning Automation

2.5k

Ollama

Integrates Ollama's local LLM models with MCP-compatible applications, enabling on-premise AI processing and custom model deployment while maintaining data control.

AI and Machine Learning Developer Tools

1.7k

1-Click Ready

Blender

Experience seamless AI-powered 3D modeling by connecting Blender with Claude AI via the Model Context Protocol. BlenderMCP enables two-way communication, allowing you to create, modify, and inspect 3D scenes directly through AI prompts. Control objects, materials, lighting, and execute Python code in Blender effortlessly. Access assets from Poly Haven and generate AI-driven models using Hyper3D Rodin. This integration enhances creative workflows by combining Blender’s robust tools with Claude’s intelligent guidance, making 3D content creation faster, interactive, and more intuitive. Perfect for artists and developers seeking AI-assisted 3D design within Blender’s environment.

AI and Machine Learning Automation

2.0k

1-Click Ready

Llama.cpp Bridge

Bridges local llama-server instances with MCP clients, providing a chat interface, health monitoring, and configurable generation parameters for integrating llama.cpp models with desktop applications.

AI and Machine Learning Monitoring

1.7k

1-Click Ready

Video & Audio Text Extraction

Extracts text from videos and audio files across platforms like YouTube, Bilibili, TikTok, Instagram, Twitter/X, Facebook, and Vimeo using Whisper speech recognition for transcription, content analysis, and accessibility improvements.

Entertainment and Media AI and Machine Learning

2.0k

1-Click Ready

Video Edit (MoviePy)

MoviePy-based video editing server that provides comprehensive video and audio processing capabilities including trimming, merging, resizing, effects, format conversion, YouTube downloading, and text/image overlays through an in-memory object store for chaining operations efficiently.

Entertainment and Media AI and Machine Learning

1.5k

1-Click Ready

Ollama

Integrates with Ollama for local large language model inference, enabling text generation and model management without relying on cloud APIs.

AI and Machine Learning Developer Tools

1.2k

1-Click Ready

Qwen Code

Bridges Qwen's code analysis capabilities through CLI integration, providing file-referenced queries with @filename syntax, automatic model fallback, and configurable execution modes for code review, codebase exploration, and automated refactoring workflows.

AI and Machine Learning Developer Tools

1.2k

1-Click Ready

n8n Workflow Builder

Integrates with n8n workflow automation platform to enable natural language workflow creation, management, and deployment with encrypted credential handling, role-based access control, and automated testing capabilities.

AI and Machine Learning Automation

550

Think Tool

Remote

Provides a structured thought-process management system for maintaining explicit reasoning steps, policy verification, and tool output analysis through persistent memory storage.

Memory Management AI and Machine Learning

827

1-Click Ready

Similar MCP Servers

AI Vision

Integrates with Google's Gemini and Vertex AI models to analyze images, compare multiple images, and process video content with intelligent file handling that automatically optimizes upload strategies for different file sizes.

AI and Machine Learning

355

Google AI Studio

Integrates with Google AI Studio/Gemini API to process multimodal content including images, videos, audio, PDFs, and text files for content generation, analysis, and document conversion tasks.

AI and Machine Learning

626

Universal Image Generator

Provides multi-provider image generation and transformation capabilities across Google Gemini, ZhipuAI, and Alibaba Bailian with automatic prompt translation and optimization for each provider's preferred language, supporting URL-based editing with mask support and flexible input methods including base64 encoding, file paths, and public URLs.

AI and Machine Learning

451
