Best MCP Servers for Web Scraping

Turn any website into structured data with MCP servers built for web scraping. Extract content, crawl pages, and convert HTML to clean markdown — all from your AI assistant.

DeepWiki

Official Remote

Converts any DeepWiki article into clean, structured Markdown. The DeepWiki MCP Server crawls deepwiki.com pages, strips clutter such as ads and navigation, rewrites links for Markdown, and supports customizable output formats. Output can be a single document or organized by page, which makes it easy to extract documentation or guides for any supported library.

Web Scraping

Jina AI

Integrates with Jina AI's web services to enable web content extraction, search, and fact-checking through natural language interactions.

Web Search Web Scraping
2.2k
7

Web Fetcher

Remote

Fetches and extracts web content using Playwright's headless browser capabilities, delivering clean, readable content from JavaScript-heavy websites in HTML or Markdown format for research and information gathering.

Browser Automation Web Scraping

DuckDuckGo Search

Remote

Integrates with DuckDuckGo to provide web search capabilities, content fetching, and parsing, with results formatted for large language model consumption.

Web Search Web Scraping

Chinese Trends Hub

Remote

Provides real-time access to trending topics and content from major Chinese platforms including Weibo, Zhihu, Douyin, Bilibili, Douban, Toutiao, and 36kr through separate tools with temporary caching for improved performance.

Web Scraping Analytics and Data

FetchSERP

Official

Integrates with FetchSERP API to provide SEO analysis, SERP data retrieval, web scraping, keyword research, backlink analysis, and domain intelligence across Google, Bing, Yahoo, and DuckDuckGo search engines.

Web Search Web Scraping
1.6k
1

Documentation Scraper

Provides specialized documentation scraping and retrieval from GitHub, NPM, PyPI, and web pages, enabling accurate reference to up-to-date library documentation without disrupting workflow.

Web Scraping Developer Tools

Selenium WebDriver

Enables browser automation through Selenium WebDriver with support for Chrome, Firefox, and Edge browsers, providing navigation, element interaction, form handling, screenshot capture, JavaScript execution, and advanced actions for automated testing and web scraping tasks.

Browser Automation Web Scraping

Airbnb

Remote

Integrates with Airbnb to enable vacation rental search and detailed property information retrieval without requiring API keys.

Web Search Web Scraping

DuckDuckGo Search

Remote

Provides web search capabilities through DuckDuckGo, enabling content retrieval, URL processing, and metadata extraction with customizable filtering options.

Web Search Web Scraping

Deep Research (Tavily)

Enables comprehensive web research by leveraging Tavily's Search and Crawl APIs to aggregate information from multiple sources, extract detailed content, and structure data specifically for generating technical documentation and research reports.

Web Search Web Scraping
1.1k
2

Baidu Search

Remote

Provides web search capabilities through Baidu's search engine, enabling retrieval of search results and webpage content with robust error handling and content parsing.

Web Search Web Scraping

Serper Search and Scrape

Integrates with the Serper API to enable web searches and webpage content extraction, supporting research, content aggregation, and data mining tasks.

Web Search Web Scraping
994
3

Serper (Google Search)

Enables AI to perform Google searches via the Serper API with support for location, language, and time period filters.

Web Search Web Scraping
998
2

YouTube Transcript

Fetches and analyzes YouTube video transcripts. Accepts URLs or video IDs and returns formatted transcript data with timestamps, so video content can be analyzed without watching it.

Web Scraping Entertainment and Media

Google News & Trends

Remote

Integrates with Google News RSS feeds and Google Trends to provide news article search, trending topic retrieval, and optional content summarization for news monitoring and trend analysis workflows.

Web Scraping Analytics and Data

Playwright

Automate web browsers for testing, scraping, and visual analysis.

Browser Automation Web Scraping

Apify Actor

Official Remote

Use 4,000+ pre-built cloud tools, known as Actors, to extract data from websites, e-commerce, social media, search engines, maps, and more.

Web Scraping Automation

Web UI Copy

Transforms webpage content into a fully inlined, script-free HTML document with base64-encoded resources, enabling comprehensive web page analysis and extraction.

Web Scraping Developer Tools

One Search

Provides a unified search and web scraping platform that integrates multiple search providers like SearxNG and Tavily, along with Firecrawl for advanced web content extraction, enabling flexible web data retrieval and structured information gathering.

Web Search Web Scraping
695
3

Playwright Browser Automation

Enables LLM-powered browser automation for web tasks including navigation, interaction, and content extraction through Playwright's comprehensive browser control capabilities.

Browser Automation Web Scraping

Puppeteer Real Browser

Provides stealth browser automation using puppeteer-real-browser with anti-detection features, human-like interactions, proxy support, and captcha solving for web scraping, testing, and form automation that bypasses bot detection mechanisms.

Browser Automation Web Scraping

Fetch and Convert

Remote

Fetches and converts web content to Markdown using JSDOM and Turndown.

Web Scraping Developer Tools

YggTorrent

Provides secure access to YggTorrent through an unofficial API wrapper, enabling torrent searching with category filtering, detailed metadata retrieval, and magnet link generation with automatic passkey injection for authenticated downloads.

Web Scraping Entertainment and Media
634
4

YouTube Transcripts

Remote

Extract and analyze video captions and subtitles in multiple languages.

Web Scraping Entertainment and Media

YouTube Subtitles

Integrates YouTube subtitle retrieval for natural language queries about video content.

Web Scraping Entertainment and Media

Read Website Fast

Remote

Extracts web content and converts it to clean Markdown format using Mozilla Readability for intelligent article detection, with disk-based caching, robots.txt compliance, and concurrent crawling capabilities for fast content processing workflows.

Web Scraping Content Management
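
As a rough illustration of the robots.txt compliance mentioned for Read Website Fast, the check can be sketched with Python's standard `urllib.robotparser`. This is not the server's own code: the `is_allowed` helper, the `my-bot` user agent, and the sample rules are assumptions made for the example.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes the file's lines, so no network request is made here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Sample rules (assumed): everything under /private/ is off limits.
SAMPLE_RULES = "User-agent: *\nDisallow: /private/\n"

if __name__ == "__main__":
    for target in ("https://example.com/docs", "https://example.com/private/x"):
        print(target, is_allowed(SAMPLE_RULES, "my-bot", target))
```

A crawler would typically fetch `/robots.txt` once per host, cache it, and run a check like this before each page request.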

GitHub Repo Extractor

Connects to GitHub repositories, enabling natural language queries about code structure, dependencies, and development history.

Web Scraping Developer Tools

RSS Feed Parser

Provides RSS feed parsing and retrieval with RSSHub integration, automatically trying multiple instances when one fails and supporting custom rsshub:// protocol URLs for accessing current content from websites, social platforms, and news sources that don't natively provide RSS feeds.

Web Scraping
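
The instance failover described for RSS Feed Parser could look something like the sketch below. It assumes that an `rsshub://` URL maps to the same route on each RSSHub instance; the instance list, the function names, and the injected `fetch` callable are all hypothetical and are not taken from the server itself.

```python
from typing import Callable, List, Sequence

# Hypothetical instance list; the listing does not say which instances the server uses.
INSTANCES: Sequence[str] = ("https://rsshub.app", "https://rsshub.example.org")


def candidate_urls(url: str, instances: Sequence[str] = INSTANCES) -> List[str]:
    """Expand an rsshub:// URL into one HTTPS URL per instance."""
    prefix = "rsshub://"
    if not url.startswith(prefix):
        return [url]  # ordinary feed URL: nothing to expand
    route = url[len(prefix):].lstrip("/")
    return [f"{base.rstrip('/')}/{route}" for base in instances]


def fetch_with_failover(
    url: str,
    fetch: Callable[[str], str],
    instances: Sequence[str] = INSTANCES,
) -> str:
    """Try each candidate in order and return the first successful body."""
    errors = []
    for candidate in candidate_urls(url, instances):
        try:
            return fetch(candidate)
        except Exception as exc:  # a real client would catch narrower errors
            errors.append(f"{candidate}: {exc}")
    raise RuntimeError("all instances failed: " + "; ".join(errors))
```

Passing `fetch` in as a parameter keeps the failover logic independent of any particular HTTP client and makes it easy to exercise with a stub.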

Bright Data

Official

Integrates with Bright Data's web scraping infrastructure to provide real-time access to public web data through specialized tools for search engine scraping, webpage extraction, and structured data retrieval from popular websites.

Browser Automation Web Scraping
532
4
