Filters

Clear All Filters

Web Scraping Servers

98 servers found

View:

Web Scraping MCP servers enable your AI assistant to extract structured data from websites at scale. From simple page fetches to complex JavaScript-rendered content, these servers handle the technical challenges of web data extraction so you can focus on analysis.

MCP Server Listings

J

JinaAI

Extracts and processes web content for efficient parsing and analysis of online information

Web Scraping
293
1
Browser Use

Browser Use

Official

Enables LLMs, agents, and apps to access, search, and extract web data in real-time using the browser-use.com API.

Browser Automation Web Scraping
310
0
F

Fetch (Web Content & YouTube Transcripts)

Fetches web content and YouTube video transcripts, converting HTML to Markdown and extracting timestamps for reference in conversations.

Web Scraping Entertainment and Media
G

Google Search

Provides web search capabilities and webpage content extraction through Google Custom Search API, enabling access to search results and cleaned webpage content with minimal setup.

Web Search Web Scraping
306
0
pure.md

pure.md

Official

Enables AI access to web content in clean markdown format through unblock-url extraction and search-web capabilities, bypassing anti-bot measures for reliable information retrieval.

Web Search Web Scraping
303
0
AnyCrawl

AnyCrawl

Official

Integrates with the AnyCrawl API to provide web scraping and crawling capabilities with configurable depth limits, multiple scraping engines, and structured data extraction in various formats including markdown and JSON.

Web Search Web Scraping
298
0
W

Webpage Timestamps

Extracts webpage creation, modification, and publication timestamps from HTML meta tags, HTTP headers, JSON-LD structured data, microdata, OpenGraph, and Twitter cards with confidence scoring and intelligent consolidation for content freshness analysis and temporal metadata extraction.

Web Scraping Analytics and Data
Scraper.is

Scraper.is

Official

Integrates with Scraper.is API to enable web content extraction, structured data parsing, and Markdown conversion for tasks like product research, news aggregation, and content analysis.

Web Scraping
294
0
P

Prysm Web Scraper

Provides web scraping capabilities with three specialized tools (scrapeFocused, scrapeBalanced, scrapeDeep) for efficient content extraction, image processing, and pagination handling with customizable parameters.

Web Scraping
284
0
AgentQL

AgentQL

Official

Extracts structured data from web pages based on natural language descriptions, converting website content into JSON format without custom scraping code.

Web Scraping
276
0
R

RFC Document Bridge

Provides a bridge to IETF RFC documents for retrieving, searching, and extracting specific sections from technical standards documentation with support for both HTML and TXT formats

Web Scraping
W

Web Fetch

Fetches and converts web pages to markdown format with automatic image extraction and proxy support for accessing content through corporate networks or restricted environments.

Web Scraping
251
1
W

Web Browser

Integrates web browsing capabilities for realtime data retrieval, content extraction, and task automation using popular Python libraries.

Browser Automation Web Scraping
267
0
H

HTTP Request

Enables LLMs to make advanced HTTP requests with realistic browser emulation, bypassing anti-bot measures while supporting all HTTP methods, authentication, and automatic response handling for web scraping and API interactions.

Browser Automation Web Scraping
P

PubMed Research

Integrates with PubMed's biomedical literature database to search academic papers, retrieve detailed metadata and abstracts, generate formatted citations in multiple styles, and track citation metrics for research and literature review workflows.

Web Scraping Research
LSD Web Data Extraction

LSD Web Data Extraction

Official

Provides web data extraction and manipulation capabilities through the LSD programming language, enabling structured data retrieval from websites, web searches, and community-created extraction patterns without complex scraping code.

Browser Automation Web Scraping
Dumpling AI

Dumpling AI

Official

Provides a bridge to Dumpling AI's data extraction API for performing web searches, scraping content, extracting structured data, and processing various document formats through 20+ specialized tools.

Web Scraping AI and Machine Learning
230
0
O

Omnisearch

Unifies search and content processing by dynamically selecting optimal providers like Tavily, Brave, and Perplexity to enable flexible information retrieval and enhancement across multiple domains.

Web Search Web Scraping
227
0
FireCrawl

FireCrawl

Official Remote Remote

Integration with FireCrawl to provide advanced web scraping capabilities for extracting structured data from complex websites.

Web Scraping Research
W

WebScout

Automates reverse engineering of chat interfaces through browser automation and network traffic analysis, capturing streaming API endpoints and providing browser control for analyzing chat APIs without official documentation.

Browser Automation Web Scraping
L

LinkedIn API

Bridges AI systems with LinkedIn's API for searching users, retrieving profiles, accessing posts, managing connections, and sending messages to support sales prospecting, recruitment, and professional networking workflows.

Communication Web Scraping
213
0