Filters
Clear All FiltersWeb Scraping Servers
51 servers found
Fetch and Convert
Fetches and converts web content to Markdown using JSDOM and Turndown.
Airbnb
Integrates with Airbnb to enable vacation rental search and detailed property information retrieval without requiring API keys
Bright Data
Integrates with Bright Data's web scraping infrastructure to provide real-time access to public web data through specialized tools for search engine scraping, webpage extraction, and structured data retrieval from popular websites.
Web Fetcher
Fetches and extracts web content using Playwright's headless browser capabilities, delivering clean, readable content from JavaScript-heavy websites in HTML or Markdown format for research and information gathering.
Chinese Trends Hub
Provides real-time access to trending topics and content from major Chinese platforms including Weibo, Zhihu, Douyin, Bilibili, Douban, Toutiao, and 36kr through separate tools with temporary caching for improved performance.
Playwright
Automate web browsers for testing, scraping, and visual analysis.
DuckDuckGo Search
Provides web search capabilities through DuckDuckGo, enabling content retrieval, URL processing, and metadata extraction with customizable filtering options
YouTube Transcripts
Extract and analyze video captions and subtitles in multiple languages.
YouTube Transcript
Extracts and formats YouTube video transcripts with language selection, paragraph formatting, and metadata enrichment for content analysis and research workflows.
DeepWiki
Instantly turn any Deepwiki article into clean, structured Markdown you can use anywhere. Deepwiki MCP Server safely crawls deepwiki.com pages, removes clutter like ads and navigation, rewrites links for Markdown, and offers fast performance with customizable output formats. Choose a single document or organize content by page, and easily extract documentation or guides for any supported library. It’s designed for secure, high-speed conversion and clear, easy-to-read results—making documentation and learning seamless.
Web Content Pick
Extracts structured content from web pages using customizable selectors for crawling, parsing, and analyzing HTML elements without leaving the assistant interface.
Puppeteer Real Browser
Provides stealth browser automation using puppeteer-real-browser with anti-detection features, human-like interactions, proxy support, and captcha solving for web scraping, testing, and form automation that bypasses bot detection mechanisms.
Fetch (TypeScript)
Integrates with web content sources to fetch, convert, and summarize online information for real-time data retrieval and analysis.
Hotnews (Chinese Social)
Aggregates real-time trending topics from major Chinese social platforms and news sites.
YouTube Subtitles
Integrates YouTube subtitle retrieval for natural language queries about video content.
Read Website Fast
Extracts web content and converts it to clean Markdown format using Mozilla Readability for intelligent article detection, with disk-based caching, robots.txt compliance, and concurrent crawling capabilities for fast content processing workflows.
Enables scraping of Weibo user information, feeds, and search functionality with tools for user discovery, profile retrieval, and feed access
GitHub Repo Extractor
Connects to GitHub repositories, enabling natural language queries about code structure, dependencies, and development history.
Chrome Debug Protocol
Provides browser automation capabilities through Chrome's debugging protocol with session persistence, enabling web scraping, testing, and automation tasks with tools for screenshots, navigation, element interaction, and content retrieval.
OSRS Wiki
Provides tools for accessing Old School RuneScape game data through wiki searches and structured file queries with pagination support
Webpage Timestamps
Extracts webpage creation, modification, and publication timestamps from HTML meta tags, HTTP headers, JSON-LD structured data, microdata, OpenGraph, and Twitter cards with confidence scoring and intelligent consolidation for content freshness analysis and temporal metadata extraction.