How do I install Fetcher MCP?

Install Fetcher MCP with a single command: npx mdskills install jae-jae/fetcher-mcp. This downloads the skill files into your project and your AI agent picks them up automatically.

What platforms support Fetcher MCP?

Fetcher MCP works with Claude Code, Claude Desktop, Cursor, Vscode Copilot, Windsurf, Continue Dev, Gemini Cli, Amp, Roo Code, Goose. Skills use the open SKILL.md format which is compatible with any AI coding agent that reads markdown instructions.

← Back to MCP servers

Fetcher MCP

Name: Fetcher MCP: AI Agent Skill
Rating: 8 (1 reviews)
Author: jae-jae

Verified

MCP ServerTesting & QAIntermediate

Português | MCP server for fetch web page content using Playwright headless browser. - JavaScript Support: Unlike traditional web scrapers, Fetcher MCP uses Playwright to execute JavaScript, making it capable of handling dynamic web content and modern web applications. - Intelligent Content Extraction: Built-in Readability algorithm automatically extracts the main content from web pages, removing

by @jae-jae0Updated 2/24/2026

Add this skill

npx mdskills install jae-jae/fetcher-mcp

Fork & Edit

Skill Advisor8.0

Well-documented web scraping MCP with Playwright, intelligent content extraction, and parallel fetching

+Provides powerful tools for JavaScript-enabled web scraping with clear parameter descriptions
+Supports flexible content extraction with HTML/Markdown output and batch parallel processing
+Includes comprehensive setup instructions with multiple deployment options and debugging tips
-Requires filesystem write and shell execution permissions only for browser installation, not core fetching

SKILL.md

Edit in Browser

1<div align="center">
2  <img src="https://raw.githubusercontent.com/jae-jae/fetcher-mcp/refs/heads/main/icon.svg" width="100" height="100" alt="Fetcher MCP Icon" />
3</div>
4 
5[中文](https://www.readme-i18n.com/jae-jae/fetcher-mcp?lang=zh) |
6[Deutsch](https://www.readme-i18n.com/jae-jae/fetcher-mcp?lang=de) |
7[Español](https://www.readme-i18n.com/jae-jae/fetcher-mcp?lang=es) |
8[français](https://www.readme-i18n.com/jae-jae/fetcher-mcp?lang=fr) |
9[日本語](https://www.readme-i18n.com/jae-jae/fetcher-mcp?lang=ja) |
10[한국어](https://www.readme-i18n.com/jae-jae/fetcher-mcp?lang=ko) |
11[Português](https://www.readme-i18n.com/jae-jae/fetcher-mcp?lang=pt) |
12[Русский](https://www.readme-i18n.com/jae-jae/fetcher-mcp?lang=ru)
13 
14# Fetcher MCP
15 
16MCP server for fetch web page content using Playwright headless browser.
17 
18> 🌟 **Recommended**: [OllaMan](https://ollaman.com/) - Powerful Ollama AI Model Manager.
19 
20## Advantages
21 
22- **JavaScript Support**: Unlike traditional web scrapers, Fetcher MCP uses Playwright to execute JavaScript, making it capable of handling dynamic web content and modern web applications.
23 
24- **Intelligent Content Extraction**: Built-in Readability algorithm automatically extracts the main content from web pages, removing ads, navigation, and other non-essential elements.
25 
26- **Flexible Output Format**: Supports both HTML and Markdown output formats, making it easy to integrate with various downstream applications.
27 
28- **Parallel Processing**: The `fetch_urls` tool enables concurrent fetching of multiple URLs, significantly improving efficiency for batch operations.
29 
30- **Resource Optimization**: Automatically blocks unnecessary resources (images, stylesheets, fonts, media) to reduce bandwidth usage and improve performance.
31 
32- **Robust Error Handling**: Comprehensive error handling and logging ensure reliable operation even when dealing with problematic web pages.
33 
34- **Configurable Parameters**: Fine-grained control over timeouts, content extraction, and output formatting to suit different use cases.
35 
36## Quick Start
37 
38Run directly with npx:
39 
40```bash
41npx -y fetcher-mcp
42```
43 
44First time setup - install the required browser by running the following command in your terminal:
45 
46```bash
47npx playwright install chromium
48```
49 
50### HTTP and SSE Transport
51 
52Use the `--transport=http` parameter to start both Streamable HTTP endpoint and SSE endpoint services simultaneously:
53 
54```bash
55npx -y fetcher-mcp --log --transport=http --host=0.0.0.0 --port=3000
56```
57 
58After startup, the server provides the following endpoints:
59 
60- `/mcp` - Streamable HTTP endpoint (modern MCP protocol)
61- `/sse` - SSE endpoint (legacy MCP protocol)
62 
63Clients can choose which method to connect based on their needs.
64 
65### Debug Mode
66 
67Run with the `--debug` option to show the browser window for debugging:
68 
69```bash
70npx -y fetcher-mcp --debug
71```
72 
73## Configuration MCP
74 
75Configure this MCP server in Claude Desktop:
76 
77On MacOS: `~/Library/Application Support/Claude/claude_desktop_config.json`
78 
79On Windows: `%APPDATA%/Claude/claude_desktop_config.json`
80 
81```json
82{
83  "mcpServers": {
84    "fetcher": {
85      "command": "npx",
86      "args": ["-y", "fetcher-mcp"]
87    }
88  }
89}
90```
91 
92## Docker Deployment
93 
94### Running with Docker
95 
96```bash
97docker run -p 3000:3000 ghcr.io/jae-jae/fetcher-mcp:latest
98```
99 
100### Deploying with Docker Compose
101 
102Create a `docker-compose.yml` file:
103 
104```yaml
105version: "3.8"
106 
107services:
108  fetcher-mcp:
109    image: ghcr.io/jae-jae/fetcher-mcp:latest
110    container_name: fetcher-mcp
111    restart: unless-stopped
112    ports:
113      - "3000:3000"
114    environment:
115      - NODE_ENV=production
116    # Using host network mode on Linux hosts can improve browser access efficiency
117    # network_mode: "host"
118    volumes:
119      # For Playwright, may need to share certain system paths
120      - /tmp:/tmp
121    # Health check
122    healthcheck:
123      test: ["CMD", "wget", "--spider", "-q", "http://localhost:3000"]
124      interval: 30s
125      timeout: 10s
126      retries: 3
127```
128 
129Then run:
130 
131```bash
132docker-compose up -d
133```
134 
135## Features
136 
137- `fetch_url` - Retrieve web page content from a specified URL
138 
139  - Uses Playwright headless browser to parse JavaScript
140  - Supports intelligent extraction of main content and conversion to Markdown
141  - Supports the following parameters:
142    - `url`: The URL of the web page to fetch (required parameter)
143    - `timeout`: Page loading timeout in milliseconds, default is 30000 (30 seconds)
144    - `waitUntil`: Specifies when navigation is considered complete, options: 'load', 'domcontentloaded', 'networkidle', 'commit', default is 'load'
145    - `extractContent`: Whether to intelligently extract the main content, default is true
146    - `maxLength`: Maximum length of returned content (in characters), default is no limit
147    - `returnHtml`: Whether to return HTML content instead of Markdown, default is false
148    - `waitForNavigation`: Whether to wait for additional navigation after initial page load (useful for sites with anti-bot verification), default is false
149    - `navigationTimeout`: Maximum time to wait for additional navigation in milliseconds, default is 10000 (10 seconds)
150    - `disableMedia`: Whether to disable media resources (images, stylesheets, fonts, media), default is true
151    - `debug`: Whether to enable debug mode (showing browser window), overrides the --debug command line flag if specified
152 
153- `fetch_urls` - Batch retrieve web page content from multiple URLs in parallel
154  - Uses multi-tab parallel fetching for improved performance
155  - Returns combined results with clear separation between webpages
156  - Supports the following parameters:
157    - `urls`: Array of URLs to fetch (required parameter)
158    - Other parameters are the same as `fetch_url`
159 
160- `browser_install` - Install Playwright Chromium browser binary automatically
161 
162  - Installs required Chromium browser binary when not available
163  - Automatically suggested when browser installation errors occur
164  - Supports the following parameters:
165    - `withDeps`: Install system dependencies required by Chromium browser, default is false
166    - `force`: Force installation even if Chromium is already installed, default is false
167 
168## Tips
169 
170### Handling Special Website Scenarios
171 
172#### Dealing with Anti-Crawler Mechanisms
173 
174- **Wait for Complete Loading**: For websites using CAPTCHA, redirects, or other verification mechanisms, include in your prompt:
175 
176  ```
177  Please wait for the page to fully load
178  ```
179 
180  This will use the `waitForNavigation: true` parameter.
181 
182- **Increase Timeout Duration**: For websites that load slowly:
183  ```
184  Please set the page loading timeout to 60 seconds
185  ```
186  This adjusts both `timeout` and `navigationTimeout` parameters accordingly.
187 
188#### Content Retrieval Adjustments
189 
190- **Preserve Original HTML Structure**: When content extraction might fail:
191 
192  ```
193  Please preserve the original HTML content
194  ```
195 
196  Sets `extractContent: false` and `returnHtml: true`.
197 
198- **Fetch Complete Page Content**: When extracted content is too limited:
199 
200  ```
201  Please fetch the complete webpage content instead of just the main content
202  ```
203 
204  Sets `extractContent: false`.
205 
206- **Return Content as HTML**: When HTML format is needed instead of default Markdown:
207  ```
208  Please return the content in HTML format
209  ```
210  Sets `returnHtml: true`.
211 
212### Debugging and Authentication
213 
214#### Enabling Debug Mode
215 
216- **Dynamic Debug Activation**: To display the browser window during a specific fetch operation:
217  ```
218  Please enable debug mode for this fetch operation
219  ```
220  This sets `debug: true` even if the server was started without the `--debug` flag.
221 
222#### Using Custom Cookies for Authentication
223 
224- **Manual Login**: To login using your own credentials:
225 
226  ```
227  Please run in debug mode so I can manually log in to the website
228  ```
229 
230  Sets `debug: true` or uses the `--debug` flag, keeping the browser window open for manual login.
231 
232- **Interacting with Debug Browser**: When debug mode is enabled:
233 
234  1. The browser window remains open
235  2. You can manually log into the website using your credentials
236  3. After login is complete, content will be fetched with your authenticated session
237 
238- **Enable Debug for Specific Requests**: Even if the server is already running, you can enable debug mode for a specific request:
239  ```
240  Please enable debug mode for this authentication step
241  ```
242  Sets `debug: true` for this specific request only, opening the browser window for manual login.
243 
244## Development
245 
246### Install Dependencies
247 
248```bash
249npm install
250```
251 
252### Install Playwright Browser
253 
254Install the browsers needed for Playwright:
255 
256```bash
257npm run install-browser
258```
259 
260### Build the Server
261 
262```bash
263npm run build
264```
265 
266## Debugging
267 
268Use MCP Inspector for debugging:
269 
270```bash
271npm run inspector
272```
273 
274You can also enable visible browser mode for debugging:
275 
276```bash
277node build/index.js --debug
278```
279 
280## Related Projects
281 
282- [g-search-mcp](https://github.com/jae-jae/g-search-mcp): A powerful MCP server for Google search that enables parallel searching with multiple keywords simultaneously. Perfect for batch search operations and data collection.
283 
284## License
285 
286Licensed under the [MIT License](https://choosealicense.com/licenses/mit/)
287 
288[![Powered by DartNode](https://dartnode.com/branding/DN-Open-Source-sm.png)](https://dartnode.com "Powered by DartNode - Free VPS for Open Source")
289

Full transparency — inspect the skill content before installing.