How do I install Charlotte?

Install Charlotte with a single command: npx mdskills install TickTockBent/charlotte. This downloads the skill files into your project and your AI agent picks them up automatically.

What platforms support Charlotte?

Charlotte works with Claude Code, Claude Desktop, Cursor, Vscode Copilot, Windsurf, Continue Dev, Gemini Cli, Amp, Roo Code, Goose. Skills use the open SKILL.md format which is compatible with any AI coding agent that reads markdown instructions.

← Back to MCP servers

Charlotte

Name: Charlotte: AI Agent Skill
Rating: 8.7 (1 reviews)
Author: TickTockBent

Verified

MCP ServerAPI & BackendIntermediate

The Web, Readable. Your AI agent spends 60,000 tokens just to look at a web page. Charlotte does it in 336. Charlotte is an MCP server that gives AI agents structured, token-efficient access to the web. Instead of dumping the full accessibility tree on every call, Charlotte returns only what the agent needs: a compact page summary on arrival, targeted queries for specific elements, and full detail

by @TickTockBent0Updated 3/10/2026

Add this skill

npx mdskills install TickTockBent/charlotte

Fork & Edit

Skill Advisor8.7

Token-efficient browser automation MCP with 42 tools, structured page representation, and 25-182x data reduction

+Delivers massive token savings through three detail levels and structured page decomposition
+Provides comprehensive tool suite with profiles to control definition overhead
+Includes excellent benchmarks, clear setup docs, and multi-client configuration examples
-Shell execution permission not clearly justified by documented capabilities

SKILL.md

Edit in Browser

1# Charlotte
2 
3**The Web, Readable.**
4 
5Your AI agent spends 60,000 tokens just to look at a web page. Charlotte does it in 336.
6 
7Charlotte is an MCP server that gives AI agents structured, token-efficient access to the web.
8Instead of dumping the full accessibility tree on every call, Charlotte returns only what
9the agent needs: a compact page summary on arrival, targeted queries for specific elements,
10and full detail only when explicitly requested. The result is 25-182x less data per page
11compared to [Playwright MCP](https://github.com/anthropics/playwright-mcp), saving thousands of dollars across production workloads.
12 
13## Why Charlotte?
14 
15Most browser MCP servers dump the entire accessibility tree on every call — a flat text blob that can exceed a million characters on content-heavy pages. Agents pay for all of it whether they need it or not.
16 
17Charlotte decomposes each page into a typed, structured representation — landmarks, headings, interactive elements, forms, content summaries — and lets agents control how much they receive with three detail levels. When an agent navigates to a new page, it gets a compact orientation (336 characters for Hacker News) instead of the full element dump (61,000+ characters). When it needs specifics, it asks for them.
18 
19### Benchmarks
20 
21Charlotte v0.5.0 vs Playwright MCP, measured by characters returned per tool call on real websites:
22 
23**Navigation** (first contact with a page):
24 
25| Site | Charlotte `navigate` | Playwright `browser_navigate` |
26|:---|---:|---:|
27| example.com | 612 | 817 |
28| Wikipedia (AI article) | 7,667 | 1,040,636 |
29| Hacker News | 336 | 61,230 |
30| GitHub repo | 3,185 | 80,297 |
31 
32Charlotte's `navigate` returns minimal detail by default — landmarks, headings, and interactive element counts grouped by page region. Enough to orient, not enough to overwhelm. On Wikipedia, that's **135x smaller** than Playwright's response.
33 
34**Tool definition overhead** (invisible cost per API call):
35 
36| Profile | Tools | Def. tokens/call | Savings vs full |
37|:---|---:|---:|---:|
38| full | 42 | ~7,400 | — |
39| browse (default) | 23 | ~3,900 | **~47%** |
40| core | 7 | 1,677 | **~77%** |
41 
42Tool definitions are sent on every API round-trip. With the default `browse` profile, Charlotte carries ~47% less definition overhead than loading all tools. Over a 20-call browsing session, that's **~38% fewer total tokens**. See the [profile benchmark report](docs/charlotte-profile-benchmark-report.md) for full results.
43 
44**The workflow difference:** Playwright agents receive 61K+ characters every time they look at Hacker News, whether they're reading headlines or looking for a login button. Charlotte agents get 336 characters on arrival, call `find({ type: "link", text: "login" })` to get exactly what they need, and never pay for the rest.
45 
46## How It Works
47 
48Charlotte maintains a persistent headless Chromium session and acts as a translation layer between the visual web and the agent's text-native reasoning. Every page is decomposed into a structured representation:
49 
50```
51┌─────────────┐     MCP Protocol     ┌──────────────────┐
52│   AI Agent  │<────────────────────>│    Charlotte     │
53└─────────────┘                      │                  │
54                                     │  ┌────────────┐  │
55                                     │  │  Renderer  │  │
56                                     │  │  Pipeline  │  │
57                                     │  └─────┬──────┘  │
58                                     │        │         │
59                                     │  ┌─────▼──────┐  │
60                                     │  │  Headless  │  │
61                                     │  │  Chromium  │  │
62                                     │  └────────────┘  │
63                                     └──────────────────┘
64```
65 
66Agents receive landmarks, headings, interactive elements with typed metadata, bounding boxes, form structures, and content summaries — all derived from what the browser already knows about every page.
67 
68## Features
69 
70**Navigation** — `navigate`, `back`, `forward`, `reload`
71 
72**Observation** — `observe` (3 detail levels, structural tree view), `find` (spatial + semantic search, CSS selector mode), `screenshot` (with persistent artifact management), `screenshots`, `screenshot_get`, `screenshot_delete`, `diff` (structural comparison against snapshots)
73 
74**Interaction** — `click`, `click_at` (coordinate-based), `type`, `select`, `toggle`, `submit`, `scroll`, `hover`, `drag`, `key` (single/sequence with element targeting), `wait_for` (async condition polling), `upload` (file input), `dialog` (accept/dismiss JS dialogs)
75 
76**Monitoring** — `console` (all severity levels, filtering, timestamps), `requests` (full HTTP history, method/status/resource type filtering)
77 
78**Session Management** — `tabs`, `tab_open`, `tab_switch`, `tab_close`, `viewport` (device presets), `network` (throttling, URL blocking), `set_cookies`, `get_cookies`, `clear_cookies`, `set_headers`, `configure`
79 
80**Development Mode** — `dev_serve` (static server + file watching with auto-reload), `dev_inject` (CSS/JS injection), `dev_audit` (a11y, performance, SEO, contrast, broken links)
81 
82**Utilities** — `evaluate` (arbitrary JS execution in page context)
83 
84## Tool Profiles
85 
86Charlotte ships 42 tools (41 registered + the `charlotte:tools` meta-tool), but most workflows only need a subset. Startup profiles control which tools load into the agent's context, reducing definition overhead by up to 77%.
87 
88```bash
89charlotte --profile browse    # 23 tools (default) — navigate, observe, interact, tabs
90charlotte --profile core      # 7 tools — navigate, observe, find, click, type, submit
91charlotte --profile full      # 42 tools — everything
92charlotte --profile interact  # 30 tools — full interaction + dialog + evaluate
93charlotte --profile develop   # 33 tools — interact + dev_serve, dev_inject, dev_audit
94charlotte --profile audit     # 14 tools — navigation + observation + dev_audit + viewport
95```
96 
97Agents can activate more tools mid-session without restarting:
98 
99```
100charlotte:tools enable dev_mode    → activates dev_serve, dev_audit, dev_inject
101charlotte:tools disable dev_mode   → deactivates them
102charlotte:tools list               → see what's loaded
103```
104 
105## Quick Start
106 
107### Prerequisites
108 
109- Node.js >= 22
110- npm
111 
112### Installation
113 
114Charlotte is listed on the [MCP Registry](https://registry.modelcontextprotocol.io) as `io.github.TickTockBent/charlotte` and published on npm as [`@ticktockbent/charlotte`](https://www.npmjs.com/package/@ticktockbent/charlotte):
115 
116```bash
117npm install -g @ticktockbent/charlotte
118```
119 
120Docker images are available on [Docker Hub](https://hub.docker.com/r/ticktockbent/charlotte) and [GitHub Container Registry](https://github.com/ticktockbent/charlotte/pkgs/container/charlotte):
121 
122```bash
123# Alpine (default, smaller)
124docker pull ticktockbent/charlotte:alpine
125 
126# Debian (if you need glibc compatibility)
127docker pull ticktockbent/charlotte:debian
128 
129# Or from GHCR
130docker pull ghcr.io/ticktockbent/charlotte:latest
131```
132 
133Or install from source:
134 
135```bash
136git clone https://github.com/ticktockbent/charlotte.git
137cd charlotte
138npm install
139npm run build
140```
141 
142### Run
143 
144Charlotte communicates over stdio using the MCP protocol:
145 
146```bash
147# If installed globally (default browse profile)
148charlotte
149 
150# With a specific profile
151charlotte --profile core
152 
153# If installed from source
154npm start
155```
156 
157### MCP Client Configuration
158 
159#### Claude Code
160 
161Create `.mcp.json` in your project root:
162 
163```json
164{
165  "mcpServers": {
166    "charlotte": {
167      "type": "stdio",
168      "command": "npx",
169      "args": ["@ticktockbent/charlotte"],
170      "env": {}
171    }
172  }
173}
174```
175 
176#### Claude Desktop
177 
178Add to `claude_desktop_config.json`:
179 
180```json
181{
182  "mcpServers": {
183    "charlotte": {
184      "command": "npx",
185      "args": ["@ticktockbent/charlotte"]
186    }
187  }
188}
189```
190 
191#### Cursor
192 
193Add to `.cursor/mcp.json`:
194 
195```json
196{
197  "mcpServers": {
198    "charlotte": {
199      "command": "npx",
200      "args": ["@ticktockbent/charlotte"]
201    }
202  }
203}
204```
205 
206#### Windsurf
207 
208Add to `~/.codeium/windsurf/mcp_config.json`:
209 
210```json
211{
212  "mcpServers": {
213    "charlotte": {
214      "command": "npx",
215      "args": ["@ticktockbent/charlotte"]
216    }
217  }
218}
219```
220 
221#### VS Code (Copilot)
222 
223Add to `.vscode/mcp.json`:
224 
225```json
226{
227  "servers": {
228    "charlotte": {
229      "type": "stdio",
230      "command": "npx",
231      "args": ["@ticktockbent/charlotte"]
232    }
233  }
234}
235```
236 
237#### Cline
238 
239Add to Cline MCP settings (via the Cline sidebar > MCP Servers > Configure):
240 
241```json
242{
243  "mcpServers": {
244    "charlotte": {
245      "command": "npx",
246      "args": ["@ticktockbent/charlotte"]
247    }
248  }
249}
250```
251 
252#### Amp
253 
254Add to `~/.amp/settings.json`:
255 
256```json
257{
258  "mcpServers": {
259    "charlotte": {
260      "command": "npx",
261      "args": ["@ticktockbent/charlotte"]
262    }
263  }
264}
265```
266 
267See [docs/mcp-setup.md](docs/mcp-setup.md) for the full setup guide, including development mode, generic MCP clients, verification steps, and troubleshooting.
268 
269## Usage Examples
270 
271Once connected, an agent can use Charlotte's tools:
272 
273### Browse a website
274 
275```
276navigate({ url: "https://example.com" })
277// → 612 chars: landmarks, headings, interactive element counts
278 
279find({ type: "link", text: "More information" })
280// → just the matching element with its ID
281 
282click({ element_id: "lnk-a3f1" })
283```
284 
285### Fill out a form
286 
287```
288navigate({ url: "https://httpbin.org/forms/post" })
289find({ type: "text_input" })
290type({ element_id: "inp-c7e2", text: "hello@example.com" })
291select({ element_id: "sel-e8a3", value: "option-2" })
292submit({ form_id: "frm-b1d4" })
293```
294 
295### Local development feedback loop
296 
297```
298dev_serve({ path: "./my-site", watch: true })
299observe({ detail: "full" })
300dev_audit({ checks: ["a11y", "contrast"] })
301dev_inject({ css: "body { font-size: 18px; }" })
302```
303 
304## Page Representation
305 
306Charlotte returns structured representations with three detail levels that let agents control how much context they consume:
307 
308### Minimal (default for `navigate`)
309 
310Landmarks, headings, and interactive element counts grouped by page region. Designed for orientation — "what's on this page?" — without listing every element.
311 
312```json
313{
314  "url": "https://news.ycombinator.com",
315  "title": "Hacker News",
316  "viewport": { "width": 1280, "height": 720 },
317  "structure": {
318    "headings": [{ "level": 1, "text": "Hacker News", "id": "h-a1b2" }]
319  },
320  "interactive_summary": {
321    "total": 93,
322    "by_landmark": {
323      "(page root)": { "link": 91, "text_input": 1, "button": 1 }
324    }
325  }
326}
327```
328 
329### Summary (default for `observe`)
330 
331Full interactive element list with typed metadata, form structures, and content summaries.
332 
333```json
334{
335  "url": "https://example.com/dashboard",
336  "title": "Dashboard",
337  "viewport": { "width": 1280, "height": 720 },
338  "structure": {
339    "landmarks": [
340      { "id": "rgn-b2c1", "role": "banner", "label": "Site header", "bounds": { "x": 0, "y": 0, "w": 1280, "h": 64 } },
341      { "id": "rgn-d4e5", "role": "main", "label": "Content", "bounds": { "x": 240, "y": 64, "w": 1040, "h": 656 } }
342    ],
343    "headings": [{ "level": 1, "text": "Dashboard", "id": "h-1a2b" }],
344    "content_summary": "main: 2 headings, 5 links, 1 form"
345  },
346  "interactive": [
347    {
348      "id": "btn-a3f1",
349      "type": "button",
350      "label": "Create Project",
351      "bounds": { "x": 960, "y": 80, "w": 160, "h": 40 },
352      "state": {}
353    }
354  ],
355  "forms": []
356}
357```
358 
359### Full
360 
361Everything in summary, plus all visible text content on the page.
362 
363## Detail Levels
364 
365| Level | Tokens | Use case |
366|:---|:---|:---|
367| `minimal` | ~50-200 | Orientation after navigation. What regions exist? How many interactive elements? |
368| `summary` | ~500-5000 | Working with the page. Full element list, form structures, content summaries. |
369| `full` | variable | Reading page content. All visible text included. |
370 
371Navigation tools default to `minimal`. The `observe` tool defaults to `summary`. Both accept an optional `detail` parameter to override.
372 
373## Element IDs
374 
375Element IDs are stable across minor DOM mutations. They're generated by hashing a composite key of element type, ARIA role, accessible name, and DOM path signature:
376 
377```
378btn-a3f1  (button)    inp-c7e2  (text input)
379lnk-d4b9  (link)      sel-e8a3  (select)
380chk-f1a2  (checkbox)  frm-b1d4  (form)
381rgn-e0d2  (landmark)  hdg-0f40  (heading)
382dom-b2c3  (DOM element, from CSS selector queries)
383```
384 
385IDs survive unrelated DOM changes and element reordering within the same container. When an agent navigates at minimal detail (no individual element IDs), it uses `find` to locate elements by text, type, or spatial proximity — the returned elements include IDs ready for interaction.
386 
387## Development
388 
389```bash
390# Run in watch mode
391npm run dev
392 
393# Run all tests
394npm test
395 
396# Run only unit tests
397npm run test:unit
398 
399# Run only integration tests
400npm run test:integration
401 
402# Type check
403npx tsc --noEmit
404```
405 
406### Project Structure
407 
408```
409src/
410  browser/          # Puppeteer lifecycle, tab management, CDP sessions
411  renderer/         # Accessibility tree extraction, layout, content, element IDs
412  state/            # Snapshot store, structural differ
413  tools/            # MCP tool definitions (navigation, observation, interaction, session, dev-mode)
414  dev/              # Static server, file watcher, auditor
415  types/            # TypeScript interfaces
416  utils/            # Logger, hash, wait utilities
417tests/
418  unit/             # Fast tests with mocks
419  integration/      # Full Puppeteer tests against fixture HTML
420  fixtures/pages/   # Test HTML files
421```
422 
423### Architecture
424 
425The **Renderer Pipeline** is the core — it calls extractors in order and assembles a `PageRepresentation`:
426 
4271. Accessibility tree extraction (CDP `Accessibility.getFullAXTree`)
4282. Layout extraction (CDP `DOM.getBoxModel`)
4293. Landmark, heading, interactive element, and content extraction
4304. Element ID generation (hash-based, stable across re-renders)
431 
432All tools go through `renderActivePage()` which handles snapshots, reload events, dialog detection, and response formatting.
433 
434## Sandbox
435 
436Charlotte includes a test website in `tests/sandbox/` that exercises all tools without touching the public internet. Serve it locally with:
437 
438```
439dev_serve({ path: "tests/sandbox" })
440```
441 
442Four pages cover navigation, forms, interactive elements, delayed content, scroll containers, and more. See [docs/sandbox.md](docs/sandbox.md) for the full page reference and a tool-by-tool exercise checklist.
443 
444## Known Issues
445 
446**Tool naming convention** — Charlotte uses `:` as a namespace separator in tool names (e.g., `charlotte:navigate`, `charlotte:observe`). MCP SDK v1.26.0+ logs validation warnings for this character, as the emerging [SEP standard](https://github.com/modelcontextprotocol/modelcontextprotocol/issues/986) restricts tool names to `[A-Za-z0-9_.-]`. This does not affect functionality — all tools work correctly — but produces stderr warnings on server startup. Will be addressed in a future release to comply with the SEP standard.
447 
448**Shadow DOM** — Open shadow DOM works transparently. Chromium's accessibility tree pierces open shadow boundaries, so web components (e.g., GitHub's `<relative-time>`, `<tool-tip>`) render their content into Charlotte's representation without special handling. Closed shadow roots are opaque to the accessibility tree and will not be captured.
449 
450## Roadmap
451 
452### Interaction Gaps
453 
454**Batch Form Fill** — Add a `charlotte:fill_form` tool that accepts an array of `{element_id, value}` pairs and fills an entire form in a single tool call, reducing N sequential `type`/`select`/`toggle` calls to one.
455 
456**Slow Typing** — Add a `slowly` or `character_delay` parameter to `charlotte:type` for character-by-character input. Required for sites with key-by-key event handlers (autocomplete, search-as-you-type, input validation).
457 
458### Session & Configuration
459 
460**Connect to Existing Browser** — Add a `--cdp-endpoint` CLI argument so Charlotte can attach to an already-running browser via `puppeteer.connect()` instead of always launching a new instance. Enables working with logged-in sessions and browser extensions.
461 
462**Persistent Init Scripts** — Add a `--init-script` CLI argument to inject JavaScript on every page load via `page.evaluateOnNewDocument()`. Charlotte's `dev_inject` currently applies CSS/JS once and does not persist across navigations.
463 
464**Configuration File** — Support a `--config` CLI argument to load settings from a JSON file, simplifying repeatable setups and CI/CD integration.
465 
466**Full Device Emulation** — Extend `charlotte:viewport` to accept named devices (e.g., "iPhone 15") and configure user agent, touch support, and device pixel ratio via CDP, not just viewport dimensions.
467 
468### Feature Roadmap
469 
470**Video Recording** — Record interactions as video, capturing the full sequence of agent-driven navigation and manipulation for debugging, documentation, and review.
471 
472**ARM64 Docker Images** — Add `linux/arm64` platform support to the Docker publish workflow for native performance on Apple Silicon Macs and ARM servers.
473 
474See [docs/playwright-mcp-gap-analysis.md](docs/playwright-mcp-gap-analysis.md) for the full gap analysis against Playwright MCP, including lower-priority items (vision tools, testing/verification, tracing, transport, security) and areas where Charlotte has advantages.
475 
476## Full Specification
477 
478See [docs/CHARLOTTE_SPEC.md](docs/CHARLOTTE_SPEC.md) for the complete specification including all tool parameters, the page representation format, element identity strategy, and architecture details.
479 
480## License
481 
482[MIT](LICENSE)
483 
484## Contributing
485 
486See [CONTRIBUTING.md](CONTRIBUTING.md) for guidelines.
487---
488 
489*Part of a growing suite of literary-named MCP servers. See more at [github.com/TickTockBent](https://github.com/TickTockBent).*
490

Full transparency — inspect the skill content before installing.