"Not a browser for humans that AI can use. A browser built for AI that humans can watch." Quick Start • Benchmarks • Architecture • Security • Documentation Traditional browsers (Chrome, Firefox) and automation tools (Playwright, Selenium) were built for human retinas and pixels. Treating a web page as an XML document or an Accessibility Tree is the wrong abstraction for intelligence. Axon is enti
Add this skill
npx mdskills install rennaisance-jomt/axonComprehensive AI-native browser documentation with strong technical depth and clear value proposition
1<div align="center">2 <h1>AXON</h1>3</div>45<div align="center">67**"Not a browser for humans that AI can use. A browser built for AI that humans can watch."**89[](https://github.com/rennaisance-jomt/axon)10[](https://go.dev/)11[](LICENSE)12[](docs/ARCHITECTURE.md)1314[Quick Start](#quick-start) • [Benchmarks](#proven-benchmarks) • [Architecture](#architecture) • [Security](#security-first) • [Documentation](docs/)1516</div>1718---1920## DOM is the old way of interacting with AI2122Traditional browsers (Chrome, Firefox) and automation tools (Playwright, Selenium) were built for human retinas and pixels. Treating a web page as an XML document or an Accessibility Tree is the wrong abstraction for intelligence.2324**Axon is entirely different.** It is not an automation library; it is a fundamental integration layer—a sensory cortex for LLMs. It translates the chaotic visual web into a **Semantic Intent Space**.2526Think of Axon as a **core LEGO block in your AI infrastructure**. It plugs seamlessly into your agent frameworks (via MCP or SDKs) and gives them native, structured understanding of the web.2728---2930## The Economic Reality: 98% Cost Reduction3132The biggest bottleneck in AI agent adoption is API cost—spending dollars in tokens just to read a single webpage. **Axon guarantees the economic survival of agents.**3334| Scenario | Standard Browser (Raw HTML) | Axon (Semantic Space) |35| :--- | :--- | :--- |36| **Summarize Hacker News** | ~50,000 tokens ($2.00) | **~150 tokens ($0.02)** |37| **Find & Post a Tweet** | ~85,000 tokens ($3.40) | **~350 tokens ($0.05)** |38| **Login to GitHub** | ~120,000 tokens ($4.80) | **~500 tokens ($0.08)** |3940---4142## Proven Benchmarks4344Axon is fundamentally more efficient than standard browser automation. Verified against real-world targets:4546| Metric | Axon Performance | Standard Headless | Result |47| :--- | :--- | :--- | :--- |48| **Token Usage** | **~100 tokens** | ~8,000+ tokens | **98% Reduction** |49| **Page Latency** | **1.2s** | 2.7s | **55.5% Faster** |50| **Boot Time** | **~15ms** | ~800ms | **Sub-50ms Sessions** |51| **Memory Footprint** | **~10MB** | ~200MB+ | **Massive Density** |5253> *Benchmarks verified on Wikipedia and CNN.com (March 2026).*5455---5657## Under The Hood: How Axon Dominates5859Standard "Agent Browsers" wrap heavy QA-testing tools like Playwright in Node.js or Python, resulting in massive dependency chains, slow boot times, and sluggish execution.6061**Axon changes the rules of the game at the lowest level:**62631. **Native C++ CDP (Zero Wrappers):** Axon is a single compiled Go binary. It speaks directly to Chromium's C++ rendering and accessibility layers. No Node.js. No Playwright. Just pure, native speed.642. **Semantic Network Filtering:** Axon actively intercepts all network traffic at the protocol level. We drop heavy fonts (`.woff2`), images, video strings, trackers, and ad-networks *before* they ever hit browser memory. **We strip the visual web away**, making page loads virtually instantaneous.653. **Event-Driven Auto-Waiting:** Flaky integrations use `time.Sleep()` or guess when a page is ready. Axon listens to native C++ `DOMNodeInserted` and `AnimationCanceled` rendering events. When an agent clicks a button, Axon waits synchronously at the engine level until the element is perfectly stable.664. **Cross-Session Intent Memory:** Axon caches learned semantic relationships in an embedded database (BadgerDB). If an agent learns what the "Login" button looks like on a site today, it never has to wait for the LLM to search the DOM for it tomorrow.675. **The Cognitive Firewall:** Before an agent even sees the DOM, Axon actively scans it for prompt injections. Actions classified as "Irreversible" (like deleting data or spending money) are dynamically quarantined for explicit agent confirmation.686. **Action-Triggered Visual Proofs:** Video recordings are expensive and heavy. Axon dynamically captures lightweight, frame-burst visual proofs of what the AI did (before, during, and after an action), ensuring verifiable agent execution at a fraction of the storage cost.6970---7172## Comparison: How Axon Fits In7374There are many great tools in the AI browser space. Here is how Axon compares to other popular approaches:7576| Feature | **Axon** | **Stagehand** | **Lightpanda** | **Vercel Agent** | **Skyvern** |77| :--- | :--- | :--- | :--- | :--- | :--- |78| **Core Tech** | Go (Native) | TS/Node.js | Zig (Native) | Rust (Native) | Python/Vision |79| **Engine** | Optimized Chromium| Standard Chromium| Custom (Unique) | Headless Chrome | Computer Vision |80| **Logic** | Semantic Graphs | Hybrid/Playwright | Pure Runtime | Snapshot JSON | Visual Analysis |81| **Security** | Built-in Vault | App Layer | Sandbox | CLI-based | System-level |82| **Main Advantage** | Native Efficiency | Ease of Use | Ultra-Low RAM | Developer Speed | Adaptability |8384### Key Differentiators85- **Axon vs. Vercel Agent Browser**: Vercel’s tool is a fast Rust-based CLI that focuses on snapshotting pages into JSON for LLMs. Axon goes deeper by providing a persistent, stateful engine with real-time semantic interaction and a secure credential vault.86- **Axon vs. Skyvern**: Skyvern uses computer vision to "see" and interact like a human. It's great for handling complex UI changes. Axon focuses on the underlying semantic tree, making it much faster and more token-efficient for high-volume automated tasks.87- **Axon vs. Stagehand**: Stagehand is an excellent framework wrapping Playwright. Axon is a standalone engine that replaces heavy automation libraries with a single, high-performance binary.88- **Axon vs. Lightpanda**: Lightpanda is a from-scratch Zig engine. Axon maintains total web compatibility by using a customized Chromium core while keeping resource usage extremely low.8990---9192## Architecture9394Axon is built in pure **Go** with a unique modular architecture designed for agents:9596```mermaid97graph TD98 V[<b>Layer 6: Visual Debugger</b><br/>Real-time Perception Overlay] --> A99 A[<b>Layer 5: Agent Interface</b><br/>REST API / MCP / Python / Node.js] --> B100 B[<b>Layer 4: Axon Intelligence</b><br/>Semantic Graphs / Intent Classifier] --> C101 C[<b>Layer 3: Axon Security</b><br/>SSRF Guard / Action Classifier] --> D102 D[<b>Layer 2: Control Server</b><br/>Session Pool / State Management] --> E103 E[<b>Layer 1: Browser Runtime</b><br/>Optimized Chromium / Go-Rod]104105 style V fill:#333333,stroke:#ffffff,stroke-width:2px,color:#ffffff106 style A fill:#333333,stroke:#ffffff,stroke-width:2px,color:#ffffff107 style B fill:#333333,stroke:#ffffff,stroke-width:2px,color:#ffffff108 style C fill:#333333,stroke:#ffffff,stroke-width:2px,color:#ffffff109 style D fill:#333333,stroke:#ffffff,stroke-width:2px,color:#ffffff110 style E fill:#333333,stroke:#ffffff,stroke-width:2px,color:#ffffff111```112113---114115## The Axon Ecosystem116117Axon provides a complete toolkit to bring production-grade browser capabilities to your agents:118- **Python & Node.js SDKs**: Full-featured client libraries for rapid integration.119- **Model Context Protocol (MCP)**: Use Axon natively as a server with Claude Desktop or MCP clients.120- **Vision Debugger API**: A 60fps WebSocket overlay that lets you *watch* what your agent is thinking.121- **Axon CLI**: Manage sessions and interact with the browser directly from the terminal.122123---124125## The Vision: Why the AI-native web matters126127Currently, AI is strapped to a browser designed for humans. We are wasting compute parsing pixels, flexboxes, and Javascript UI state. The internet is built for human consumption. Axon creates an invisible, machine-to-machine version of the internet that LLMs can naturally perceive, without losing the ability to interact with dynamic web apps. Read the full manifesto in [docs/VISION.md](docs/VISION.md).128129## What Can You Build with Axon?130131Because Axon makes interactions 98% cheaper and 10x more stable, entirely new agent architectures become possible:132- **Autonomous Researchers:** Agents that read thousands of pages a day to compile deep market analysis without bankrupting their creators on API tokens.133- **Social & Community Managers:** Bots that navigate Twitter, Discord, and Reddit to actively monitor sentiment, flag issues, and politely engage.134- **Financial Scrapers:** Systems that execute real-time extractions of complex financial data from highly dynamic, Javascript-heavy terminal UIs.135136Read more examples in [docs/USE_CASES.md](docs/USE_CASES.md).137138---139140## Quick Start141142### 1. Installation143Requires **Go 1.22+**.144145```bash146# Clone the vault147git clone https://github.com/rennaisance-jomt/axon.git148cd axon149150# Build the binary151make build152```153154### 2. Launch the Engine155```bash156./bin/axon157```158159### 3. Basic Session160Axon speaks pure REST. Any language can control it.161162```bash163# Create a session164curl -X POST http://localhost:8020/api/v1/sessions -d '{"id": "demo"}'165166# Navigate & Analyze167curl -X POST http://localhost:8020/api/v1/sessions/demo/navigate -d '{"url": "https://news.ycombinator.com"}'168curl -X POST http://localhost:8020/api/v1/sessions/demo/snapshot169```170171### 4. Zero-Config LangChain Integration172173Axon is designed to slip perfectly into your existing reasoning loop:174175```python176from axon.langchain import AxonBrowserToolkit177from langchain.agents import initialize_agent178179# Give the agent its sensory organs180tools = AxonBrowserToolkit(session="x_main").get_tools()181182# Let it loose183agent = initialize_agent(tools, llm)184agent.run("Go to Hacker News, find the top AI post, and summarize the comments.")185```186187---188189## The Perception Shift: Death of the DOM190191Standard headless tools force agents to parse miles of useless HTML or accessibility nodes. Axon collapses the web into pure semantic reality.192193### Before: The Standard Way (Playwright/Puppeteer)194```html195<div class="header-nav-wrapper">196 <nav aria-label="Primary" role="navigation">197 <ul class="nav-list">198 <li class="nav-item"><a href="/new" class="nav-link" tabindex="0">new</a></li>199 <li class="nav-item"><a href="/past" class="nav-link" tabindex="0">past</a></li>200 <!-- 60,000 more characters of divs, spans, and attributes -->...201```202*Total tokens: ~8,000+. High hallucination risk. Massive API cost.*203204### After: The Axon Way (Semantic Intent Space)205```text206PAGE: news.ycombinator.com | State: ready207TITLE: Hacker News208209NAV:210 [n1] new [n2] past [n3] comments [n4] ask [n5] show [n6] jobs [n7] submit211212FEED:213 [e1] "Show HN: Axon - An AI-native browser" (link)214 [e2] "96% token reduction is real" (link)215 [e3] "Why traditional browsers fail agents" (link)216217ACTIONS:218 [a1] login (link) — auth.login219 [a2] search (textbox) — search.query220```221*Total tokens: ~85. Ready for immediate LLM reasoning.*222223---224225## Security First226227Axon is built for the hostile web. It includes native defenses that standard automation lacks:228229- **SSRF Guard**: Pre-navigation validation to prevent internal network scanning.230- **Action Reversibility**: Actions like "Delete Account" or "Post" are classified as **Irreversible** and require explicit "confirm: true".231- **Prompt Injection Scanner**: Detects malicious instructions hidden in webpage text before the agent parses it.232- **Secure Intelligence Vault**: Domain-bound credential storage that prevents agents from leaking secrets to unauthorized origins.233- **Cryptographic Audit**: Every action is hashed into an append-only, tamper-evident ledger.234235---236237## The Secure Intelligence Vault (BadgerVault)238239Axon includes a military-grade secret vault built directly into the engine. Unlike standard browsers where agents might "see" and "leak" credentials, Axon's vault ensures secrets are only injected into the correct domains.240241- **Domain Binding**: A secret for `github.com` cannot be used on `evil-phish.com`, even if the agent is tricked.242- **Physical Masking**: Credentials injected into the DOM are physically masked (rendered as `******`) from visual snapshots and stream replays.243- **Encrypted at Rest**: All secrets are stored in an AES-256-GCM encrypted local database.244245---246247## 📚 Deep Dives248249| Guide | Description |250| :--- | :--- |251| [**The Vision**](docs/VISION.md) | Why the AI-native web matters for the future. |252| [**Real-World Use Cases**](docs/USE_CASES.md) | What agents can actually *do* with Axon. |253| [**Getting Started**](docs/GETTING_STARTED.md) | Step-by-step installation and first session. |254| [**Architecture**](docs/ARCHITECTURE.md) | Deep technical dive into the 5-layer stack. |255| [**API Reference**](docs/API_SPEC.md) | Full REST API specification. |256| [**Security Model**](docs/SECURITY.md) | How we keep your agents (and data) safe. |257258---259260<div align="center">261262*Axon Project | 2026*263*An AI-native browser built with for AI agents.*264265</div>266267
Full transparency — inspect the skill content before installing.