Open source · MIT licensed

Token-efficient web browser
for LLM agents

A typical web page is 50,000+ tokens. The useful content? 2,000–5,000. BotBrowser strips the bloat and returns clean markdown — saving 90–95% of tokens.

Raw HTML

52,000

BotBrowser

3,200

94% savings

$ npm install botbrowser

$ pip install botbrowser

No API key. No server. No config. Just install and extract.

Three lines to clean markdown

JavaScript

import { extract } from 'botbrowser';

const result = await extract('https://example.com');
console.log(result.content);
// # Article Title
// Clean markdown content...

console.log(result.metadata.tokenSavingsPercent);
// 94

Python

from botbrowser import extract

result = extract("https://example.com")
print(result.content)
# # Article Title
# Clean markdown content...

print(result.metadata.token_savings_percent)
# 94

How it works

URL → Fetch → Extract → Clean → Markdown

🌐

Step 1

Fetch

Smart HTTP with user-agent rotation, redirect handling, timeouts

🔍

Step 2

Extract

Identifies main content using Readability (JS) / Trafilatura (Python)

✨

Step 3

Clean

Strips scripts, styles, ads, nav, footers, cookie banners, tracking

📄

Step 4

Convert

Clean Markdown preserving headings, lists, links, tables, code blocks

Why BotBrowser?

Purpose-built for LLM agents that need to read the web.

Token-first

Built specifically to minimize LLM token usage. Every design decision optimizes for fewer tokens while preserving meaning.

Dual native SDKs

Real implementations in both JS and Python, not thin wrappers. Use whichever fits your stack.

Zero setup

npm install or pip install. No API key, no account, no server to run. Works offline.

Battle-tested extraction

Mozilla Readability and Trafilatura — the same engines powering Firefox Reader View and academic web research.

Open source

MIT licensed. Self-host, fork, embed, do what you want. No vendor lock-in.

MCP ready

Hosted MCP server for AI agents at scale. JS rendering, batch processing, search + extract.

Pricing

Start free with open source. Scale with the hosted service.

Open Source

Free forever

Self-hosted extraction engine

✓ npm install / pip install
✓ 100% local, no server
✓ Markdown + text output
✓ Link extraction
✓ Works offline
✓ MIT licensed

Install Now

Pro

$29 /mo

Hosted MCP server for agents at scale

✓ 3,000 requests/day
✓ Render JavaScript pages (300/day)
✓ Summarize to a token budget
✓ Target specific page sections
✓ Browse up to 10 pages at once
✓ Search the web + extract results
✓ 24-hour caching
✓ Email support

Get API Key

Enterprise

$199 /mo

Custom infrastructure + SLA

✓ Custom request limits
✓ All Pro features
✓ Priority queue
✓ Custom caching policy
✓ Webhook callbacks
✓ Dedicated support
✓ 99.9% SLA
✓ Audit logs

Token-efficient web browser for LLM agents