The Open-Source AI Browser Agent

WebBrain is a free, open-source browser extension that brings AI agent capabilities to Chrome and Firefox. Read pages, extract data, and automate web tasks — powered by your choice of LLM. The self-hostable alternative to proprietary browser AI plugins.

Install Extension View on GitHub
https://example.com/products

Product Catalog

🧠 WebBrain.one
Extract all product names and prices from this page
Reading page
Extracting data
Found 24 products. Here are the results:
1. Widget Pro — $29.99
2. Super Gadget — $49.99
3. MegaTool X — $19.99
...and 21 more
Ask anything about this page...
Demo

Watch WebBrain in action

See how WebBrain reads pages, extracts data, and automates browser tasks.

Features

Everything you need in a browser AI

A full-featured AI agent that lives in your browser sidebar and understands any web page.

📖

Page Understanding

Reads and comprehends any web page — articles, docs, dashboards, forms. Ask questions and get instant answers from the current page content.

🤖

Full Browser Agent

Click, type, scroll, navigate, and interact with pages on your behalf. Automate repetitive tasks with natural language instructions.

📊

Data Extraction

Extract structured data from any page — tables, lists, links, forms. Export product catalogs, search results, or any page content. Works with PDFs.

🔌

Multi-Provider LLM

Works with local llama.cpp, OpenAI, Claude, and OpenRouter. Use your preferred model — or run completely offline with local AI.

🛡️

Privacy First

Your data stays yours. Run with a local LLM for zero data leakage. No telemetry, no tracking, no accounts required. Fully open-source.

Smart Context

Automatic context management prevents token overflow. Intelligently trims conversation history and limits tool output for smooth, uninterrupted sessions.

👁️

Dedicated Vision Model

Pair a fast text-only model for planning with a separate vision-capable model for reading screenshots. Cheaper and faster than using a single big multimodal model for everything.

👤

Profile Auto-fill

Optional plaintext bio — name, work email, company, a throwaway password — lets the agent breeze through low-stakes signup forms without asking every time. Off by default, all stored locally.

🍪

Cookie & Paywall Aware

Dismisses consent banners (OneTrust, Cookiebot, Didomi, Quantcast) before reasoning about the page. Detects paywalls and tells you honestly instead of fabricating article content or trying to bypass them.

🧩

Optional CAPTCHA Solver

Plug in a CapSolver API key and the agent will auto-solve reCAPTCHA v2/v3, hCaptcha, and Cloudflare Turnstile when they block a step — instead of stopping to ask. Off by default, BYO key, no captcha service is shipped or contacted unless you turn it on.

🌐

Multilingual UI

The plugin ships in English, Español, Français, Türkçe and 中文. Auto-detects your browser language on first use; switch anytime from the globe icon in the side panel. The marketing site is localized to match.

💰

Token-Conscious

Screenshots are resized and iteratively JPEG-compressed before they leave your machine, keeping image tokens small. Smart context trimming and tool-output caps keep cloud bills predictable — no surprise spend on long sessions.

LLM Providers

Bring your own AI

Connect to any OpenAI-compatible API or run a local model. Switch providers anytime from the extension settings.

🦙
llama.cpp
Ollama
OpenAI
Claude
OpenRouter
LM Studio
vLLM
Grok
Gemini
DeepSeek
Mistral
Get Started

Install WebBrain

Available for Chrome and Firefox. Free, open-source, no account required.

🌐

Chrome & Chromium

Manifest V3 · Chrome 116+ · Also works with Brave, Edge, Opera, Vivaldi, and other Chromium-compatible browsers.

🦊

Firefox

Manifest V2 · Firefox 109+

Why WebBrain?

How does WebBrain compare?

WebBrain sits at the intersection of browser-native AI plugins and full agent frameworks. Here's how it stacks up.

vs. Browser AI Plugins

Feature WebBrain Claude in Chrome
Open SourceMIT LicenseProprietary
PriceFree foreverRequires Claude Pro ($20/mo)
Local LLM supportllama.cpp, OllamaNo — Claude only
Multi-providerAll OpenAI-compatible endpointsClaude only
ChromeYes (MV3)Yes
FirefoxYes (MV2)No
Side panel UIYesYes
Ask / Act modesYesSimilar
Fully offlineYes (with local LLM)No — cloud required
Self-hostableYesNo

vs. AI Agent Frameworks (different category)

Aspect WebBrain OpenClaw / Browser-Use / etc.
What is it?Browser extension (end-user tool)Agent framework / SDK (developer tool)
Target userAnyone — no coding neededDevelopers building automations
InstallationOne-click browser installPython/Docker setup required
UIBuilt-in side panel chatNo UI — code or API only
Browser controlContent script (lightweight)CDP / Playwright (full control)
Multi-tab workflowsPer-tab conversationsProgrammable multi-tab orchestration
Headless modeNo — runs in your browserYes — headless automation
ExtensibilityAdd custom LLM providersFull Python SDK, custom tools
Best forDaily browsing AI assistantAutomated scraping / testing pipelines

WebBrain is a browser extension for end users who want an AI assistant while they browse. Agent frameworks like OpenClaw are developer tools for building automated browser pipelines. Different tools for different jobs — and you can use both.

FAQ

Frequently Asked Questions

Is WebBrain a free alternative to Claude's browser plugin?

Yes. WebBrain provides similar AI browser agent capabilities — reading pages, extracting data, clicking buttons, filling forms, and automating multi-step workflows. Unlike Claude's proprietary browser plugin which requires a Claude Pro subscription and only works with Anthropic's models, WebBrain is completely free, open-source (MIT license), and supports multiple LLM providers including local models that run entirely on your machine.

How does WebBrain compare to OpenClaw, Browser-Use, and other AI agent frameworks?

They're different categories of tools. WebBrain is a browser extension — you install it in Chrome or Firefox and chat with it in a side panel, no coding required. Frameworks like OpenClaw and Browser-Use are developer SDKs for building automated browser pipelines in Python, typically using headless browsers and CDP. Think of it this way: WebBrain is for daily browsing with an AI assistant; agent frameworks are for building scraping bots and test automation. You can use both — they complement each other.

Can I use WebBrain completely offline?

Yes. WebBrain's default provider is llama.cpp, which runs a local AI model on your computer. No API keys needed, no internet required for the AI, and no data ever leaves your machine. Just download a GGUF model, start llama-server, and you have a fully private AI browser agent. You can also use Ollama with its OpenAI-compatible endpoint.

Which AI models does WebBrain support?

WebBrain supports four provider types: llama.cpp (any local GGUF model), OpenAI (GPT-4o, GPT-4, etc.), Claude (Claude Opus, Sonnet, Haiku via native API), and OpenRouter (access to 100+ models from various providers). Any OpenAI-compatible API endpoint works, so you can also use services like Together AI, Groq, Mistral, or any local server with an OpenAI-compatible interface.

What's the most recommended model?

As of April 21, 2026, the top recommendation is Qwen 3.6 35B. Why: in our vision benchmark (vision-model-shootout), it outperformed Gemma 4 on screenshot understanding while remaining practical for local inference.

For consumer GPUs, RTX 5090 is ideal, and RTX 4090 is often workable with INT4 AutoRound quantization via Intel/Qwen3.6-35B-A3B-int4-AutoRound.

For max speed, we recommend serving on vLLM. Sample command:

python -u -m vllm.entrypoints.openai.api_server --model Intel/Qwen3.6-35B-A3B-int4-AutoRound --served-model-name qwen3.6-35b --quantization auto --dtype bfloat16 --max-model-len 65536 --max-num-batched-tokens 32768 --max-num-seqs 4 --host 0.0.0.0 --port 8000 --gpu-memory-utilization 0.92 --enable-prefix-caching --enable-chunked-prefill --limit-mm-per-prompt '{"image": 4, "video": 1}' --mm-processor-cache-type shm --reasoning-parser qwen3 --enable-auto-tool-choice --tool-call-parser qwen3_coder --trust-remote-code --allowed-origins '["*"]' --speculative-config '{"method": "dflash", "model": "z-lab/Qwen3.6-35B-A3B-DFlash", "num_speculative_tokens": 15}' --attention-backend flash_attn

DFlash speculative decoding is optional.

I'm getting "Failed to fetch" when connecting to a local LLM server (vLLM, Ollama, llama.cpp) on my network

If your LLM server is on a different machine on your local network (e.g. http://192.168.1.x:8000), Chrome blocks the request unless the server sends CORS headers. The fix depends on your server:

vLLM: Start with --allowed-origins '["*"]' (the value must be a JSON list).
Ollama: Set the environment variable OLLAMA_ORIGINS=* before starting.
llama.cpp: CORS is enabled by default — no changes needed.

If your server runs on localhost (same machine as the browser), CORS is usually not required. The issue only affects cross-machine connections on the local network. Make sure your base URL in WebBrain settings ends with /v1 (e.g. http://192.168.1.47:8000/v1).

Does WebBrain work on Firefox?

Yes. WebBrain ships with both a Chrome version (Manifest V3, using the sidePanel API) and a Firefox version (Manifest V2, using sidebar_action). Both versions have identical features. The Firefox version can be loaded as a temporary add-on for development, or published to addons.mozilla.org for permanent installation.

Can I move the Firefox sidebar from the left to the right, like Chrome's side panel?

Yes — Firefox's sidebar defaults to the left, but you can flip it. Right-click anywhere in the sidebar header and choose Move Sidebar to Right (or use View → Sidebar → Move Sidebar to Right from the menu bar). The position persists across restarts. Chrome's sidePanel defaults to the right and isn't user-movable from the panel itself.

Is WebBrain safe to use? Can it modify web pages?

WebBrain has two modes: Ask mode (default) is read-only and cannot modify anything on the page. Act mode enables full browser agent capabilities (clicking, typing, navigating) but requires explicit user confirmation before activation, and comes with a visible warning banner. You can stop the agent at any time with the Stop button. The extension's source code is fully open for audit on GitHub.

How do I use WebBrain for web scraping and data extraction?

Simply open any web page, open the WebBrain side panel, and ask in natural language: "Extract all product names and prices from this page", "Get all email addresses on this page", or "Summarize this article in bullet points". The AI agent reads the page content, understands the structure, and returns the extracted data. For more complex scraping, switch to Act mode and the agent can navigate between pages, click pagination buttons, and aggregate data across multiple pages.

Does WebBrain call APIs directly, or does it always click through the UI?

By default, WebBrain always goes through the visible UI for any action that creates, modifies, deletes, sends, submits, posts, or buys anything. It will navigate to the page, fill the form, and click the button — exactly the way you would. It refuses to call REST/GraphQL endpoints directly via background fetch() for mutations. This is deliberate: API actions are invisible (you don't see what's being sent), often require separate auth tokens you may not have configured, and have a much larger blast radius than a visible mis-click. UI-first means everything is on screen, in your normal browser session, and stoppable.

For reading data — fetching a README, looking up an issue, comparing prices across sites, checking a status page — WebBrain freely uses background HTTP requests via the fetch_url and research_url tools. Reading is not the same as acting; it doesn't change anything on a remote service, so the safety concerns don't apply.

If you specifically want to allow API mutations for a particular task, type /allow-api at the start of your message (optionally followed by a short task description). This per-conversation override lets WebBrain fall back to API endpoints when the UI is genuinely failing or unworkable, while still preferring UI when UI works. A sticky badge stays visible above the input area while the override is active, and it clears when you reset the conversation.

Can I use it in LM Studio too?

Yes. WebBrain's read-only network tools — fetch_url and research_url — also ship as a standalone LM Studio plugin at webbrain/web-tools. Install with lms clone webbrain/web-tools and toggle it on in any LM Studio chat — any tool-capable model can then call those two tools without you installing the browser extension. Pure Node, no headless browser. Source: lmstudio-plugin/.

Can I switch to another tab while WebBrain is working on a page?

Yes, on Chrome — the agent runs in the background service worker and is bound to the tab it started on, so it keeps clicking, typing, and reading that specific tab even when you move focus elsewhere. Tools that target a tab (CDP click, type, navigate, screenshot) all work on backgrounded tabs in Chrome. The sidebar locks the input while a task is running so you can't accidentally start a second task on the new tab — you'll need to wait or stop the current one. Note that browsers throttle timers and animations on background tabs, so heavily animated sites may respond slightly slower.

On Firefox, the agent will keep running on its original tab too, but auto-screenshots are limited: Firefox's screenshot API can only capture the currently active tab, not a specific tab in the background. WebBrain detects this and skips the screenshot for that turn rather than feeding the model an image of an unrelated page. The agent will continue planning from text-based context until you switch back to its tab.

Avoid actively clicking or typing on the same tab the agent is working on — that creates race conditions where you and the agent fight over the same page. Switching tabs is fine; co-driving the same tab is not.

How does Profile auto-fill work, and is it safe?

Profile auto-fill is an optional feature in Settings → Profile. You enter a short bio — name, work email, company, and a throwaway password for low-stakes signups — and toggle it on. When enabled, WebBrain appends that text to the agent's system prompt so it can fill signup forms without asking every time.

The text is stored in plaintext in your browser's local storage. It is not transmitted to the WebBrain project, but it is sent to whichever LLM provider you have configured on every turn, as part of the system prompt. Off by default.

Do not put passwords for important accounts (Google, Apple, iCloud, banking, work SSO, primary email) here. Those accounts should use 2FA and shouldn't be handed to an agent anyway. A throwaway password you reuse for newsletter signups and free trials is the intended use case.

What does WebBrain do with cookie banners and paywalls?

Cookie banners: WebBrain recognizes consent banners from common frameworks (OneTrust, Cookiebot, Didomi, Quantcast, Google Funding Choices, TrustArc) and dismisses them before reasoning about the page. Priority is "Reject all" / "Reject non-essential" / "Only necessary" when clearly visible; it falls back to "Accept all" rather than disappearing into the "Manage preferences" maze.

Paywalls: WebBrain reports the paywall honestly and tells you what it could actually see (headline, dek, first paragraphs). It does not attempt to bypass paywalls — no archive.today, 12ft.io, cookie clearing, JS disabling, or reader-mode tricks. If you want the full article, log in with a subscription or ask WebBrain to look for free coverage of the same story.

Does WebBrain support a dry-run mode?

As of 7.0.0, not yet. Dry-run mode is planned and already on the roadmap.

How does WebBrain keep cloud LLM bills under control?

Three independent layers:

Token-conscious screenshots. Before any image leaves your machine, WebBrain resizes it (shorter side capped, preserving aspect ratio) and iteratively JPEG-compresses it until it fits a per-turn image-token budget. A 2000×1200 screenshot that would cost you ~1,500 input tokens on GPT-4o gets compressed down to ~300–500 tokens with no practical loss for page-reading tasks. Implemented in _fitImageDimensions with unit tests for the budget math.

Smart context trimming. Conversation history, tool outputs, and inline DOM dumps are bounded per turn and trimmed oldest-first when the active model's context window is approaching full. You won't see a run silently balloon from 10k tokens to 100k because a read_page returned a novel-length article.

Dedicated vision model. Pair a cheap text model (e.g. GPT-4o-mini) for planning and tool calls with a separate vision-capable model (e.g. GPT-4o) only for screenshots, so you don't pay multimodal-model prices on every single turn. Configure under Settings → Vision.

Net result: long sessions with cloud providers stay predictable. For full control, use llama.cpp locally — zero per-token cost.

Can I contribute to WebBrain?

Absolutely! WebBrain is MIT-licensed and welcomes contributions. Check out the GitHub repository for issues, feature requests, and contribution guidelines.

Spread the word, share the love

WebBrain is MIT-licensed and runs entirely in your browser. If it's useful, drop us a star or share it — that's how independent open-source projects get found.