48 results for “topic:browser-agent”
The SOTA Open-Source Browser Agent for autonomously performing complex tasks on the web
AI Browser Agent is an advanced Browser AI tool developed by Oxylabs AI Studio that automates real user browsing tasks using natural language instructions.
Browser4: a lightning-fast, coroutine-safe browser for your AI.
This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use" (ACL 2025 Oral).
High-performance web crawler API optimized for LLMs. Turn any search or website into clean Markdown using remote browsers. Firecrawl alternative
Autonomous web browser agent that audits performance, functionality & UX for engineers and vibe-coding creators. 网站自主评估测试 Agent,一键完成性能、功能使用与交互体验的测试评估
Open‑source alternative to Perplexity Comet, director.ai and firecrawl combined
✉️ Use the power of browser-use to contact any person or organization... by any means necessary
A smart AI agent that controls your browser with natural language, using Playwright and advanced LLMs to navigate, analyze, and perform tasks.
Real-time Google Search API for AI Agents & RAG pipelines. Get structured SERP data instantly using remote browsers.
OpenBrowser is an open-source, AI-native browser built on Chromium — a truly privacy-first alternative to ChatGPT Atlas, Perplexity Comet, and Dia.
Auto-Browse: AI Enabled Browser Automation
Trajectory Recording and Capture Environments
The Anal-Queen of AI Browser Automation 🏴☠️ A beautifully fucked-up Skynet-powered browser automation script that harnesses neural brainfuck and machine learning chaos to give zero shits about anything while somehow still working perfectly.
CLI + SDK to automate, scrape, and extract from the web — for AI agents and humans. Cloud or local browser, one command.
Build your own AI operators like OpenAI
Screen recording and computer interaction capture tool that records keyboard/mouse input, screen video, DOM snapshots, and accessibility trees. Perfect for creating datasets to train and evaluate computer-use AI models.
Screen recording and computer interaction capture tool that records keyboard/mouse input, screen video, DOM snapshots, and accessibility trees. Perfect for creating datasets to train and evaluate computer-use AI models.
User-Agent information harvester
Session management library for Playwright remote browsers. Handles pooling, health checks, and auto-recovery for high-frequency scraping.
Turn your existing webpages into an MCP server for agent control
Agent-CE is a containerized continuous evaluation (CE) platform for web browsing agents. It provides production-ready Docker images and CI/CD pipelines for running and evaluating multiple agent frameworks including Browser Use, Notte, Anthropic Computer Use, and OpenAI Computer Use.
Heybro transforms Chrome into an intelligent AI agent that executes browser tasks through natural language commands. Powered by Google Gemini, this open-source side-panel extension interprets your requests, analyzes page DOM structures, and autonomously performs clicks, form fills, and multi-page navigation— eliminating manual browser interactions
Serverless AI browser agent
Crab-Agent is an LLM-powered Chrome extension that automates browser tasks using natural language commands.
Antibot Browser Agent
Perplexity Comet Alternative. Chrome extension for browser automation, multi-tab chat, video analysis, and more. Powered by @dom-engine
This dataset contains 3,167 completed tasks of human-computer interactions captured with video, screenshots, DOM snapshots, and detailed interaction events. Created by Paradigm Shift AI for advancing computer use AI agent research.
G-Coder is a command-line AI agent designed to be your partner in software development, DevOps, and system administration. Built on the powerful and fluid [Google Agent Development Kit (ADK)](https://google.github.io/adk-docs/), G-Coder is engineered for speed, reliability, and effectiveness.
Recurrsive Context Agent