Open-source benchmark for browser AI agents on daily tasks.
chrome-extension benchmark evaluation dataset browser-automation ai-agents web-agent web-agents everyday-tasks browser-agent llm llm-evaluation agentic-ai computer-use browser-use agent-evaluation ai-agent-benchmark online-tasks chrome-agent real-world-benchmark
-
Updated
May 25, 2026 - Python