Browser Use
Open-source Python agent that lets LLMs navigate, fill forms, and extract from the web.
The Browser Use integration connects Steel's browser infrastructure with the Browser Use agent framework, enabling AI models to perform complex web interactions. Agents can navigate websites, fill forms, click buttons, extract data, and complete multi-step tasks – all while leveraging Steel's reliable cloud browsers for execution. This integration bridges the gap between AI capabilities and real-world web applications without requiring custom API development.
Requirements
- Steel API Key: Active Steel subscription
- Python: 3.11+
- Supported models: vision-capable models like GPT-5, Claude Sonnet 4, or Gemini 3 Pro
Connect Steel to Browser Use
Pass Steel's CDP URL into a Browser Use BrowserSession:
Python
from browser_use import Agent, BrowserSessionfrom browser_use.llm import ChatOpenAIfrom steel import Steelclient = Steel(steel_api_key=STEEL_API_KEY)session = client.sessions.create()cdp_url = f"{session.websocket_url}&apiKey={STEEL_API_KEY}"agent = Agent(task="Find the latest news on Steel.dev",llm=ChatOpenAI(model="gpt-5", api_key=OPENAI_API_KEY),browser_session=BrowserSession(cdp_url=cdp_url),)result = await agent.run()
Full runnable starters:
- Connect Browser Use to a Steel cloud browser: Steel + Browser Use recipe →
- Auto-solve captchas with Steel + Browser Use: Captcha auto recipe →
- Hand off captchas to a human via Steel Live View: Manual captcha recipe →
Resources
- Browser Use documentation – Comprehensive guide to the browser-use library
- Browser Use examples – Working example implementations
- Browser Use Discord – Join discussions and get support
- Steel Discord – Get help and share what you build