Overview

Claude Computer Use enables AI to control browsers through vision and reasoning in a continuous action loop.

Overview

The Claude Computer Use integration connects Claude 3.5 Sonnet (and newer models) with Steel's browser infrastructure. This enables:

  • Control Steel browser sessions via Claude's Computer Use API

  • Execute browser actions like clicking, typing, and scrolling

  • Automate complex web tasks and multi-step workflows

  • Process visual feedback from screenshots

  • Implement human verification for sensitive operations

Combining Claude's Computer Use with Steel gives you reliable automation with anti-bot capabilities, proxy support, and sandboxed environments.

Requirements & Limitations

  • Anthropic API Key: Access to Claude 3.5 Sonnet or newer models

  • Steel API Key: Active subscription to Steel

  • Python or Node.js Environment: Support for API clients for both services

  • Supported Environments: Works best with Steel's browser environment

  • Beta Status: Computer Use is currently in beta with some limitations

Documentation

Quickstart Guide (Python) → Step-by-step guide to building Claude Computer Use agents with Steel sessions in Python.

Quickstart Guide (Node.js) → Step-by-step guide to building Claude Computer Use agents with Steel sessions in TypeScript & Node.js.

Additional Resources

Anthropic Computer Use Documentation - Official documentation from Anthropic

Steel Sessions API Reference - Technical details for managing Steel browser sessions

Cookbook Recipe (Python) - Working, forkable examples of the integration in Python

Cookbook Recipe (Node.js) - Working, forkable examples of the integration in Node.js

Community Discord - Get help and share your implementations