> Full docs index: https://docs.steel.dev/llms.txt

# Gemini Computer Use
URL: https://docs.steel.dev/integrations/gemini-computer-use

The Gemini Computer Use integration runs Gemini 3's [vision-based agent loop](/cookbook/topics/computer-use) on a Steel browser session. Gemini takes screenshots through Steel, decides the next action (click, type, scroll), and Steel executes it, so you can automate complex web tasks without writing custom selectors.

It pairs well with Steel's anti-bot capabilities, proxy support, and sandboxed environments.

### Requirements

*   **Gemini API Key**: With access to a Gemini 3 model that supports computer use
*   **Steel API Key**: Active Steel subscription
*   **Runtime**: Python 3.10+ or Node.js 20+

### Connect Steel to Gemini

Steel's `sessions.computer` API takes screenshots and executes actions; pair it with Gemini 3's built-in computer use:

```typescript
import { Steel } from "steel-sdk";

// Read the API key from the environment rather than an undefined constant
const steel = new Steel({ steelAPIKey: process.env.STEEL_API_KEY });
const session = await steel.sessions.create({
  dimensions: { width: 1280, height: 800 },
});

// Take a screenshot via Steel
const { base64_image } = await steel.sessions.computer(session.id, {
  action: "take_screenshot",
});

// Send the screenshot to Gemini with the computer-use tool, then route Gemini's
// returned actions back through `steel.sessions.computer(session.id, { action: ... })`.
```

Full runnable starter: [Steel + Gemini Computer Use recipe →](/cookbook/gemini-computer-use)

### FAQ

### Do I need Playwright or CSS selectors to use Gemini Computer Use with Steel?

No. Gemini's loop is vision-based: Gemini reads screenshots and decides actions like click, type, or scroll, and Steel executes them via its `sessions.computer` API, so no custom selectors are needed.

### How do I connect Gemini Computer Use to a Steel browser session?

Create a session with explicit `dimensions` (the example uses 1280x800), take a screenshot with `steel.sessions.computer(session.id, { action: "take_screenshot" })`, send the `base64_image` to Gemini 3 with its built-in computer-use tool, and route Gemini's returned actions back through `steel.sessions.computer(session.id, { action: ... })`.
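One reason to fix `dimensions` up front is that actions coming back from the model have to be mapped onto real pixels. The sketch below assumes Gemini returns coordinates normalized to a 0-999 grid, as Google's computer-use documentation describes for its computer-use models; verify this for the model you use. `denormalize` and `toSessionPixels` are hypothetical helpers, not part of `steel-sdk`.

```typescript
// Assumption: model coordinates are normalized to a 0-999 grid,
// independent of the screenshot's real size.
const NORMALIZED_GRID = 1000;

type Point = { x: number; y: number };
type Dimensions = { width: number; height: number };

// Scale one normalized value to pixels and clamp it inside the viewport.
function denormalize(value: number, sizePx: number): number {
  const px = Math.round((value / NORMALIZED_GRID) * sizePx);
  return Math.min(Math.max(px, 0), sizePx - 1);
}

// Map a normalized point onto the session's pixel grid.
function toSessionPixels(point: Point, dimensions: Dimensions): Point {
  return {
    x: denormalize(point.x, dimensions.width),
    y: denormalize(point.y, dimensions.height),
  };
}

// Same dimensions as the session created in the example above.
const center = toSessionPixels({ x: 500, y: 500 }, { width: 1280, height: 800 });
console.log(center); // { x: 640, y: 400 }
```

Keeping the session dimensions in one constant and reusing it for both session creation and this mapping avoids clicks landing off-target when the two drift apart.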

### Does Gemini Computer Use work with Steel's proxies, stealth mode, and CAPTCHA solving?

Yes. The integration pairs well with Steel's anti-bot capabilities, proxy support, and sandboxed environments. These are session-creation options (e.g. `useProxy`, `solveCaptcha`, `stealthConfig`) and don't change the agent loop.
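As a rough illustration of why the loop stays unchanged, the sketch below builds the session options separately from any agent code. The option names `useProxy` and `solveCaptcha` come from the answer above; treating them as boolean flags is an assumption, `buildSessionOptions` is a hypothetical helper, and `stealthConfig` is left out because its fields are not described here.

```typescript
// Assumed option shape: field names are taken from this page, but the
// boolean types are an assumption, not the SDK's published typings.
interface SessionOptions {
  dimensions: { width: number; height: number };
  useProxy?: boolean;
  solveCaptcha?: boolean;
}

type AntiBotFlags = Pick<SessionOptions, "useProxy" | "solveCaptcha">;

// Hypothetical helper: merges anti-bot flags into the base session options.
function buildSessionOptions(antiBot: AntiBotFlags = {}): SessionOptions {
  return {
    dimensions: { width: 1280, height: 800 },
    ...antiBot,
  };
}

const options = buildSessionOptions({ useProxy: true, solveCaptcha: true });
// These would be passed at creation time, e.g.:
//   const session = await steel.sessions.create(options);
// The screenshot/decide/execute loop that follows is identical either way.
```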

### Which Gemini model do I need, and how does the loop work?

A Gemini 3 model with computer use; the tool is built in. Steel screenshots the session, Gemini decides the next action, Steel executes it via `sessions.computer`, and the loop repeats until the task completes. Consult the [Steel leaderboard](https://leaderboard.steel.dev/) for the most recent model recommendations.
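The loop described above can be sketched independently of either SDK. In this minimal version, `LoopDeps`, `Action`, and `runAgentLoop` are hypothetical names: `takeScreenshot` and `execute` stand in for calls to `steel.sessions.computer`, and `decide` stands in for a Gemini request with the computer-use tool. The step budget is an added safeguard, not something the integration requires.

```typescript
// Hypothetical shapes: none of these types come from steel-sdk or the Gemini SDK.
type Action = { name: string; args: Record<string, unknown> };

interface LoopDeps {
  // Returns a base64-encoded screenshot of the current session.
  takeScreenshot(): Promise<string>;
  // Asks the model for the next action, or null once it considers the task done.
  decide(task: string, screenshot: string): Promise<Action | null>;
  // Executes a single action in the browser session.
  execute(action: Action): Promise<void>;
}

// Runs screenshot -> decide -> execute until the model stops or the
// step budget runs out. Resolves to the number of actions executed.
async function runAgentLoop(
  deps: LoopDeps,
  task: string,
  maxSteps = 25,
): Promise<number> {
  for (let step = 0; step < maxSteps; step++) {
    const screenshot = await deps.takeScreenshot();
    const action = await deps.decide(task, screenshot);
    if (action === null) return step; // model signalled completion
    await deps.execute(action);
  }
  return maxSteps; // budget exhausted before the model finished
}
```

Injecting the three functions keeps the loop testable with fakes and makes it straightforward to swap in a different model without touching the control flow.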

### Resources

*   [Gemini Computer Use documentation](https://ai.google.dev/gemini-api/docs/computer-use) – Official documentation from Google
*   [Steel Sessions API reference](/api-reference) – Technical details for managing Steel browser sessions
*   [Steel Discord](https://discord.gg/steel-dev) – Get help and share what you build