computer-use
Skill from oimiragieo/agent-studio
Installation
```bash
npx skills add https://github.com/oimiragieo/agent-studio --skill computer-use
```
Skill Details
Claude Computer Use tool integration for desktop automation. Enables Claude to interact with computer environments through screenshots, mouse control, keyboard input, and application automation in sandboxed environments.
Overview
Desktop automation specialist that enables Claude to interact with computer environments through the Anthropic Computer Use tool API, providing screenshot capture, mouse control, keyboard input, and application automation capabilities. This skill can:
- Configure Computer Use tool with proper display settings
- Execute agent loops for multi-step desktop automation
- Handle coordinate scaling for different screen resolutions
- Implement security best practices for sandboxed execution
- Process screenshots and determine next actions
- Execute mouse actions (click, drag, scroll, move)
- Execute keyboard actions (type, key combinations)
- Wait for UI elements and screen changes
Step 1: Environment Setup
Before using Computer Use, ensure a properly sandboxed environment:
- Docker Container (Recommended):
```bash
# Use Anthropic's reference container
docker run -it --rm \
-e ANTHROPIC_API_KEY=$ANTHROPIC_API_KEY \
-p 5900:5900 -p 8501:8501 \
ghcr.io/anthropics/anthropic-quickstarts:computer-use-demo-latest
```
- Virtual Machine: Dedicated VM with minimal privileges and isolated network
- Never run on host machine with access to sensitive data or credentials
Step 2: Tool Configuration
Configure the computer use tool with display settings:
```javascript
const computerTool = {
type: 'computer_20250124', // or "computer_20251124" for Opus 4.5
name: 'computer',
display_width_px: 1024,
display_height_px: 768,
display_number: 1,
};
```
Resolution Guidelines:
- XGA (1024x768): Default, works well for most tasks
- WXGA (1280x800): Better for wide content
- 1920x1080: Only if needed, may reduce accuracy
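If the actual display is larger than these targets, one option is to derive the tool configuration from the real screen size while preserving aspect ratio. A minimal sketch; the 1280 px cap is an assumption drawn from the guidelines above, not a documented limit:
```javascript
// Derive a tool configuration from the actual display size, capping the
// width at an assumed 1280 px while preserving the aspect ratio.
function toolConfigForDisplay(actualWidth, actualHeight, maxWidth = 1280) {
  const scale = Math.min(1, maxWidth / actualWidth);
  return {
    type: 'computer_20250124',
    name: 'computer',
    display_width_px: Math.round(actualWidth * scale),
    display_height_px: Math.round(actualHeight * scale),
    display_number: 1,
  };
}

// Example: a 2560x1440 display maps to a 1280x720 tool configuration.
const tool = toolConfigForDisplay(2560, 1440);
```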
Step 3: Agent Loop Implementation
The core pattern for computer use is an agent loop:
```javascript
async function computerUseAgentLoop(task, maxIterations = 50) {
const messages = [{ role: 'user', content: task }];
for (let i = 0; i < maxIterations; i++) {
// 1. Call Claude with computer tool
const response = await anthropic.beta.messages.create({
model: 'claude-sonnet-4-20250514',
max_tokens: 4096,
tools: [computerTool],
messages,
betas: ['computer-use-2025-01-24'],
});
// 2. Check if task complete
if (response.stop_reason === 'end_turn') {
return extractFinalResult(response);
}
// 3. Process tool use requests
const toolResults = [];
for (const block of response.content) {
if (block.type === 'tool_use' && block.name === 'computer') {
const result = await executeComputerAction(block.input);
toolResults.push({
type: 'tool_result',
tool_use_id: block.id,
content: result,
});
}
}
// 4. Add assistant response and tool results
messages.push({ role: 'assistant', content: response.content });
messages.push({ role: 'user', content: toolResults });
}
throw new Error('Max iterations reached');
}
```
Step 4: Action Execution
Execute computer actions based on Claude's requests:
```javascript
async function executeComputerAction(input) {
const { action, coordinate, text, scroll_direction, scroll_amount } = input;
switch (action) {
case 'screenshot':
return await captureScreenshot();
case 'left_click':
await click(coordinate[0], coordinate[1]);
return await captureScreenshot();
case 'type':
await typeText(text);
return await captureScreenshot();
case 'key':
await pressKey(text);
return await captureScreenshot();
case 'mouse_move':
await moveMouse(coordinate[0], coordinate[1]);
return await captureScreenshot();
case 'scroll':
await scroll(coordinate, scroll_direction, scroll_amount);
return await captureScreenshot();
case 'left_click_drag':
await drag(input.start_coordinate, coordinate);
return await captureScreenshot();
case 'wait':
await sleep(input.duration * 1000);
return await captureScreenshot();
default:
throw new Error(`Unknown action: ${action}`);
}
}
```
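The helpers above (`click`, `typeText`, `pressKey`, `moveMouse`, `scroll`, `drag`, `captureScreenshot`, `sleep`) are placeholders for your automation layer. Below is one possible sketch for a Linux/X11 sandbox, assuming `xdotool` and `scrot` are installed in the container; the helper names and the `/tmp` path are illustrative:
```javascript
// Hypothetical helpers for a Linux/X11 sandbox with xdotool and scrot installed.
const { execFile } = require('child_process');
const { promisify } = require('util');
const fs = require('fs/promises');
const run = promisify(execFile);

const sleep = ms => new Promise(resolve => setTimeout(resolve, ms));

async function click(x, y) {
  await run('xdotool', ['mousemove', String(x), String(y), 'click', '1']);
}

async function moveMouse(x, y) {
  await run('xdotool', ['mousemove', String(x), String(y)]);
}

async function typeText(text) {
  await run('xdotool', ['type', '--delay', '50', text]);
}

async function pressKey(combo) {
  await run('xdotool', ['key', combo]); // xdotool accepts the same "ctrl+s" syntax
}

async function scroll(coordinate, direction, amount) {
  const button = direction === 'up' ? '4' : '5'; // X11 wheel buttons
  await run('xdotool', ['mousemove', String(coordinate[0]), String(coordinate[1]),
    'click', '--repeat', String(amount), button]);
}

async function drag(from, to) {
  await run('xdotool', ['mousemove', String(from[0]), String(from[1]), 'mousedown', '1',
    'mousemove', String(to[0]), String(to[1]), 'mouseup', '1']);
}

async function captureScreenshot() {
  const path = '/tmp/screenshot.png';
  await run('scrot', ['-o', path]); // -o overwrites the previous capture
  const data = await fs.readFile(path);
  // Return tool_result content blocks, matching the loop in Step 3.
  return [{
    type: 'image',
    source: { type: 'base64', media_type: 'image/png', data: data.toString('base64') },
  }];
}
```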
Step 5: Coordinate Scaling
When the actual display resolution differs from the tool configuration:
```javascript
function scaleCoordinates(x, y, fromWidth, fromHeight, toWidth, toHeight) {
return [Math.round((x * toWidth) / fromWidth), Math.round((y * toHeight) / fromHeight)];
}
// Example: Scale from 1024x768 to actual 1920x1080
const [scaledX, scaledY] = scaleCoordinates(
500,
400, // Claude's coordinates
1024,
768, // Tool configuration
1920,
1080 // Actual display
);
```
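A thin wrapper can apply this scaling to every coordinate field before an action runs. The constants below are placeholders for your configured and actual resolutions:
```javascript
// Map tool-space coordinates (what Claude sees) to real display coordinates.
const TOOL_W = 1024, TOOL_H = 768;   // matches the tool configuration
const REAL_W = 1920, REAL_H = 1080;  // actual display resolution

function scaleActionInput(input) {
  const scaled = { ...input };
  for (const key of ['coordinate', 'start_coordinate']) {
    if (Array.isArray(scaled[key])) {
      scaled[key] = scaleCoordinates(
        scaled[key][0], scaled[key][1], TOOL_W, TOOL_H, REAL_W, REAL_H
      );
    }
  }
  return scaled;
}

// Usage: executeComputerAction(scaleActionInput(block.input));
```
The reverse direction matters too: screenshots captured at the real resolution should be downscaled to the configured size before being returned, or Claude's coordinates will drift.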
Best Practices
- Sandboxed Execution: ALWAYS run in a Docker container or VM with minimal privileges. Never grant access to sensitive data, authentication credentials, or unrestricted internet.
- Human Confirmation: Implement human-in-the-loop confirmation for meaningful actions like form submissions, file deletions, or external communications (see the sketch after this list).
- Prompt Injection Protection: Be aware that malicious content in screenshots can attempt to manipulate Claude. Validate actions against the original task.
- Resolution Consistency: Keep display resolution consistent throughout a session. XGA (1024x768) provides best balance of accuracy and visibility.
- Screenshot After Actions: Always return a screenshot after each action so Claude can verify the result and determine next steps.
- Error Recovery: Implement graceful error handling. If an action fails, capture screenshot and let Claude decide how to proceed.
- Rate Limiting: Add delays between rapid actions to allow the UI to update. Use the wait action when needed.
- Beta Headers: Always include the appropriate beta header for your model version.
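For the human-confirmation practice above, a minimal sketch of an operator gate wrapping the Step 4 executor; which actions count as risky is a policy choice, and this list is illustrative:
```javascript
const readline = require('readline/promises');
const { stdin, stdout } = require('process');

// Wrap the executor so an operator approves risky actions before they run.
// Which actions are "risky" is a policy decision; this set is illustrative.
const RISKY_ACTIONS = new Set(['type', 'key', 'left_click_drag']);

async function executeWithConfirmation(input) {
  if (RISKY_ACTIONS.has(input.action)) {
    const rl = readline.createInterface({ input: stdin, output: stdout });
    const answer = await rl.question(`Allow action ${JSON.stringify(input)}? [y/N] `);
    rl.close();
    if (answer.trim().toLowerCase() !== 'y') {
      // Tell Claude the action was denied instead of silently dropping it.
      return [{ type: 'text', text: 'Action denied by human operator.' }];
    }
  }
  return executeComputerAction(input);
}
```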
Complete Agent Loop Example (Node.js)
```javascript
const Anthropic = require('@anthropic-ai/sdk');
const anthropic = new Anthropic();
// Tool configuration
const computerTool = {
type: 'computer_20250124',
name: 'computer',
display_width_px: 1024,
display_height_px: 768,
display_number: 1,
};
// Main agent loop
async function runComputerUseTask(task) {
console.log(`Starting task: ${task}`);
const messages = [{ role: 'user', content: task }];
let iterations = 0;
const maxIterations = 50;
while (iterations < maxIterations) {
iterations++;
console.log(`Iteration ${iterations}`);
const response = await anthropic.beta.messages.create({
model: 'claude-sonnet-4-20250514',
max_tokens: 4096,
tools: [computerTool],
messages,
betas: ['computer-use-2025-01-24'],
});
// Check for completion
if (response.stop_reason === 'end_turn') {
const textBlocks = response.content.filter(b => b.type === 'text');
return textBlocks.map(b => b.text).join('\n');
}
// Process tool calls
const toolResults = [];
for (const block of response.content) {
if (block.type === 'tool_use' && block.name === 'computer') {
console.log(`Action: ${block.input.action}`);
// Execute action and get screenshot
const screenshot = await executeAction(block.input);
toolResults.push({
type: 'tool_result',
tool_use_id: block.id,
content: [
{
type: 'image',
source: {
type: 'base64',
media_type: 'image/png',
data: screenshot,
},
},
],
});
}
}
messages.push({ role: 'assistant', content: response.content });
messages.push({ role: 'user', content: toolResults });
}
throw new Error('Task did not complete within iteration limit');
}
// Example usage
runComputerUseTask('Open the calculator app and compute 25 * 47')
.then(result => console.log('Result:', result))
.catch(err => console.error('Error:', err));
```
Python Implementation
```python
import anthropic
import base64
from typing import Any
client = anthropic.Anthropic()
COMPUTER_TOOL = {
"type": "computer_20250124",
"name": "computer",
"display_width_px": 1024,
"display_height_px": 768,
"display_number": 1
}
def run_computer_use_task(task: str, max_iterations: int = 50) -> str:
"""Execute a computer use task with agent loop."""
messages = [{"role": "user", "content": task}]
for iteration in range(max_iterations):
print(f"Iteration {iteration + 1}")
response = client.beta.messages.create(
model="claude-sonnet-4-20250514",
max_tokens=4096,
tools=[COMPUTER_TOOL],
messages=messages,
betas=["computer-use-2025-01-24"]
)
# Check for completion
if response.stop_reason == "end_turn":
text_blocks = [b.text for b in response.content if b.type == "text"]
return "\n".join(text_blocks)
# Process tool calls
tool_results = []
for block in response.content:
if block.type == "tool_use" and block.name == "computer":
print(f"Action: {block.input['action']}")
# Execute action in your environment
screenshot_b64 = execute_action(block.input)
tool_results.append({
"type": "tool_result",
"tool_use_id": block.id,
"content": [{
"type": "image",
"source": {
"type": "base64",
"media_type": "image/png",
"data": screenshot_b64
}
}]
})
messages.append({"role": "assistant", "content": response.content})
messages.append({"role": "user", "content": tool_results})
raise Exception("Max iterations reached")
def execute_action(action_input: dict[str, Any]) -> str:
"""Execute computer action and return screenshot as base64."""
action = action_input["action"]
# Implement using your automation framework
# (pyautogui, pynput, or container-specific tools)
if action == "screenshot":
pass # Just capture
elif action == "left_click":
x, y = action_input["coordinate"]
# click(x, y)
elif action == "type":
text = action_input["text"]
# type_text(text)
elif action == "key":
key = action_input["text"]
# press_key(key)
# ... handle other actions
# Capture and return screenshot
return capture_screenshot_base64()
```
Available Actions Reference
```javascript
// Basic actions (all versions)
{ "action": "screenshot" }
{ "action": "left_click", "coordinate": [500, 300] }
{ "action": "type", "text": "Hello, world!" }
{ "action": "key", "text": "ctrl+s" }
{ "action": "mouse_move", "coordinate": [500, 300] }
// Enhanced actions (computer_20250124)
{ "action": "scroll", "coordinate": [500, 400], "scroll_direction": "down", "scroll_amount": 3 }
{ "action": "left_click_drag", "start_coordinate": [100, 100], "coordinate": [300, 300] }
{ "action": "right_click", "coordinate": [500, 300] }
{ "action": "middle_click", "coordinate": [500, 300] }
{ "action": "double_click", "coordinate": [500, 300] }
{ "action": "triple_click", "coordinate": [500, 300] }
{ "action": "left_mouse_down", "coordinate": [500, 300] }
{ "action": "left_mouse_up", "coordinate": [500, 300] }
{ "action": "hold_key", "text": "shift", "duration": 1.0 }
{ "action": "wait", "duration": 2.0 }
// Opus 4.5 only (computer_20251124 with enable_zoom: true)
{ "action": "zoom", "coordinate": [500, 300], "zoom_direction": "in", "zoom_amount": 2 }
```
Docker Reference Container
```bash
# Pull and run the Anthropic reference container
docker pull ghcr.io/anthropics/anthropic-quickstarts:computer-use-demo-latest
# Run with API key
docker run -it --rm \
-e ANTHROPIC_API_KEY=$ANTHROPIC_API_KEY \
-v $(pwd)/output:/home/computeruse/output \
-p 5900:5900 \
-p 8501:8501 \
-p 6080:6080 \
-p 8080:8080 \
ghcr.io/anthropics/anthropic-quickstarts:computer-use-demo-latest
# Access points:
# - VNC: localhost:5900 (password: "secret")
# - noVNC web: http://localhost:6080/vnc.html
# - Streamlit UI: http://localhost:8501
# - API: http://localhost:8080
```
Tool Versions
| Version | Beta Header | Model Support | Key Features |
| ------------------- | ------------------------- | ------------- | ------------------------------------------- |
| computer_20250124 | computer-use-2025-01-24 | Sonnet, Haiku | Enhanced actions (scroll, drag, wait, etc.) |
| computer_20251124 | computer-use-2025-11-24 | Opus 4.5 only | Zoom action (requires enable_zoom: true) |
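Based on this table, an Opus 4.5 configuration would swap the tool type, add the zoom flag, and use the matching beta header. A sketch following the same shape as the Sonnet configuration (the placement of enable_zoom on the tool definition is an assumption):
```javascript
// Hypothetical Opus 4.5 configuration, per the version table above.
const computerToolOpus = {
  type: 'computer_20251124',
  name: 'computer',
  display_width_px: 1024,
  display_height_px: 768,
  display_number: 1,
  enable_zoom: true, // unlocks the "zoom" action
};

// Pass the matching beta header with the request:
// betas: ['computer-use-2025-11-24']
```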
Security Requirements
CRITICAL: Sandboxing is Mandatory
Computer Use provides direct control over a computer environment. NEVER run without proper sandboxing:
- Use dedicated containers/VMs - Never on host machines with sensitive data
- Minimal privileges - No root access, limited filesystem access
- Network isolation - Restrict or block internet access
- No credentials - Never expose API keys, passwords, or tokens in the environment
- Human oversight - Require confirmation for destructive or external actions
Prompt Injection Risks
Malicious content displayed on screen can attempt to manipulate Claude:
- Validate that actions align with the original task
- Implement allowlists for permitted applications/websites (see the sketch below)
- Monitor for suspicious instruction patterns in screenshots
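One way to enforce a URL allowlist is to intercept type actions that look like navigation before executing them; the ALLOWED_HOSTS entries and the detection heuristic are illustrative:
```javascript
// Illustrative allowlist check: block typed URLs outside approved hosts.
const ALLOWED_HOSTS = new Set(['example.com', 'docs.example.com']);

function violatesAllowlist(input) {
  if (input.action !== 'type') return false;
  const match = input.text && input.text.match(/https?:\/\/([^\/\s]+)/);
  if (!match) return false;
  return !ALLOWED_HOSTS.has(match[1].toLowerCase());
}

// In the action loop, reject the action instead of executing it:
// if (violatesAllowlist(block.input)) {
//   toolResults.push({ type: 'tool_result', tool_use_id: block.id,
//     content: 'Blocked: URL is not on the allowlist.' });
//   continue;
// }
```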
Error Handling
| Error | Cause | Resolution |
| ----------------------- | ------------------- | ---------------------------------------- |
| invalid_request_error | Missing beta header | Add betas: ["computer-use-2025-01-24"] |
| tool_use_error | Invalid coordinates | Ensure coordinates within display bounds |
| rate_limit_error | Too many requests | Implement exponential backoff |
| Action has no effect | UI not ready | Add wait action before retrying |
| Wrong element clicked | Coordinate drift | Re-capture screenshot and recalculate |
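For the rate_limit_error row, a simple exponential-backoff wrapper around the API call is usually enough. A sketch, assuming the SDK surfaces an HTTP status on thrown errors; the retry count and delays are arbitrary defaults:
```javascript
// Retry an API call with exponential backoff on rate-limit (429) errors.
async function withBackoff(fn, maxRetries = 5, baseDelayMs = 1000) {
  for (let attempt = 0; ; attempt++) {
    try {
      return await fn();
    } catch (err) {
      const rateLimited = err.status === 429; // assumes SDK errors carry a status
      if (!rateLimited || attempt >= maxRetries) throw err;
      const delay = baseDelayMs * 2 ** attempt;
      console.log(`Rate limited; retrying in ${delay} ms`);
      await new Promise(resolve => setTimeout(resolve, delay));
    }
  }
}

// Usage inside the agent loop:
// const response = await withBackoff(() =>
//   anthropic.beta.messages.create({ /* ... */ })
// );
```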
Integration with Agents
Primary Agents
- developer: Automated testing, UI verification
- qa: End-to-end testing, visual regression
- devops-troubleshooter: System debugging, log inspection
Use Cases
- Automated form filling and data entry
- Application testing and QA automation
- Desktop application interaction
- Browser automation (when headless won't work)
- Legacy system integration
- Visual verification and screenshots
Memory Protocol (MANDATORY)
Before starting:
```bash
cat .claude/context/memory/learnings.md
```
Check for:
- Previous computer use configurations
- Known automation patterns
- Environment-specific settings
After completing:
- New pattern discovered -> .claude/context/memory/learnings.md
- Security concern found -> .claude/context/memory/issues.md
- Architecture decision -> .claude/context/memory/decisions.md
> ASSUME INTERRUPTION: If it's not in memory, it didn't happen.
Related
- Anthropic Documentation: https://docs.anthropic.com/en/docs/build-with-claude/computer-use
- Reference Implementation: https://github.com/anthropics/anthropic-quickstarts/tree/main/computer-use-demo
- API Reference: https://docs.anthropic.com/en/api/computer-use