AI Task Completion

Anchor Browser delivers a state-of-the-art 89% Score on the industry-standard benchmark WebVoyager, leveraging browser-use as a core component of the automation capability.

The agent task method

Anchor Browser provides within its SDK the agent.task method that enables natural language control over web browsing sessions. This capability allows you to automate complex web tasks without coding the whole flow.

Looking for Tasks? Visit the Tasks Page.

Code Example

import Anchorbrowser from 'anchorbrowser';

const anchorClient = new Anchorbrowser({
  apiKey: process.env.ANCHORBROWSER_API_KEY
});

const response = await anchorClient.agent.task(
  'Extract the main heading',                     // Required
  {
    taskOptions: {
      url: 'https://example.com',                 // Either sessionId or url is required
      humanIntervention: false,                   // Disable human intervention during task execution (disabled by default)
      detectElements: true,                       // Improves the agent's ability to identify and interact with UI elements
      maxSteps: 40,                               // Maximum number of steps the agent can take
      agent: 'browser-use',                       // browser-use (default), openai-cua, or gemini-computer-use
      provider: 'openai',                         // For browser-use agent only, openai, gemini, groq, azure, xai
      model: 'gpt-5',                             // For browser-use agent only, see model list below
      extendedSystemMessage: 'Focus on extracting the main heading from the page',
      secretValues: {                             // Secret values to pass to the agent for secure credential handling
        API_KEY: 'your-secret-key'
      }
    }
  }
);

console.log(response);

Structured Output

The AI object can also be used to extract structured data from the browser. This is done by providing a JSON schema to the AI object, which will then return the structured data. The following demonstrates using Zod and Pydantic to utilize the structured output capability.

// Create a browser session and get references
const browser = await anchorClient.browser.create();
const context = browser.contexts()[0];
const page = context.pages()[0];
const ai = context.serviceWorkers()[0]; // Get the AI service worker

// Define the expected output structure using Zod schema
const outputSchema = z.object({
  nodes_cpu_usage: z.array(
    z.object({
      node: z.string(),           // Node name
      cluster: z.string(),        // Cluster identifier
      cpu_avg_percentage: z.number(), // CPU usage percentage
    })
  )
});

// Create task payload with structured output schema
const taskPayload = {
  output_schema: z.toJSONSchema(outputSchema);,      // Define expected output structure
  prompt: 'Collect the node names and their CPU average %',
};

// Navigate to the target page
await page.goto("https://play.grafana.org/a/grafana-k8s-app/navigation/nodes?from=now-1h&to=now&refresh=1m");

// Execute the AI task with structured output
const result = await ai.evaluate(JSON.stringify(taskPayload));
console.info(result);

// Clean up browser resources
await browser.close();

Show The Browser AI Object

The Browser AI Object

Anchor Browser comes with an embedded AI component, that allows to control the browser or extract data using natural language. This capability allows to use the browser without any coding.

Code Example

import Anchorbrowser from "anchorbrowser";

const anchor_client = new Anchorbrowser({apiKey: process.env.ANCHORBROWSER_API_KEY});

// Create a browser session
const browser = await anchor_client.browser.create();
const page = browser.contexts()[0].pages()[0];

// Get the AI service worker
const ai = context.serviceWorkers().find(sw => 
  sw.url().includes('chrome-extension://bppehibnhionalpjigdjdilknbljaeai/background.js')
);

await page.goto("http://docs.anchorbrowser.io/", {waitUntil:'domcontentloaded'});

// Use the embedded 'ai' object
const result = await ai.evaluate('Find the last game played by Milwaukee in the NBA and return the result');

await browser.close();
console.log(result);

Show Configuration Options

Configuration Options

The AI agent can be configured with the following parameters:

agent (string): AI agent to use (browser-use, openai-cua, gemini-computer-use). Defaults to browser-use.
secret_values (object): Secret values to pass to the agent for secure credential handling.
human_intervention (boolean): Allow human intervention during task execution.
provider (string): AI provider to use (openai, gemini, groq, azure, xai).
model (string): Specific model to use (see Available Models below).
url (string): Target URL to navigate to before executing the task.
output_schema (object): JSON Schema defining the expected structure of the output data.
max_steps (integer): Maximum number of steps the agent can take (default: 40).
detect_elements (boolean): Enable element detection for better interaction accuracy.
extended_system_message (string): Custom system message to provide additional context or instructions to the agent.
use_vision (boolean): Enable vision capabilities for enhanced visual understanding.

Secret Values

Securely pass credentials and sensitive data to AI agents during task execution. Secret values are not logged and automatically cleaned up after completion.

Learn more about Secret Values →

Available Models For Browser-Use

Hide Available Models

OpenAI
Gemini
Groq
Azure

gpt-5.2

gpt-5

gpt-5-mini

gpt-5-nano

gpt-4o

gpt-4o-mini

gpt-4.1

gpt-4.1-mini

Get Started

Capabilities

Integrations

Additional Details

The agent task method

Code Example

Structured Output

The Browser AI Object

Code Example

Configuration Options

Secret Values

Available Models For Browser-Use

Get Started

Capabilities

Integrations

Additional Details

​The agent task method

​Code Example

​Structured Output

​The Browser AI Object

​Code Example

​Configuration Options

​Secret Values

​Available Models For Browser-Use

The agent task method

Code Example

Structured Output

The Browser AI Object

Code Example

Configuration Options

Secret Values

Available Models For Browser-Use