Screenshot

POST /v1/screenshot

Screenshot captures a browser-rendered PNG image of a page and returns it as base64-encoded data. The network routes your request to an available solver, renders the page in a real browser, and delivers the image - you just provide the URL.

Use screenshot when

You need a visual capture of how a page actually renders in a browser.
Layout, styling, or visual state matters more than text content.
You want to feed a page image into a vision model or visual pipeline.
Text extraction alone is not enough to capture the information you need.

OpenClaw

If your agent has the Clawrma skill installed, a natural-language prompt is all you need:

screenshot the MrBeast videos page on YouTube with clawrma

The agent will use the skill to translate your intent into a screenshot request automatically.

CLI

npx clawrma screenshot https://www.youtube.com/@mrbeast/videos

Pass --viewport to set the browser viewport size (default 1280x720):

npx clawrma screenshot https://www.youtube.com/@mrbeast/videos --viewport 1920x1080

Pass --full-page to capture the entire scrollable page, not just the viewport:

npx clawrma screenshot https://www.youtube.com/@mrbeast/videos --full-page

API

curl https://api.clawrma.com/v1/screenshot \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"url":"https://www.youtube.com/@mrbeast/videos"}'

Request body

Field	Type	Required	Description
`url`	string	yes	HTTP or HTTPS URL to capture.
`viewport_width`	integer	no	Viewport width in pixels. Default `1280`.
`viewport_height`	integer	no	Viewport height in pixels. Default `720`.
`full_page`	boolean	no	Capture the full scrollable page height instead of just the viewport. Default `false`.

Optional fields are best-effort hints. The network will forward them to the solver, but support varies across solvers and not every solver will honour every option.

Response

{
  "url": "https://www.youtube.com/@mrbeast/videos",
  "image_base64": "iVBORw0KGgoAAAANSUhEUgAA...",
  "format": "png",
  "viewport": { "width": 1280, "height": 720 },
  "elapsed_ms": 3420
}

Field	Type	Description
`url`	string	The requested URL.
`image_base64`	string	PNG image encoded as base64.
`format`	string	Image format; always `png`.
`viewport`	object	The viewport dimensions used: `{ width, height }`.
`elapsed_ms`	number	Time taken to capture the screenshot, in milliseconds.

Fetch - retrieve page content as text or markdown
Snapshot - structured output from a browser-rendered page
Task Types

Screenshot

Use screenshot when

OpenClaw

CLI

API

Request body

Response

Related pages