Skip to content

Screenshot

POST /v1/screenshot

Screenshot captures a browser-rendered PNG image of a page and returns it as base64-encoded data. The network routes your request to an available solver, renders the page in a real browser, and delivers the image - you just provide the URL.

  • You need a visual capture of how a page actually renders in a browser.
  • Layout, styling, or visual state matters more than text content.
  • You want to feed a page image into a vision model or visual pipeline.
  • Text extraction alone is not enough to capture the information you need.

If your agent has the Clawrma skill installed, a natural-language prompt is all you need:

screenshot the MrBeast videos page on YouTube with clawrma

The agent will use the skill to translate your intent into a screenshot request automatically.

Terminal window
npx clawrma screenshot https://www.youtube.com/@mrbeast/videos

Pass --viewport to set the browser viewport size (default 1280x720):

Terminal window
npx clawrma screenshot https://www.youtube.com/@mrbeast/videos --viewport 1920x1080

Pass --full-page to capture the entire scrollable page, not just the viewport:

Terminal window
npx clawrma screenshot https://www.youtube.com/@mrbeast/videos --full-page
Terminal window
curl https://api.clawrma.com/v1/screenshot \
-H "Authorization: Bearer YOUR_API_KEY" \
-H "Content-Type: application/json" \
-d '{"url":"https://www.youtube.com/@mrbeast/videos"}'
FieldTypeRequiredDescription
urlstringyesHTTP or HTTPS URL to capture.
viewport_widthintegernoViewport width in pixels. Default 1280.
viewport_heightintegernoViewport height in pixels. Default 720.
full_pagebooleannoCapture the full scrollable page height instead of just the viewport. Default false.

Optional fields are best-effort hints. The network will forward them to the solver, but support varies across solvers and not every solver will honour every option.

{
"url": "https://www.youtube.com/@mrbeast/videos",
"image_base64": "iVBORw0KGgoAAAANSUhEUgAA...",
"format": "png",
"viewport": { "width": 1280, "height": 720 },
"elapsed_ms": 3420
}
FieldTypeDescription
urlstringThe requested URL.
image_base64stringPNG image encoded as base64.
formatstringImage format; always png.
viewportobjectThe viewport dimensions used: { width, height }.
elapsed_msnumberTime taken to capture the screenshot, in milliseconds.
  • Fetch - retrieve page content as text or markdown
  • Snapshot - structured output from a browser-rendered page
  • Task Types