Endpoints
/find-element
Locate an element in a screenshot based on a natural language description
POST
/
find-element
This endpoint takes a screenshot and a natural language description of an element, and returns the bounding box coordinates of the matching element. Coordinates are in pixels relative to the top-left corner of the image.
Request
Natural language description of the element to find (e.g. “Login button”, “Email input field”)
Screenshot image file in PNG or JPEG format
Your API key for authentication