POST
/
find-element

This endpoint takes a screenshot and a natural language description of an element, and returns the bounding box coordinates of the matching element. Coordinates are in pixels relative to the top-left corner of the image.

Request

element_description
string
required

Natural language description of the element to find (e.g. “Login button”, “Email input field”)

image_data
file
required

Screenshot image file in PNG or JPEG format

api_key
string
required

Your API key for authentication

Example Request

curl -X POST https://u3mvtbirxf.us-east-1.awsapprunner.com/find-element \
  -F "element_description=Login button" \
  -F "image_data=@screenshot.png" \
  -F "api_key=your_api_key"

Example Response

{
  "x1": 100,
  "y1": 200,
  "x2": 200,
  "y2": 250
}