Integrate screengrasp into your applications
Access powerful click position prediction models through our simple API.
POST https://screengrasp.onrender.com/api/createAnalysisTask
Creates a new image analysis task and returns a task ID.
image - The screenshot file to analyze
taskDescription - Description of what to find/click in the image
mechanism (optional) - The model to use. Options:
  screengrasp2 (default) - Reasoning Click Prediction Model using an ensemble approach
  llabs - CUA-NAV model by LLABS; recommended for best accuracy
  anthropic-computer-use - Advanced computer interaction model
  gpt4o-with-omniparser - Microsoft's interface parsing solution with GPT-4o
  gpt4o - Direct GPT-4o model
  molmo - Allen AI's click prediction model
  gemini-flash-exp - Experimental Gemini Flash model
  gemini-flash-thinking - Gemini Flash model with thinking steps
  uitars-chute - UI-TARS model with Chute integration
  qwen-vl - Qwen VL model
  qwen-max - Qwen Max model
  qwen25-vl-72b - Qwen2.5-VL 72B model
GET https://screengrasp.onrender.com/api/getTaskStatus/:taskId
Retrieves the current status and results of an analysis task.
status - Current task status:
  QUEUED - Task is waiting in queue
  COMPLETED - Analysis finished successfully
  FAILED - Analysis failed
predictedClickPosition - When completed, contains the click coordinates
error - Error message if status is FAILED
Poll this endpoint until the status is COMPLETED or FAILED.
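The two endpoints above can be combined into a create-then-poll flow. The following is a minimal Python sketch using only the standard library, not an official client. Several details are assumptions that this page does not confirm: that the image is uploaded as multipart/form-data, that the create response is JSON with the task ID under a key named taskId, and that a 2-second polling interval is reasonable. Authentication is not described here, so it is left out. The helper names (encode_multipart, create_task, fetch_status, wait_for_result) are hypothetical.

```python
import json
import time
import urllib.request
import uuid

BASE_URL = "https://screengrasp.onrender.com/api"


def encode_multipart(fields, file_field, filename, file_bytes):
    """Build a multipart/form-data body (ASSUMED upload format)."""
    boundary = uuid.uuid4().hex
    parts = []
    for name, value in fields.items():
        parts.append(
            (f"--{boundary}\r\n"
             f'Content-Disposition: form-data; name="{name}"\r\n\r\n'
             f"{value}\r\n").encode()
        )
    parts.append(
        (f"--{boundary}\r\n"
         f'Content-Disposition: form-data; name="{file_field}"; '
         f'filename="{filename}"\r\n'
         "Content-Type: application/octet-stream\r\n\r\n").encode()
        + file_bytes + b"\r\n"
    )
    parts.append(f"--{boundary}--\r\n".encode())
    return b"".join(parts), f"multipart/form-data; boundary={boundary}"


def create_task(image_path, task_description, mechanism=None):
    """POST the screenshot and description; return the new task ID."""
    fields = {"taskDescription": task_description}
    if mechanism is not None:  # omit to let the server use its default model
        fields["mechanism"] = mechanism
    with open(image_path, "rb") as f:
        body, content_type = encode_multipart(
            fields, "image", "screenshot.png", f.read())
    request = urllib.request.Request(
        f"{BASE_URL}/createAnalysisTask",
        data=body,
        headers={"Content-Type": content_type},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        # ASSUMPTION: the response is JSON and the ID key is "taskId".
        return json.load(response)["taskId"]


def fetch_status(task_id):
    """GET the current status payload for a task."""
    with urllib.request.urlopen(f"{BASE_URL}/getTaskStatus/{task_id}") as response:
        return json.load(response)


def wait_for_result(task_id, fetch=fetch_status, interval=2.0, max_polls=60):
    """Poll until the task is COMPLETED or FAILED, or give up after max_polls."""
    for _ in range(max_polls):
        payload = fetch(task_id)
        status = payload.get("status")
        if status == "COMPLETED":
            return payload.get("predictedClickPosition")
        if status == "FAILED":
            raise RuntimeError(payload.get("error") or "analysis failed")
        time.sleep(interval)  # any other status (e.g. QUEUED): keep waiting
    raise TimeoutError(f"task {task_id} did not finish after {max_polls} polls")
```

A typical call would be task_id = create_task("screenshot.png", "Click the Submit button", mechanism="llabs") followed by wait_for_result(task_id). The fetch parameter is injectable so the polling logic can be exercised without network access. The format of predictedClickPosition is not specified on this page, so the sketch returns it unchanged.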
Each API call consumes token credits from your account. Average token usage:
Token estimation calculator coming soon!