manny2

vllm hid runner
log

  
latest screenshot
screenshot
run history
startedtaskstatus
testing center — service health
probes every service Manny2 depends on. green = ok, red = broken. click a row to see detail.
live SHM frame — what manny2 sees right now (click to refresh)
shm frame
tools — call any service directly
pick an endpoint, fill in params if needed, click call. raw request URL + response body shown below.
🧠 parse current frame with omniparser (canonical)
runs Microsoft's canonical OmniParser pipeline (YOLO + easyocr + Florence-2) on the live SHM frame. ~0.8s warm. returns text + icon elements with bboxes and labels.
🎯 ask UI-TARS to locate something on screen
natural language → vision LLM finds the pixel. e.g. "the Inbox folder in the left sidebar", "the Send button"
vault