Research

World Models
Models that predict how systems change under action.

Visual Grounding
Training-free confidence for visual grounding.
#1 on ScreenSpot Pro

Browser Agents
Agents that use browsers to act and gather evidence.
#1 in WebVoyager#1 in AssistantBench

Physical reasoning
Video models predicting physical outcomes.
#2 Physical Reasoning Leaderboard