Computer use

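Model-native browser control: the LLM is given the screen as input (typically a screenshot) and emits UI actions such as clicks and keystrokes directly, with no hand-written automation script in between.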
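
The observe-then-act loop this implies can be sketched as follows. This is a minimal illustration, not any vendor's API: `Action`, `FakeBrowser`, `run_episode`, and `scripted_model` are all hypothetical names, the browser is a stub that only records actions, and the "model" is a scripted function standing in for a real LLM call.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Action:
    """One step emitted by the model (hypothetical schema)."""
    kind: str          # "click", "type", or "done"
    x: int = 0
    y: int = 0
    text: str = ""


class FakeBrowser:
    """Stand-in for a real browser; records actions instead of rendering."""

    def __init__(self) -> None:
        self.log: List[Action] = []

    def screenshot(self) -> bytes:
        # A real implementation would return encoded pixels of the viewport.
        return f"frame-{len(self.log)}".encode()

    def apply(self, action: Action) -> None:
        # A real implementation would dispatch a mouse or keyboard event.
        self.log.append(action)


def run_episode(
    model: Callable[[bytes], Action],
    browser: FakeBrowser,
    max_steps: int = 10,
) -> int:
    """Observe the screen, ask the model for an action, apply it, repeat.

    Returns the number of actions applied before the model signalled "done"
    (or max_steps if it never did).
    """
    for step in range(max_steps):
        action = model(browser.screenshot())
        if action.kind == "done":
            return step
        browser.apply(action)
    return max_steps


def scripted_model(frame: bytes) -> Action:
    """Placeholder for an LLM call: chooses an action from the frame label."""
    script = {
        b"frame-0": Action("click", x=120, y=48),
        b"frame-1": Action("type", text="hello"),
    }
    return script.get(frame, Action("done"))


if __name__ == "__main__":
    browser = FakeBrowser()
    steps = run_episode(scripted_model, browser)
    print(steps, [a.kind for a in browser.log])
```

The `max_steps` cap is a design choice of this sketch: because the model decides when to stop, a loop like this generally needs some external bound so a confused model cannot act indefinitely.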