We want the llm to be able to somehow follow your activity on the screen (when you turn on that mode) and then guide you through doing something. eg: Like creating a new VM on GCP