The AI companion that sees what you see on Windows
Copilot Vision on Windows is a new feature that lets your AI companion see and understand your screen in real-time. It provides contextual guidance across multiple apps and uses "Highlights" to show you how to complete tasks.
Hi everyone!
I think a lot of our efforts in AI are ultimately about finding ways for it to instantly and seamlessly understand the user's context. The screen is often the primary information carrier we're dealing with, and I know from my own tests that having an AI assistant share your screen and interact via voice is a completely new kind of productivity experience.
Microsoft's new Copilot Vision for Windows explores this idea directly. When enabled, it can see what you see across apps and provide real-time insights. It can even use a "Highlights" feature to show you exactly where to click to complete a task.
This really pushes forward a new product philosophy: how do we make this kind of experience smoother, and truly let the AI act as an extension of the user's own intent? It's a fascinating direction to watch.
P.S. It's currently available in the US for Win 10 & 11.