Google DeepMind is developing an AI-powered computer mouse cursor based on Gemini, aiming to rethink a technology that has remained unchanged for over fifty years.
“We are exploring new AI capabilities that will help the cursor not only understand what it points to but also why it matters to the user,” the company stated in its blog.
This project addresses a common issue: standard AI tools operate in separate windows, requiring users to drag their content into them.
“We aim for the opposite: an intuitive AI that interacts with users across all their tools without interrupting their workflow,” the company declared.
Four Principles of the AI Cursor
Google DeepMind has established four principles that shift the burden of conveying context and intent from the user to the computer, replacing text prompts with more intuitive interaction methods.
Maintaining Workflow
AI functions should work across all applications, so the Google DeepMind cursor prototype is available wherever users perform tasks. For example, users can hover over a PDF file and request a summary in list form to insert into an email.
Another example is hovering over a table of statistics and asking to create a pie chart.
Show and Tell
Modern neural networks require precise instructions, and to get quality responses, users must craft detailed prompts. The AI cursor simplifies this process: it captures visual and semantic context, allowing the model to “see” and understand what is important.
In the experimental system, it is enough to point at a paragraph, an image segment, or a block of code, and the LLM will understand what assistance is needed.
The Power of “This” and “That”
In everyday communication, people rarely use long, detailed phrases. They say things like, “Fix this,” “Move that here,” or “What does this mean,” relying on gestures and shared context.
“An AI system capable of understanding this combination of context, gestures, and speech will allow users to formulate complex requests naturally and concisely without using complicated prompts,” Google DeepMind's blog noted.
Pixels for Interaction
For decades, computers have only tracked gaze direction. AI can understand exactly what the user is pointing at and transform pixels into structured objects for instant interaction.
“A photo with sketches turns into an interactive to-do list, and a still from a travel video becomes a link to book a table at that charming restaurant,” Google DeepMind highlighted.
Implementation
Google DeepMind is integrating these principles into Chrome and the new Googlebook laptop interface.
“Starting today, instead of entering complex queries, you can use the cursor to ask Gemini in Chrome about the part of the webpage that interests you,” the blog stated.
For instance, users can select multiple products on a page and ask to compare them or indicate where to visualize a new sofa in the living room.
In the near future, the company will launch Magic Pointer in Googlebook, allowing users to utilize Gemini with a simple finger movement.
Googlebook
Google introduced a “new category of laptops” based on Gemini's capabilities, merging Android and ChromeOS.
“More than 15 years ago, we introduced the Chromebook—a laptop designed for a cloud-first world. Today, as we transition from OS to intelligent systems, we see an opportunity to rethink the concept of laptops,” the announcement stated.
Googlebooks are designed for Gemini Intelligence and the new AI cursor. The devices feature a “Create Widget” function with voice command support.
Gemini can search the internet for information or connect to Google apps like Gmail and Calendar to create a unified personalized dashboard.
The tool is built on the Android tech stack, ensuring seamless operation across multiple devices: the “Quick Access” feature allows users to view, search, and insert files from their phone to the laptop without manual transfer.
The first Googlebooks are being developed in collaboration with Acer, ASUS, Dell, HP, and Lenovo.
Gemini Intelligence
Gemini Intelligence is a suite of AI features for Android devices. By summer 2026, they will first appear on Samsung Galaxy and Google Pixel, followed by other gadgets.
“This system combines top-notch hardware and innovative software to help you stay one step ahead, proactively solving tasks throughout the day while ensuring data privacy and full control,” the announcement stated.
Some agent capabilities of Gemini were already demonstrated in early 2026 on Samsung smartphones, such as ordering food or a taxi. Soon, the assistant will be able to perform more complex tasks: just take a photo of a tour brochure, and the bot will find similar options on Expedia.
In Chrome, the AI assistant will help find, summarize, and compare content across different pages. The Rambler feature on the Gboard keyboard will allow users to dictate text, highlight important segments, and compile them into a coherent message without filler words and repetitions.
Recall that in April, Google introduced Gemma 4—a new family of open AI models for advanced reasoning and agent workflows.
