No More Endless Forms: Gemini's Latest Model Can Use Your Computer; Scroll And Type For You

Anas Ali
Technology
Oct 08, 2025 14:14 pm IST

Read Time: 2 mins

The new Gemini model can control your personal computer. (Photo: Unsplash)

Google may have just cracked the next step of AI evolution by announcing Gemini 2.5 'Computer Use' version, which promises to take the application of artificial intelligence to a whole new level.

Gemini 2.5 Computer Use model is not designed to give replies to prompts or generate low-light, portrait images of your poorly taken selfies. Rather, it is designed to control your personal computer and complete basic front-end tasks.

While chatbots like ChatGPT, Gemini and Grok have evolved greatly to solve and ease many of the backend applications, Gemini 2.5 Computer Use model promises to learn to control your mouse and keyboard in order to complete some of the front-end tasks.

The current generation of AI tools are great for editing or generating pictures, writing letters, resumes or even doing calculations. For a more specific set of roles, it relies on Application Programming Interfaces (APIs).

However, they still cannot carry out more complex tasks that require an individual to identify visual patterns - something that the current generation of models still do not fully understand.

Gemini 2.5 Computer Use aims to solve this gap by actually performing visible actions on the screen. Think of it as your personal assistant which can use your mouse and keyboard. It can fill out your forms, navigate login screens, scroll through your browser and essentially understand everything that is being shown on the screen.

Potential use-cases can go beyond just these. Think of an AI that can submit your documents, book an appointment and solve real-world workflow problems.

Our new Gemini 2.5 Computer Use model can navigate browsers just like you do. 🌐

It builds on Gemini's visual understanding and reasoning capabilities to power agents that can click, scroll and type for you online - setting a new standard on multiple benchmarks, with faster… pic.twitter.com/Fqmov9Kkhb
October 7, 2025
Google has said its model has operated with higher efficiency even with lower latency. The model has also shown a lot of promise on mobile devices.
For now, the model is available in preview stage via the API in Google AI Studio and Vertex AI.
Essential Business Intelligence, Continuous LIVE TV, Sharp Market Insights, Practical Personal Finance Advice and Latest Stories — On NDTV Profit.

No More Endless Forms: Gemini's Latest Model Can Use Your Computer; Scroll And Type For You

News for You

World Book Day 2026 Special: From Memoirs To Motivational Reads — Bollywood Celebs Who Turned Authors

Chronic Burden: Hypertension, Diabetes, Cholesterol Drugs Dominate 40% Of Indian Pharma Landscape

Infosys ADR Falls Nearly 6% As IT Major's Weak FY27 Guidance Spooks Pre-Market Traders

Infosys Q4 Results: Profit Surges 28%, FY27 Growth Seen At 1.5%-3.55%