Gemini 3.5 Flash is now capable of viewing and managing your screen, and Google aims for businesses to have confidence in it.

Gemini 3.5 Flash is now capable of viewing and managing your screen, and Google aims for businesses to have confidence in it.

      TL;DR Google has integrated computer use as a built-in feature in Gemini 3.5 Flash, replacing the separate Gemini 2.5 standalone model with enterprise security measures. This new capability allows AI agents to interact with screens by clicking, typing, and scrolling across different devices and browsers, previously necessitating a dedicated model. Now, developers can easily activate computer use as one of several functions in Flash, alongside capabilities like code execution and search.

      The original standalone model was released in October 2025, achieving about 70 percent accuracy on the Online-Mind2Web benchmark. The new integration simplifies the process by merging what were once two separate workflows into one. Google emphasizes that this tool can automate tasks beyond simple chatbot interactions, enabling software testing and assisting knowledge workers with complex browser tasks.

      Google has implemented enhanced safety measures, including targeted adversarial training to combat prompt injection attacks, where malicious content may mislead AI agents. They offer two optional enterprise safeguards: one that requires user confirmation for sensitive actions and another that halts the agent upon detecting a potential prompt injection attempt. Both safeguards are not activated by default, and Google advises a layered approach to security.

      The competitive landscape has evolved, with Anthropic's Claude Computer Use offering cross-platform versatility and enhanced desktop capabilities. Google's Flash seeks to extend similar functions beyond just Chrome. OpenAI has joined the market, leading to competition focused on safe execution within regulated environments.

      There are currently no updated benchmark scores for the integrated tool, nor has Google shared information about enterprise adoption or provided case studies. The Gemini Enterprise Agent Platform offers a pay-as-you-go pricing model, making it potentially more affordable for extensive automation than previous models. However, the effectiveness of this cost advantage is contingent on the complexity of agent workflows and how frequently safety measures require user confirmation.

      In the realm of AI, computer use remains in its infancy. While models can maneuver familiar interfaces, they often struggle with unexpected pop-ups and complex layouts. Google's decision to make this capability a built-in feature indicates a belief in its readiness for general use, yet the optional safety measures acknowledge that it’s not fully prepared for unsupervised operation.

Other articles

Gemini 3.5 Flash is now capable of viewing and managing your screen, and Google aims to gain the trust of enterprises in this feature. Gemini 3.5 Flash is now capable of viewing and managing your screen, and Google aims to gain the trust of enterprises in this feature. Google has integrated computer use as a core feature in Gemini 3.5 Flash, replacing the standalone model and incorporating enterprise safety measures. Runpod achieves a $1 billion valuation following a $100 million funding round. Runpod achieves a $1 billion valuation following a $100 million funding round. Runpod has secured $100 million at a valuation of $1 billion, representing a tenfold increase in two years, and has stated that it declined acquisition offers exceeding $500 million. Google Wallet has the potential to expedite your experience at airport security through TSA PreCheck's Touchless ID feature. Google Wallet has the potential to expedite your experience at airport security through TSA PreCheck's Touchless ID feature. TSA and Google Wallet have introduced a simplified opt-in feature for PreCheck Touchless ID, allowing travelers to share their digital ID and boarding pass, enabling them to pass through security without needing a physical ID. Nvidia's CEO Jensen Huang states that illicit data centers are futile and emphasizes the priority of national security. Nvidia's CEO Jensen Huang states that illicit data centers are futile and emphasizes the priority of national security. Jensen Huang informed shareholders that Nvidia would focus on US national security rather than sales and described smuggled chip data centers as impractical. Micron's revenue increased fourfold as the demand for AI memory has driven gross margins to surpass 81 percent. Micron's revenue increased fourfold as the demand for AI memory has driven gross margins to surpass 81 percent. Micron reported $41 billion in revenue for Q3, marking a fourfold increase compared to the previous year, as HBM4 chips for Nvidia and Google pushed gross margins over 81 percent for the first time. Qualcomm secures Meta as its first identified customer for the Dragonfly data centre chips. Qualcomm secures Meta as its first identified customer for the Dragonfly data centre chips. Qualcomm announced its Dragonfly C1000 data center chip, with Meta being its first specified customer, and confirmed its acquisition of AI startup Modular for $3.9 billion.

Gemini 3.5 Flash is now capable of viewing and managing your screen, and Google aims for businesses to have confidence in it.

Google has integrated computer usage into Gemini 3.5 Flash, moving away from the standalone model and incorporating enterprise safety measures.