HyperAI

3 days ago

Google has integrated native computer use capabilities into Gemini 3.5 Flash, establishing the feature as a built-in tool within the model architecture. Previously accessible only as a standalone component in the Gemini 2.5 series, computer use now operates natively alongside Gemini's existing strengths in function calling and grounded utilities like Search and Maps. This integration enables developers to deploy custom agents capable of observing, reasoning, and executing actions across browser, mobile, and desktop environments. The upgrade targets high-complexity workflows, including long-horizon enterprise automation, continuous software testing, and knowledge work across professional applications. Demonstrations indicate the model can analyze application interfaces to categorize features and audit documentation for accessibility compliance. Security protocols for agentic operations include targeted adversarial training to mitigate prompt injection risks. Google is releasing two optional enterprise safeguard systems designed to require explicit user confirmation for sensitive or irreversible actions and to automatically halt tasks when indirect prompt injection is detected. The company recommends a defense-in-depth approach, advising developers to supplement these safeguards with secure sandboxing, human-in-the-loop verification, and strict access controls. Computer use in Gemini 3.5 Flash is available via the Gemini API and the Gemini Enterprise Agent Platform. A demo environment hosted by Browserbase offers immediate capability testing, with reference implementations and documentation provided for integration.

This news is intelligently aggregated by AI to deliver industry updates efficiently. It does not constitute opinions or advice.

Related Links

Related Links

Related Links

4-step Image output/4K quality/6x Speedup, PiD Uses Pixel Diffusion to Unify Decoding and super-resolution Output; SA-3DAO: a Dataset Containing 1000 Pairs of Real Images Paired With Handcrafted 3D Meshes by artists.

4-step Image output/4K quality/6x Speedup, PiD Uses Pixel Diffusion to Unify Decoding and super-resolution Output; SA-3DAO: a Dataset Containing 1000 Pairs of Real Images Paired With Handcrafted 3D Meshes by artists.

Command Palette

Gemini 3.5 Flash Natively Integrates Computer Use

Related Links

Command Palette

Gemini 3.5 Flash Natively Integrates Computer Use

Related Links

Command Palette

Gemini 3.5 Flash Natively Integrates Computer Use

Related Links

4-step Image output/4K quality/6x Speedup, PiD Uses Pixel Diffusion to Unify Decoding and super-resolution Output; SA-3DAO: a Dataset Containing 1000 Pairs of Real Images Paired With Handcrafted 3D Meshes by artists.

4-step Image output/4K quality/6x Speedup, PiD Uses Pixel Diffusion to Unify Decoding and super-resolution Output; SA-3DAO: a Dataset Containing 1000 Pairs of Real Images Paired With Handcrafted 3D Meshes by artists.