Sekėjai

Ieškoti šiame dienoraštyje

2026 m. gegužės 20 d., trečiadienis

Google Unveils Gemini AI Agent For Life Tasks


“Google is supercharging its Gemini artificial-intelligence model to become more competitive in the era of agentic AI.

 

The company has started rolling out what it calls Gemini Spark, a personal agent it says is capable of navigating a user's digital life and acting on his or her behalf. The agent will work across many of Google's products and run on the company's cloud infrastructure.

 

"We've been bringing agents to developers and enterprises for a while now," said Sundar Pichai, chief executive of Google parent Alphabet. "We are super focused on bringing that frontier capability of agents safely and securely to consumers so that they work for everyone."

 

The company has been testing Spark with a limited number of users and plans to make it available next week to those who pay for AI Ultra, a new subscription tier that costs $100 a month.

 

The debut of Spark, which Google announced at its annual developer's conference on Tuesday, comes as AI upstarts including Anthropic and OpenAI work to make their agents more capable of making choices without explicit human direction. In the workplace, agents are most often deployed to complete tasks such as writing code, summarizing text and analyzing data. AI researchers are working to make them more useful for personal tasks.

 

Google also announced the rollout of Gemini Omni, a tool to create videos from various inputs including text, audio and images.

 

"It can create videos that are very, very high level, and you can interact with that video afterwards," said Koray Kavukcuoglu, Google's chief AI architect and chief technology officer of the company's DeepMind unit. "You can imagine that this builds very much a tutorial-like capability when you are exploring something."

 

High-quality videos have historically been difficult for AI tools to generate, and the process requires a large amount of computing power. OpenAI this year scrapped Sora, its video-generation tool, in part to free up more computing resources for other projects.

 

Shares of Alphabet have surged since the November launch of Gemini 3, a powerful AI model that vaulted Google ahead in the AI race. OpenAI's ChatGPT is still by far the most popular AI chatbot and Anthropic's Claude is regarded as one of the best models for coding, but investors widely view Google as a formidable competitor.

 

News Corp, owner of The Wall Street Journal, has a commercial agreement to supply content on Google platforms.

 

Google on Tuesday introduced Gemini 3.5 Flash, a model it says is well suited for coding and agentic work and runs four times faster than other frontier models. Gemini 3.5 Pro, a stronger but slower model, will roll out in the coming weeks.

 

Alphabet, shares of which are up roughly 25% this year, has recently come close to hitting a $5 trillion valuation. It would be the second company after Nvidia to cross that threshold.

 

Growth in the company's cloud division, which sells access to AI platforms and computing power, has powered much of those gains. The cloud unit had $20 billion in first-quarter revenue, a 63% increase from the same period in 2025.” [1]

 

1. Google Unveils Gemini AI Agent For Life Tasks. Blunt, Katherine; Winkler, Rolfe.  Wall Street Journal, Eastern edition; New York, N.Y.. 20 May 2026: B4.  

Komentarų nėra: