Google's annual I/O developer conference brought a slew of exciting announcements, primarily focusing on artificial intelligence (AI) capabilities. One of the key highlights is the new Gemini features, including Gemini Live, which is now available in the Gemini app for iOS. This feature allows real-time conversations with Gemini, enabling users to ask questions about what's on their screen or in their surroundings.
Gemini Agent Mode is also on the horizon, which will enable Gemini to perform tasks like finding sports game tickets at an ideal price or locating the right apartment based on specific requirements. Additionally, Gemini will incorporate search history for more personalized results and pull information from other Google apps, offering proactive reminders and tools to help with preparations.
Google is also integrating its latest Gemini 2.5 model into search, offering a dedicated AI mode that uses a query fan-out technique to break down questions into multiple searches. The company has also introduced Imagen 4, a new image generating tool capable of creating more photorealistic images with improved details for hair, fur, and fabric.
Furthermore, Google announced Veo 3, an updated video generation model that can create videos with sound effects, background noise, and dialogue. The company is also bringing AI-powered features to its apps, including Gmail and Docs, where Gemini will be able to scan past emails, look up notes, and view documents in Google Drive to match tone, style, and mimic word choices.
Other notable announcements include Android XR, a platform for building VR headsets, and smart glasses with an in-lens display, cameras, microphones, and speakers, connected to Gemini for live translation, directions, and image recognition. With these updates, Google is pushing the boundaries of what AI can do, making it more accessible and integrated into everyday life.