Generative AI in Search, Gemini Live, Google Veo, and More
Google’s enthusiasm for AI was evident at the I/O developer conference, where the technology featured prominently throughout the event. The company introduced a range of exciting new AI products and features aimed at enhancing user experiences and making everyday tasks easier. From innovative search capabilities to advanced video creation tools, Google showcased how AI can transform the way we interact with technology. From cutting-edge search enhancements to advanced video creation tools, Google’s announcements demonstrated the transformative potential of AI. Here are the key highlights the AI Innovation Times Team believes you should know.
Ask Photos
Google Photos is set to launch Ask Photos, an experimental feature powered by the Gemini family of generative AI models. Rolling out this summer, Ask Photos allows users to search their photo collections using natural language queries. Instead of looking for specific items, users can perform broad searches, like finding the “best photo from each of the National Parks I visited.” The AI considers factors like lighting, blurriness, and geolocation to determine the best photos.
Gemini Live and Project Astra
A notable preview was the Gemini Live experience, which enables users to have “in-depth” voice chats with Gemini on their smartphones. Users can interrupt the chatbot to ask clarifying questions, and it adapts to their speech patterns in real-time. Gemini Live can respond to users’ surroundings using photos or video from their smartphones’ cameras. The technology behind Live is part of Project Astra, a new DeepMind initiative for creating AI-powered apps and agents for real-time, multimodal understanding. This feature will launch later this year.
Google Veo
Google introduced Veo, an AI model designed to create 1080p video clips up to a minute long from text prompts. Veo can capture various visual and cinematic styles, including landscapes and time lapses, and make edits to existing footage. The model understands camera movements, VFX, and physics, such as fluid dynamics and gravity, to enhance video realism. It supports masked editing for specific areas of a video and can generate videos from still images. Additionally, Veo can produce longer than one minute videos from a sequence of prompts that tell a story.
Generative AI in Search
Google aims to revolutionize search results pages with generative AI. Depending on the search query, AI-organized pages might display AI-generated summaries of reviews, discussions from social media sites like Reddit, and AI-generated lists of suggestions. Initially, these AI-enhanced results will appear when users search for inspiration, such as trip planning. Soon, they will also be available for dining options, recipes, movies, books, hotels, e-commerce, and more.
Gmail Enhancements
Gmail will soon incorporate Gemini to search, summarize, and draft emails, as well as assist with more complex tasks like processing returns. In a demo, Google showcased how a parent could summarize emails from their child’s school using Gemini. The AI can analyze attachments and provide summaries with key points and action items. From a Gmail sidebar, users can organize receipts, extract information into spreadsheets, and automate frequent workflows.
As Google continues to push the boundaries of AI technology, it’s evident that their commitment to innovation is set to reshape how we interact with the digital world. With these new tools and features on the horizon, users can look forward to more intuitive, efficient, and engaging experiences. Stay tuned as these exciting developments roll out and begin to make their mark on everyday life.
Image – @ AI Innovation Times