At the Google I/O 2024 event on May 14, Alphabet CEO Sundar Pichai introduced a slew of new AI capabilities and updates to several Google services. The discussion centred on how these improvements, notably the Gemini AI, may alter consumer interactions with Google products. These are all of Sundai Pichai’s significant announcements from the Google I/O 2024 event.
Ask Photos Feature: Gemini Enhances Google Photos
One of the standout features announced is the “Ask Photos” functionality in Google Photos. Using Gemini, users can now find specific images by simply prompting the chatbot. For instance, you could ask it to locate a photo containing a license plate number.
Gemini 1.5 Flash: Efficient and Cost-Effective AI
Google introduced Gemini 1.5 Flash, a lightweight, multi-modal AI model. It’s optimized for tasks that require high frequency and low latency, such as summarizing text, chatting, captioning images and videos, and extracting data from lengthy documents and tables.
Gemini Integration in Google Workspace
Soon, Google Workspace apps including Docs, Gmail, Meet, and Drive will be able to connect to Gemini 1.5 Pro. For instance, the AI in Gmail can quickly summarise and reply to emails. Starting in June, Gemini Advanced members will have access to these capabilities.
Gemini 1.5 Pro: Enhanced Context Window
Up to two million tokens can be handled by the bigger context window included in the Gemini 1.5 Pro model. With the help of this private preview capability, the AI can handle difficult jobs like examining large codebases, documents and videos.
Introducing Project Astra
Equally fascinating was the unveiling of Project Astra, a multimodal AI assistant. This AI uses your device’s camera to help locate misplaced items and do numerous daily tasks, demonstrating useful applications for AI in everyday life.
Personalise Your Gemini Assistant
In addition, Google unveiled “Gems,” a feature that lets customers customise Gemini assistants. The demeanour, tone and specialisations of the assistant may be altered by users to fit their own preferences.
Veo: AI Video Generation Tool
Veo, a new artificial intelligence tool that can produce minute-long, 1080p videos with excellent quality. It makes it simpler to produce visually engaging videos by comprehending cues pertaining to variations and cinematic effects.
Gemini Nano Integration with Chrome
Gemini Nano will be integrated into Chrome on desktop, functioning as an on-device assistant. It will assist in generating text and performing tasks like autofill.
AI-Powered Assistant on Android
Android devices will soon feature a new AI-powered assistant, enhancing creativity and productivity. This assistant includes features like “Ask this video” or “Ask this PDF,” working seamlessly with Gmail, Google Messages, and YouTube. It also includes scam detection capabilities.
Trillium: Sixth-Generation Google Cloud TPU
Google announced the Trillium, the sixth generation of Google Cloud TPU. This new chip provides improved computing performance and high bandwidth memory, powering the next generation of AI models.
Circle to Search Expands
The Circle to Search tool can now accurately answer arithmetic problems that are displayed on the screen thanks to its improved capabilities. By the end of the year, more devices are going to have this feature.
SynthID AI Watermarking
Finally, in order to guarantee that movies created by AI are appropriately tagged and distinguishable, Google is extending the use of its SynthID AI watermarking technology to Veo videos.