Enhanced Gemini on Android: Now Seamlessly Integrate with Gmail, Messages, YouTube, and More

Google's Gemini: The Future of AI on Android

Google is set to enhance its Gemini, the AI successor to Google Assistant, by leveraging its deep integration with Android's mobile operating system and Google apps. During the Google I/O 2024 developer conference on Tuesday, the company revealed exciting new features that will allow users to access the Gemini overlay more seamlessly within their current apps. Additionally, Android's built-in AI model, Gemini Nano, is set for significant updates.

In the near future, Android users will enjoy the convenience of dragging and dropping AI-generated images into Gmail, Google Messages, and other applications. YouTube viewers can utilize the “Ask this video” feature to extract specific information directly from videos, according to Google’s announcement.

For subscribers of the premium Gemini Advanced service, priced at $19.99 per month, an “Ask this PDF” option will be available. This feature enables users to obtain answers from documents without having to sift through extensive pages. Subscribers also benefit from 2TB of storage and other perks associated with Google One.

Currently, Gemini on Android is capable of various functions, such as generating captions for photos, answering questions about articles, and performing a range of generative AI tasks akin to other chatbots. However, OpenAI recently launched its GenAI model, GPT-4o—where the "o" stands for “omni”—which can process text, speech, and video, including real-time input from a phone's camera. This development indicates that while Gemini holds strong advantages, it will face competition in the mobile AI landscape.

Google announced that the new Gemini features for Android will be rolled out to hundreds of millions of supported devices in the upcoming months. As it evolves, Gemini will begin offering tailored suggestions based on user interactions and what's displayed on their screens.

Meanwhile, the on-device foundation model, Gemini Nano, will receive updates to include multimodal capabilities. This enhancement will enable it to process various types of information, including text, visual input, sounds, and spoken language.

Stay tuned for our upcoming AI newsletter, launching on June 5, where you can keep up with the latest advancements.

Most people like

Find AI tools in YBX

Related Articles
Refresh Articles