Google Introduces Gemini Live Voice Assistant to Compete with ChatGPT Voice Mode

Google has introduced Gemini Live, a conversational voice assistant designed to compete with ChatGPT's Voice Mode from OpenAI. The feature is available through the Gemini app, initially on Android, and lets users talk to the AI by voice. Gemini Live runs on Google's Gemini 1.5 Flash model and can respond in 10 different generated voices. Users can ask the chatbot to handle tasks such as creating shopping lists and summarizing emails.

Sissie Hsiao, Google's general manager for Gemini experiences and Google Assistant, expressed excitement about Gemini's innovation in personal assistant technology. The focus is on delivering AI-powered mobile assistance that is more intuitive, natural, and conversational. This voice assistant allows users to communicate with the chatbot while switching between apps or even when their phone is locked, making it feel like a regular phone call.

Currently, Gemini Live is accessible in English to Gemini Advanced subscribers on Android devices, with plans to expand to iOS and more languages in the near future. Gemini Advanced offers a one-month free trial, followed by a subscription fee of $20 per month. Subscribers gain access to the Gemini 1.5 Pro model, increased input length, additional storage, integration with Workspace applications, and the capability to upload files for the chatbot to engage with.

Beyond voice support, Google is extending Live to work with other Google apps such as YouTube Music, where the chatbot can create playlists from voice commands. Future updates will add calendar support, enabling users to set reminders through their calendar app. Google also aims to improve Live's speed and response quality while using the large context window of the 1.5 Flash model to process sizable inputs efficiently.

While OpenAI upgrades ChatGPT's audio features with GPT-4o for Plus subscribers, Gemini Live reflects Google's own ongoing development efforts. Despite the comparisons to ChatGPT Voice Mode, Google has been exploring similar technology for some time. Gemini Live offers a glimpse of that research, following the conversational agent previewed at I/O as part of Project Astra. Further updates and improvements are expected in the coming weeks.
