Google Gemini Live: How It Gains Voice Capabilities and Enhanced Interaction

Tech

google gemini live with voice

Google unveiled several new Pixel devices at its Made By Google event. Along with the hardware, the tech giant introduced exciting new AI features for its Gemini chatbot. The highlight of the evening was Gemini Live, a major upgrade that adds voice capabilities to the chatbot.

Gemini Live Brings Voice Capabilities

Gemini Live is a new feature that allows users to have conversations with the Gemini chatbot using their voice.

  • Instead of typing or reading messages, users can now talk directly to the AI. This feature is similar to ChatGPT’s advanced Voice Mode, which was recently introduced to some paid subscribers.

Natural and Flexible Conversations

Gemini Live offers a mobile conversational experience where the AI can respond naturally with voice modulations and emotions.

  • Users will have 10 different voices to choose from, each with unique energy levels, pitches, and tonalities.
  • This aims to make the AI’s responses sound more human-like.

Hands-Free Interaction

The new feature supports a hands-free experience. Gemini Live can listen and respond while the device is locked or in the background, similar to a regular phone call.

Users can interact with the AI without needing to keep their device actively in use.

Watch the below video:

Flexible Conversations and Interactions

With Gemini Live, users can have ongoing conversations with the AI. If a topic needs more explanation or if there are follow-up questions, users can provide additional context. The AI can be interrupted mid-response for more information and can be paused if users wish to return to it later.

Gemini Live is comparable to OpenAI’s ChatGPT Advanced Voice Mode, which was also announced recently. However, Gemini Live offers more voice options and a higher context window, which might give it an edge. Google’s Gemini Live supports up to one million tokens, and developers have access to up to 2 million tokens.

Availability and Future Plans

Currently, Gemini Live is rolling out to Gemini Advanced subscribers on Android devices. Initially, the feature is available only in English.

Google plans to expand it to more languages and make it available on iOS in the near future.

The Gemini Advanced subscription is part of the Google One AI Premium plan, which costs Rs. 1,950 per month esp in India.

Final Thoughts

Google’s new AI features aim to enhance user interaction with its chatbot technology.

With Gemini Live, conversations with AI are becoming more natural and flexible, setting a new standard in voice-based AI interactions.