Google Launches Gemini Live For More ‘In-Depth’ Voice Chats

Google is giving tough competition to OpenAI’s latest Advanced Voice Mode on ChatGPT with the launch of its Gemini Live.

The latest rollout is said to cater to more in-depth voice conversations on the Gemini chatbot. The product was first unveiled after getting announced at the company’s I/O developer conference this year and it’s finally going live.

For now, it’s just a limited alpha but the idea of having comprehensive voice chats through the company’s AI voice assistant is another breakthrough launch. Google promises consistent chats that are more emotionally expressive and realistic in nature. You can also speak in between the chatbot’s conversation by interrupting and making the most of multi-turn dialogue as well.

Before, asking follow-up queries was not possible. Now, the chatbot is enabled to follow your tone and pattern of speech live.

Google described the entire matter through a new blog post where it boasted the 10 different natural-sounding voices that enable you to provide responses. Hence, no need to worry about your tone, speed, or interruptions as Gemini Live will adjust accordingly. Just pretend like you’re talking to a human being as everything stays the same.

Other than that, Google says it’s hands-free, an option that those on the run can avail. Keep on talking into the app or when the device is locked. All the chats could be paused and then resumed at any given moment in time.

As far as productivity is concerned, it’s right up there. Imagine preparing for a job interview. You can simply rehearse before the next meeting or practice giving a speech to a chatbot with ease. The produce allows you to gain more insights on how to make your speaking skills better, including some useful tips and tricks to nail any conversation that puts you on the spot.

Another major advantage over OpenAI’s Advanced Voice Mode is enhanced memory. The offering has a longer context window. This means taking in a lot of information and providing logical reasoning regarding that. So you can go on and on, and engage in back-and-forth discussions, before finally producing a written response.

Google says Gemini Live makes use of the newest models that give it a more conversational touch. It’s a huge context window to benefit from where users can take part in long chats through Live.

The promises are plenty but how well Google stands against them is a question worth pondering that only time can tell. There is one thing missing that Google displayed at the developer’s conference for this product. It’s the multimodal output.

Google first rolled out pre-recorded content displaying Gemini Live showing responses to users’ backgrounds through images and recordings captured via phone cameras. Hence, Google did speak about how it’s arriving sometime this year. But it failed in terms of adding more details on the subject.

Gemini Live will also be rolled out in different languages this year and can also be available for iOS users. For now, it’s just in English. Also, it’s not free. Only those with Gemini Advanced subscriptions will benefit which costs $20 a month.

However, Google does promise more freebies for Gemini coming soon this year. This includes making overlay more prominent on the app through more questions about what’s next. Simply press the Power button and say the term ‘Hey Google’ to produce a vast array of images. These can be edited into other Google apps such as Gmail.

Google is also equipping Gemini with more Google services. This includes more activities with Calendar, YouTube Music, Tasks, and Utilities. Media, volume, Bluetooth, flashlight, and more.

As per the company’s latest blog post, it looks like there’s a lot you can do with Gemini. The list of tasks is plenty and it’s like your own mini assistant. From finding what to cook for dinner to curating the perfect music playlist for date night, trust Gemini to do it all.

Did we mention how Gemini is coming to Android tablets soon? Clearly, there’s a lot to anticipate here and we’ll keep you updated as it happens.


Read next:

• China and the U.S. Drive 42% of Global Tech Spending, Forrester Research Reveals

• FCC Intensifies Fight Against AI-Generated Phone Calls With New Rules

YouTube Capitalizes On Shorts Popularity By Testing Longer Clips Within The Shorts Feed

• What Are The Industrial Applications Of Nanoparticle Processing?
Previous Post Next Post