Gemini Live first look: Better than talking to Siri, but worse than I’d like
Google launched Gemini Live during its Made by Google event Tuesday. The feature allows you to have a semi-natural spoken conversation, not typed out, with an AI chatbot powered by Google’s latest large language model. TechCrunch was there to test it out firsthand.
Gemini Live is Google’s answer to OpenAI’s Advanced Voice Mode, ChatGPT’s nearly identical feature that’s currently in a limited alpha test. While OpenAI beat Google to the punch by demoing the feature first, Google is the first to roll out the finalized feature.
In my experience, these low-latency verbal features feel much more natural than texting with ChatGPT, or even talking with Siri or Alexa. I found that Gemini Live responded to questions in less than two seconds, and was able to pivot fairly quickly when interrupted. Gemini Live is not perfect, but it’s the best way to use your phone hands-free that I’ve seen yet.
Before speaking with Gemini Live, the feature lets you choose from 10 voices, compared to just three voices from OpenAI. Google worked with voice actors to create each one. I appreciated the variety there, and found each one to sound very humanlike.
In one example, a Google product manager verbally asked Gemini Live to find family-friendly wineries near Mountain View with outdoor areas and playgrounds nearby, so that kids could potentially come along. That’s a far more complicated task than I’d ask Siri — or Google Search, frankly — but Gemini successfully recommended a spot that met the criteria: Cooper-Garrod Vineyards in Saratoga.
That said, Gemini Live leaves something to be desired. It seemed to hallucinate a nearby playground called Henry Elementary School Playground that is supposedly “10 minutes away” from that vineyard. There are other playgrounds nearby in Saratoga, but the nearest Henry Elementary School is more than a two-hour drive from there. There’s a Henry Ford Elementary School in Redwood City, but it’s 30 minutes away.
Google liked to show off how users can interrupt Gemini Live mid-sentence, and the AI will quickly pivot. The company says this allows users to control the conversation. In practice, this feature doesn’t work perfectly. Sometimes Google’s product managers and Gemini Live talked over each other, and the AI didn’t seem to pick up on what was said.
Notably, Google is not allowing Gemini Live to sing or mimic any voices outside of the 10 it provides, according to product manager Leland Rechis. The company is likely doing this to avoid run-ins with copyright law. Further, Rechis said Google is not focused on getting Gemini Live to understand emotional intonation in a user’s voice — something OpenAI touted during its demo.
Overall, the feature seems like a great way to dive deeply into a subject more naturally than you would with a simple Google Search. Google notes that Gemini Live is a step along the way to Project Astra, the fully multimodal AI model the company debuted during Google I/O. For now, Gemini Live is just capable of voice conversations; however, in the future Google wants to add real-time video understanding.