Google is using Gemini AI to make its robots better at navigating spaces and completing tasks. The DeepMind robotics team published a research paper detailing how Gemini 1.5 Pro's long context window, which lets the model take in large amounts of video and text in a single prompt, allows users to interact with its RT-2 robots using natural language commands.
The process involves filming a video tour of a space, such as a home or office, which the robot "watches" to learn about the environment. It can then carry out commands based on what it has observed, such as guiding a user to a power outlet when shown a phone and asked, "Where can I charge this?" DeepMind reports that the Gemini-powered robot succeeded on 90 percent of more than 50 user instructions given across a 9,000-plus-square-foot operating area.
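The paper doesn't include the robot's control code, but the core idea (feeding a long video tour plus a user request into Gemini 1.5 Pro's large context window) can be sketched with Google's public google-generativeai Python SDK. Everything here is illustrative: the "tour.mp4" file name, the prompt wording, and printing a destination instead of driving a robot are assumptions, not details from DeepMind's system.

```python
import time

import google.generativeai as genai

# A minimal sketch of long-context multimodal prompting, not DeepMind's
# actual robot stack. "tour.mp4" and the prompt text are assumptions.
genai.configure(api_key="YOUR_API_KEY")

# Upload the walkthrough video and wait for server-side processing.
tour = genai.upload_file(path="tour.mp4")
while tour.state.name == "PROCESSING":
    time.sleep(5)
    tour = genai.get_file(tour.name)

model = genai.GenerativeModel("gemini-1.5-pro")

# Ask the kind of question a user would ask the robot; a real system
# would turn the answer into a navigation goal rather than print it.
response = model.generate_content([
    tour,
    "You have watched a tour of this office. A user holds up a phone and "
    "asks: 'Where can I charge this?' Describe where they should go.",
])
print(response.text)
```

The hard part in the real system is grounding that textual answer in the robot's map of the space so it can actually drive there; the sketch stops at the language step.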
Researchers also found “preliminary evidence” that Gemini 1.5 Pro lets robots plan how to carry out tasks that go beyond simple navigation. For instance, if a user with a desk full of Coke cans asks whether their favorite drink is available, Gemini can direct the robot to navigate to the fridge, check for Cokes, and then return to report the answer. DeepMind says it plans to investigate these results further.
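The paper doesn't spell out the planning interface, but the same long-context setup can be prompted for an ordered plan rather than a single destination. This continues the sketch above; the file ID, the numbered-step format, and the fridge scenario framing are all assumptions for illustration.

```python
import google.generativeai as genai

genai.configure(api_key="YOUR_API_KEY")
model = genai.GenerativeModel("gemini-1.5-pro")

# Reuse the previously uploaded tour video; this file ID is hypothetical.
tour = genai.get_file("files/office-tour")

# Ask for a plan instead of a destination. The numbered-step convention
# is illustrative, not an interface described in the paper.
plan = model.generate_content([
    tour,
    "A user whose desk is covered in Coke cans asks: 'Is my favorite "
    "drink available?' Reply with a numbered list of steps the robot "
    "should take: where to navigate, what to check, and what to report.",
])
print(plan.text)
```

In DeepMind's demos the robot executes plans like this end to end; the sketch only produces the text of the plan.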
While Google's video demonstrations are impressive, the research notes that the robot takes 10 to 30 seconds to process each instruction. We may not be sharing our homes with advanced environment-mapping robots just yet, but these machines could soon help us find our missing keys or wallets.