    Google’s Gemini: A Leap Forward in Robotic Development

    Google DeepMind's Robotics team has unveiled a new application of its Gemini AI model. The team's paper, "Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs," shows how Gemini 1.5 Pro can be used to teach robots to navigate and respond to commands.
    By Zayne Pham | July 12, 2024
    Credit: Google DeepMind.

    Google’s Gemini AI: Redefining Robotic Interaction

    Google DeepMind’s robotics project represents a significant milestone in the field of AI-driven robotics. Leveraging the capabilities of Gemini 1.5 Pro, the team has demonstrated how robots can navigate complex office environments and perform tasks based on natural language commands. This advancement follows a series of innovative applications where generative AI has shown promise, including natural language interactions, robot learning, no-code programming, and design.

    The Demonstration: Robots Navigating Google DeepMind Offices

    In a series of compelling videos, DeepMind employees interacted with the robot using a smart assistant-style command, “OK, Robot.” Upon receiving various instructions, the robot, equipped with a jaunty yellow bowtie, showcased its ability to navigate the 9,000-square-foot office space.

    The system’s architecture takes in these inputs and then creates a topological graph – or a simplified representation of a space. 📈

    This is constructed from frames within tour videos, which captures the general connectivity of their surroundings to find a path without a map. pic.twitter.com/JJJwNpTtLx

    — Google DeepMind (@GoogleDeepMind) July 11, 2024

    For instance, when asked to find a place to draw, the robot replied, “Thinking with Gemini,” and then led the person to a wall-sized whiteboard. In another scenario, it successfully navigated to a designated “Blue Area” in response to directions written on a whiteboard, demonstrating an impressive level of understanding and execution.
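    The topological graph mentioned in the DeepMind post can be pictured with a small sketch. This is not DeepMind's code: it assumes each tour frame is a node, consecutive frames are linked, and a hypothetical `extra_links` argument adds shortcuts between frames judged to show nearby places. A breadth-first search then finds a route through frames without any metric map.

```python
from collections import deque


def build_topological_graph(num_frames, extra_links=()):
    """Link consecutive tour frames, plus any extra (i, j) shortcut pairs.

    Assumption: how shortcuts are detected is out of scope here, so they
    are passed in by hand.
    """
    graph = {i: set() for i in range(num_frames)}
    for i in range(num_frames - 1):
        graph[i].add(i + 1)
        graph[i + 1].add(i)
    for i, j in extra_links:
        graph[i].add(j)
        graph[j].add(i)
    return graph


def shortest_path(graph, start, goal):
    """Breadth-first search; returns frame indices from start to goal, or None."""
    parents = {start: None}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if node == goal:
            path = []
            while node is not None:
                path.append(node)
                node = parents[node]
            return path[::-1]
        for neighbor in sorted(graph[node]):
            if neighbor not in parents:
                parents[neighbor] = node
                queue.append(neighbor)
    return None


# Six tour frames, with frame 1 and frame 4 showing the same corridor.
tour_graph = build_topological_graph(6, extra_links=[(1, 4)])
route = shortest_path(tour_graph, 0, 5)
print(route)  # [0, 1, 4, 5]
```

    The shortcut lets the route skip frames 2 and 3, which is the sense in which the graph captures connectivity rather than exact geometry.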

    The Technology Behind Gemini-Powered Robots

    The project tackles a class of tasks the paper calls “Multimodal Instruction Navigation with demonstration Tours (MINT).” The robot is first familiarized with the office space by walking it around and pointing out various landmarks using speech. At run time, a hierarchical Vision-Language-Action (VLA) navigation policy, the Mobility VLA of the paper’s title, combines environmental understanding with common-sense reasoning to turn an instruction into movement.
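    The hierarchical idea can be illustrated with a toy two-level policy. Everything here is an assumption for illustration: `choose_goal_frame` stands in for the vision-language model and only counts shared words, whereas the real system reasons over video, and `next_waypoint` simply steps along the tour order.

```python
def choose_goal_frame(instruction, narrations):
    """High-level step (stand-in for the VLM, not the real model).

    Picks the tour frame whose spoken narration shares the most words
    with the instruction.
    """
    words = set(instruction.lower().split())
    scores = [len(words & set(text.lower().split())) for text in narrations]
    return scores.index(max(scores))


def next_waypoint(current, goal):
    """Low-level step: move one frame along the tour toward the goal."""
    if current == goal:
        return current
    return current + 1 if goal > current else current - 1


def navigate(instruction, narrations, start=0):
    """Run the high-level choice once, then step until the goal is reached."""
    goal = choose_goal_frame(instruction, narrations)
    position, visited = start, [start]
    while position != goal:
        position = next_waypoint(position, goal)
        visited.append(position)
    return goal, visited


# Hypothetical narrations recorded during the demonstration tour.
narrations = [
    "front desk",
    "kitchen with a refrigerator",
    "meeting room",
    "wall-sized whiteboard for drawing",
]
goal, visited = navigate("where can I find a whiteboard for drawing", narrations)
print(goal, visited)  # 3 [0, 1, 2, 3]
```

    Keeping the two levels separate means the slow reasoning step runs once per instruction, while the cheap stepping logic runs repeatedly as the robot moves.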

    We gave the robot 57 types of tasks to perform throughout a 9000+ square ft operating area.

    Can you guess its success rate❔

    — Google DeepMind (@GoogleDeepMind) July 11, 2024

    The integration of Gemini 1.5 Pro allows the robot to process extensive amounts of information through its long context window, enabling it to handle video and text inputs. This capability is crucial for the robot to make sense of its environment and navigate based on commands that require common sense reasoning.
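    One way to picture how a long context window is used: the whole tour, frame by frame, goes into a single prompt alongside the user's instruction. The sketch below only assembles such a prompt as a Python list. The file names, the prompt wording, and the `{"image": ...}` placeholder are hypothetical, and no model API is called.

```python
def build_tour_prompt(frames, instruction):
    """Interleave tour frames and narration with the user's instruction.

    frames: list of (image_ref, narration_or_None) pairs in tour order.
    Returns a list of prompt parts; image parts are placeholder dicts.
    """
    parts = ["You are a robot navigating an office. Tour frames follow."]
    for index, (image_ref, narration) in enumerate(frames):
        parts.append(f"Frame {index}:")
        parts.append({"image": image_ref})
        if narration:
            parts.append(f"Narration: {narration}")
    parts.append(f"Instruction: {instruction}")
    parts.append("Reply with the index of the frame closest to the destination.")
    return parts


# Hypothetical three-frame tour; a real tour would contain far more frames.
frames = [
    ("tour_000.jpg", "This is the front desk."),
    ("tour_001.jpg", None),
    ("tour_002.jpg", "Here is the whiteboard."),
]
prompt = build_tour_prompt(frames, "Take me somewhere I can draw.")
print(len(prompt))  # 11
```

    Because every frame is present in the prompt, the model can answer with a frame index, which a lower-level planner can then treat as its destination.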


    Practical Applications and Future Prospects

    The implications of this technology are vast. During practical tests, the Gemini-powered robot achieved a 90% success rate across more than 50 user interactions. Tasks ranged from simple navigation commands to more complex instructions, such as checking the availability of a specific drink in a refrigerator and reporting back.

    This advancement is part of a larger revolution in robotics, where large language models like Gemini are increasingly enhancing the capabilities of physical machines. Academic and industry research labs are actively exploring the potential of vision language models to improve robotic performance. For example, several researchers from the Google project have moved on to startups like Physical Intelligence, aiming to combine large language models with real-world training to give robots general problem-solving abilities.


    Challenges and Future Developments

    Despite the impressive demonstrations, there are still challenges to overcome. One notable limitation is latency: the robot can take up to 30 seconds to respond to a command. Additionally, real-world environments are more complex than a controlled office space, so the technology will need further refinement.

    How can Gemini 1.5 Pro’s long context window help robots navigate the world? 🤖

    A thread of our latest experiments. 🧵 pic.twitter.com/ZRQqQDEw98

    — Google DeepMind (@GoogleDeepMind) July 11, 2024

    Google DeepMind plans to test the system on different types of robots and explore more complex tasks. The integration of AI models like Gemini into robotics is expected to transform various industries, from healthcare and shipping to janitorial duties, by enhancing the robots’ ability to understand and interact with their environments.

    Gemini AI is setting a new standard in robotic navigation and interaction. By combining advanced AI models with practical robotic applications, Google DeepMind is paving the way for a future where robots can perform tasks with a level of understanding and efficiency previously thought unattainable. As this technology continues to evolve, it holds the potential to revolutionize the way we interact with and utilize robots in everyday life.
