iM.News


    Google’s Gemini: A Leap Forward in Robotic Development

    Google DeepMind's Robotics team has unveiled a new application of its Gemini AI model. In the paper "Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs," the team shows how Gemini 1.5 Pro can be used to teach robots to navigate and respond to commands.
    By Zayne Pham, July 12, 2024 (3 min read)
    Credit: Google DeepMind.

    Google’s Gemini AI: Redefining Robotic Interaction

    Google DeepMind’s robotics project represents a significant milestone in the field of AI-driven robotics. Leveraging the capabilities of Gemini 1.5 Pro, the team has demonstrated how robots can navigate complex office environments and perform tasks based on natural language commands. This advancement follows a series of innovative applications where generative AI has shown promise, including natural language interactions, robot learning, no-code programming, and design.

    The Demonstration: Robots Navigating Google DeepMind Offices

    In a series of videos, DeepMind employees addressed the robot with a smart-assistant-style wake phrase, “OK, Robot.” Given various instructions, the robot, which sports a jaunty yellow bowtie, navigated the 9,000-square-foot office space.

    The system’s architecture takes in these inputs and then creates a topological graph – or a simplified representation of a space. 📈

    This is constructed from frames within tour videos, which captures the general connectivity of their surroundings to find a path without a map. pic.twitter.com/JJJwNpTtLx

    — Google DeepMind (@GoogleDeepMind) July 11, 2024
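The idea of a topological graph built from tour frames can be illustrated with a minimal Python sketch. It is not the paper's implementation: it assumes each tour frame comes with a hypothetical 2D position, links frames that lie within an assumed `radius` of each other, and then finds a route between two frames with Dijkstra's algorithm. The function names, the `radius` value, and the sample positions are all illustrative.

```python
import heapq
import math


def build_topological_graph(poses, radius=2.0):
    """Link tour frames whose (assumed) 2D positions are within `radius`."""
    graph = {i: [] for i in range(len(poses))}
    for i, (xi, yi) in enumerate(poses):
        for j in range(i + 1, len(poses)):
            xj, yj = poses[j]
            dist = math.hypot(xi - xj, yi - yj)
            if dist <= radius:
                # Undirected edge weighted by straight-line distance.
                graph[i].append((j, dist))
                graph[j].append((i, dist))
    return graph


def shortest_path(graph, start, goal):
    """Dijkstra's algorithm; returns a list of frame indices or None."""
    best = {start: 0.0}
    prev = {}
    heap = [(0.0, start)]
    while heap:
        dist, node = heapq.heappop(heap)
        if node == goal:
            break
        if dist > best.get(node, math.inf):
            continue  # Stale heap entry.
        for neighbor, weight in graph[node]:
            candidate = dist + weight
            if candidate < best.get(neighbor, math.inf):
                best[neighbor] = candidate
                prev[neighbor] = node
                heapq.heappush(heap, (candidate, neighbor))
    if goal not in best:
        return None
    path = [goal]
    while path[-1] != start:
        path.append(prev[path[-1]])
    return path[::-1]


# Hypothetical positions (in metres) for five frames of a tour video.
tour_poses = [(0.0, 0.0), (1.5, 0.0), (3.0, 0.0), (3.0, 1.5), (0.0, 1.5)]
tour_graph = build_topological_graph(tour_poses)
route = shortest_path(tour_graph, start=0, goal=3)
```

With these sample positions, the route passes through the intermediate frames rather than jumping straight to the goal, which is the sense in which the graph captures connectivity without a metric map.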

    For instance, when asked to find a place to draw, the robot replied, “Thinking with Gemini,” and then led the person to a wall-sized whiteboard. In another scenario, it successfully navigated to a designated “Blue Area” in response to directions written on a whiteboard, demonstrating an impressive level of understanding and execution.

    The Technology Behind Gemini-Powered Robots

    The project studies a task setting the paper calls “Multimodal Instruction Navigation with demonstration Tours (MINT).” The robot is first familiarized with the office space by walking it around and pointing out various landmarks using speech. Navigation is then handled by the paper's Mobility VLA system, a hierarchical Vision-Language-Action (VLA) policy that combines the environmental understanding and common-sense reasoning of the long-context model with low-level path following on a topological graph.

    We gave the robot 57 types of tasks to perform throughout a 9000+ square ft operating area.

    Can you guess its success rate❔

    — Google DeepMind (@GoogleDeepMind) July 11, 2024

    The integration of Gemini 1.5 Pro allows the robot to process extensive amounts of information through its long context window, enabling it to handle video and text inputs. This capability is crucial for the robot to make sense of its environment and navigate based on commands that require common sense reasoning.
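As a rough illustration of how a long context window might be used here, the following Python sketch interleaves tour frames with spoken narration and a user instruction into one prompt, then parses a frame index out of a model reply. Everything in it is an assumption made for illustration: the function names, the prompt wording, and the `<image:...>` placeholders are hypothetical, and no real model API is called.

```python
import re


def build_goal_prompt(tour_frames, narration, instruction):
    """Interleave frame placeholders with narration, then append the request.

    tour_frames: list of frame file names (stand-ins for real images).
    narration:   dict mapping a frame index to what was said at that frame.
    """
    parts = ["You are a robot navigating a building. Below is a tour of the space."]
    for index, frame in enumerate(tour_frames):
        parts.append(f"Frame {index}: <image:{frame}>")
        if index in narration:
            parts.append(f"Narration at frame {index}: {narration[index]}")
    parts.append(f"User instruction: {instruction}")
    parts.append("Answer with the index of the tour frame nearest the goal.")
    return parts


def parse_goal_frame(reply, num_frames):
    """Return the first integer in the reply if it is a valid frame index."""
    match = re.search(r"\d+", reply)
    if match is None:
        return None
    index = int(match.group())
    return index if 0 <= index < num_frames else None


# Hypothetical two-frame tour with one narrated landmark.
frames = ["hall.jpg", "whiteboard.jpg"]
prompt = build_goal_prompt(frames, {1: "This is the whiteboard."}, "Where can I draw?")
goal = parse_goal_frame("Frame 1", len(frames))
```

In a sketch like this, the returned frame index would be the goal handed to a path planner over the tour frames; a real tour would contain far more frames, which is where a long context window matters.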


    Practical Applications and Future Prospects

    In practical tests, the Gemini-powered robot achieved a 90% success rate across more than 50 user interactions. Tasks ranged from simple navigation commands to more complex instructions, such as checking whether a specific drink was available in a refrigerator and reporting back.

    This advancement is part of a larger revolution in robotics, where large language models like Gemini are increasingly enhancing the capabilities of physical machines. Academic and industry research labs are actively exploring the potential of vision language models to improve robotic performance. For example, several researchers from the Google project have moved on to startups like Physical Intelligence, aiming to combine large language models with real-world training to give robots general problem-solving abilities.


    Challenges and Future Developments

    Despite the impressive demonstrations, there are still challenges to overcome. One notable limitation is latency: the robot can take up to 30 seconds to respond to a command. Real-world environments are also more complex than a controlled office space, so the technology needs further refinement.

    How can Gemini 1.5 Pro’s long context window help robots navigate the world? 🤖

    A thread of our latest experiments. 🧵 pic.twitter.com/ZRQqQDEw98

    — Google DeepMind (@GoogleDeepMind) July 11, 2024

    Google DeepMind plans to test the system on different types of robots and explore more complex tasks. The integration of AI models like Gemini into robotics is expected to transform various industries, from healthcare and shipping to janitorial duties, by enhancing the robots’ ability to understand and interact with their environments.

    Gemini AI is setting a new standard in robotic navigation and interaction. By combining advanced AI models with practical robotic applications, Google DeepMind is paving the way for a future where robots can perform tasks with a level of understanding and efficiency previously thought unattainable. As this technology continues to evolve, it holds the potential to revolutionize the way we interact with and utilize robots in everyday life.
