Artificial Intelligence Models for Robotics
Google DeepMind has unveiled artificial intelligence models for robotics that it hailed as a milestone in the long quest to make the general-purpose machines more useful and practical in the everyday world.
New Robotics Models
The company’s new robotics models, called Gemini Robotics and Gemini Robotics-ER, are designed to help robots adapt to complex environments by taking advantage of the reasoning capabilities of large language models to complete complicated real-world tasks.
Real-World Tasks
According to Google DeepMind, a robot trained using its new models was able to:
- Fold an origami fox
- Organize a desk according to verbal instructions
- Wrap headphone wires
- Slam dunk a miniature basketball through a hoop
Google DeepMind is also partnering with start-up Apptronik to build humanoid robots using this technology.
Industry Insights
The development comes as tech groups, including Tesla and OpenAI, and start-ups race to build the AI “brain” that can operate robots autonomously, a shift that could transform a range of industries, from manufacturing to healthcare.
Jensen Huang, chief executive of chipmaker Nvidia, said this year that the use of generative AI to deploy robots at scale represents a multitrillion-dollar opportunity that will “pave the way to the largest technology industry the world has ever seen”.
Progress in Robotics
Progress in advanced robotics has been painstakingly slow in recent decades, with scientists manually coding each move a robot makes. Thanks to new AI techniques, scientists have been able to train robots to adapt better to their surroundings and learn new skills much faster.
Creating the Gemini Robotics Model
To create the Gemini Robotics model, Google used its Gemini 2.0 language model and trained it specifically to control robots. This gave robots a boost in performance and allowed them to do three things:
- Adjust to new and unfamiliar situations
- Respond quickly to verbal instructions or changes in their environment
- Be dexterous enough to manipulate objects
Adaptability
Such adaptability would be a boon for those developing the technology, as one big obstacle in robotics is that machines perform well in laboratories but poorly in less tightly controlled settings.
Conclusion
Google DeepMind’s Gemini Robotics model is a step towards general-purpose robots that can adapt to complex environments and complete real-world tasks. While much remains to be done before these robots are ready for adoption, the potential is vast, with the possibility of transforming industries and the way we live and work.
Frequently Asked Questions
Q: What are the capabilities of the Gemini Robotics model?
A: Robots running the Gemini Robotics model can adjust to new situations, respond quickly to verbal instructions or changes in their environment, and be dexterous enough to manipulate objects.
Q: How does the Gemini Robotics model work?
A: The Gemini Robotics model is built on Google’s Gemini 2.0 language model, which was trained specifically to control robots. This gives robots a boost in performance and allows them to complete real-world tasks.
Q: What is the potential impact of the Gemini Robotics model?
A: The Gemini Robotics model has the potential to transform industries, from manufacturing to healthcare, and could lead to the development of general-purpose robots that can adapt to complex environments and complete real-world tasks.

