Postavke privatnosti

A revolutionary cuTAMP algorithm from mit and NVIDIA allows robots to solve complex tasks in seconds

Researchers from mit and NVIDIA have developed cuTAMP, a new algorithm that uses the power of GPUs for parallel planning. Instead of slow, sequential testing, the robot now analyzes thousands of possible movements simultaneously, solving complex manipulation and packaging tasks in just seconds.

A revolutionary cuTAMP algorithm from mit and NVIDIA allows robots to solve complex tasks in seconds
Photo by: Domagoj Skledar/ arhiva (vlastita)

Imagine that you are preparing for a long-awaited vacation. You are faced with the challenge of packing a suitcase: all the necessary items must fit without anything fragile breaking in the process. For humans, thanks to our visual and spatial abilities, this is a mostly solvable problem, even if it requires a little creative arrangement. However, for a robot, this represents an extremely complex planning task that requires the simultaneous consideration of countless actions, constraints, and mechanical possibilities. Finding an effective solution could take an extremely long time, if the robot manages to find one at all.


But a scientific team composed of researchers from the prestigious Massachusetts Institute of Technology (MIT) and the technology giant NVIDIA has developed a revolutionary algorithm that dramatically speeds up this process. Their innovative approach allows the robot to literally "think ahead," evaluating thousands of potential movement plans in parallel, and then refining the best ones to meet all the set conditions of the robot and the environment. Instead of testing every possible action one by one, like existing methods, this new method considers thousands of them simultaneously, solving complex, multi-stage manipulation problems in just a few seconds.


A Revolution in Planning: From a Sequential to a Parallel Approach


The key to this incredible speed lies in using the immense computing power of specialized processors known as graphics processing units (GPUs). In environments like factories or warehouses, this technique could enable robots to instantly determine how to manipulate and densely pack items of various shapes and sizes without damage, collapse, or collision with obstacles, even in very confined spaces. This is crucial in industrial settings where time is literally money and an efficient solution needs to be found in the shortest possible time.


William Shen, an MIT graduate and lead author of the scientific paper on this technique, points out: "If your algorithm takes minutes to find a plan, as opposed to seconds, that directly costs the business." Traditional Task and Motion Planning (TAMP) algorithms often face what is called a "combinatorial explosion" – the number of possible action sequences grows exponentially with each new item or step, making the problem almost unsolvable in real-time. Most of these randomly tried actions do not lead to any productive outcome, which further slows down the process.


At the Heart of the Innovation: The Power of Graphics Processing Units (GPUs)


The algorithm, named cuTAMP, is accelerated using the parallel computing platform CUDA, developed by NVIDIA itself. This platform allows programmers to harness the full potential of GPUs for general-purpose computing tasks, far beyond their original purpose of generating computer graphics. GPUs are designed with thousands of cores that can execute operations simultaneously, making them ideal for tasks that can be divided into many smaller, independent parts – just like simulating thousands of different plans for a robot.


Caelan Garrett, a senior research scientist at NVIDIA Research, explains: "The search space is huge, and many of the actions the robot takes in that space don't actually accomplish anything productive." By using a GPU, the computational cost of optimizing one solution becomes almost identical to the cost of optimizing hundreds or thousands of solutions. This is a fundamental paradigm shift that opens the door to solving problems that were previously considered too complex for real-time automation.


How Does cuTAMP “Think”? A Combination of Sampling and Optimization


The research team designed the algorithm specifically for what is called Task and Motion Planning (TAMP). The goal of a TAMP algorithm is to create a dual plan for the robot: a task plan, which represents a high-level sequence of actions (e.g., "pick up object A," "place object A in the box"), and a motion plan, which includes low-level action parameters such as the exact joint positions of the arm and the orientation of the gripper to execute that plan.


To create a plan for packing items, the robot must think about numerous variables. This includes the final orientation of the packed items to make them fit, as well as how it will lift and manipulate them using its arm and gripper, all while avoiding collisions and respecting user-defined constraints, such as the packing order.


The cuTAMP algorithm achieves its efficiency by combining two powerful techniques: smart sampling and parallel optimization.


Smart sampling: Instead of randomly choosing potential solutions, cuTAMP restricts the range of possible solutions to those most likely to satisfy the problem's constraints. This modified sampling procedure allows the algorithm to broadly explore potential solutions, but within a narrowed, promising space. "Once we combine the outputs of these samples, we get a much better starting point than if we had sampled randomly. This ensures that we can find solutions more quickly during optimization," explains Shen.


Parallel optimization: After generating a set of samples, cuTAMP performs a parallelized optimization procedure. It calculates a "cost" for each sample, which corresponds to how well that sample avoids collisions, meets the robot's motion constraints, and fulfills the goals defined by the user. The algorithm then updates all samples simultaneously, selects the best candidates, and repeats the process until it narrows them down to a single successful, feasible solution.


Practical Application and Testing: From Simulation to the Real World


When the researchers tested their approach on simulated Tetris-like packing challenges, cuTAMP took only a few seconds to find successful, collision-free plans, tasks that would take sequential approaches significantly longer, if they could solve them at all. More importantly, when applied to a real robotic arm, the algorithm always found a solution in less than 30 seconds.


The system is designed to be general and to work on different robots. It has been successfully tested on a robotic arm at MIT and on a humanoid robot in NVIDIA's labs. One of the key advantages is that cuTAMP is not a machine learning algorithm and therefore does not require training data. This allows it to be easily applied in many new situations. "You can give it a completely new problem, and it's proven to solve it," adds Garrett. This generalization also extends to situations beyond packing, such as robots using tools. A user could incorporate different types of skills into the system to automatically expand the robot's capabilities.


The Future of Autonomous Manipulation: More Than Just Packing Boxes


Although packing is an excellent example of complexity, the potential applications of this technology are far broader. In manufacturing, robots could perform complex assembly tasks that require precise manipulation of multiple components. In logistics, they could optimize the loading and unloading of trucks, maximizing space utilization. In scientific laboratories, they could handle sensitive equipment and samples, reducing the risk of human error.


In the future, the researchers want to leverage large language models (LLMs) and vision-language models within cuTAMP. This would allow the robot to formulate and execute a plan that achieves specific goals based on the user's voice commands. For example, you could tell the robot, "Pack my beach bag," and it would, using visual sensors to identify items like a towel, sunscreen, and a book, independently devise and execute the most efficient way to pack them. This step represents a crucial link between abstract human language and the concrete physical action of the robot, opening the door to an era where robots will become even more intuitive and useful partners in daily life and work.

Source: Massachusetts Institute of Technology

Find accommodation nearby

Creation time: 06 June, 2025

Science & tech desk

Our Science and Technology Editorial Desk was born from a long-standing passion for exploring, interpreting, and bringing complex topics closer to everyday readers. It is written by employees and volunteers who have followed the development of science and technological innovation for decades, from laboratory discoveries to solutions that change daily life. Although we write in the plural, every article is authored by a real person with extensive editorial and journalistic experience, and deep respect for facts and verifiable information.

Our editorial team bases its work on the belief that science is strongest when it is accessible to everyone. That is why we strive for clarity, precision, and readability, without oversimplifying in a way that would compromise the quality of the content. We often spend hours studying research papers, technical documents, and expert sources in order to present each topic in a way that will interest rather than burden the reader. In every article, we aim to connect scientific insights with real life, showing how ideas from research centres, universities, and technology labs shape the world around us.

Our long experience in journalism allows us to recognize what is truly important for the reader, whether it is progress in artificial intelligence, medical breakthroughs, energy solutions, space missions, or devices that enter our everyday lives before we even imagine their possibilities. Our view of technology is not purely technical; we are also interested in the human stories behind major advances – researchers who spend years completing projects, engineers who turn ideas into functional systems, and visionaries who push the boundaries of what is possible.

A strong sense of responsibility guides our work as well. We want readers to trust the information we provide, so we verify sources, compare data, and avoid rushing to publish when something is not fully clear. Trust is built more slowly than news is written, but we believe that only such journalism has lasting value.

To us, technology is more than devices, and science is more than theory. These are fields that drive progress, shape society, and create new opportunities for everyone who wants to understand how the world works today and where it is heading tomorrow. That is why we approach every topic with seriousness but also with curiosity, because curiosity opens the door to the best stories.

Our mission is to bring readers closer to a world that is changing faster than ever before, with the conviction that quality journalism can be a bridge between experts, innovators, and all those who want to understand what happens behind the headlines. In this we see our true task: to transform the complex into the understandable, the distant into the familiar, and the unknown into the inspiring.

NOTE FOR OUR READERS
Karlobag.eu provides news, analyses and information on global events and topics of interest to readers worldwide. All published information is for informational purposes only.
We emphasize that we are not experts in scientific, medical, financial or legal fields. Therefore, before making any decisions based on the information from our portal, we recommend that you consult with qualified experts.
Karlobag.eu may contain links to external third-party sites, including affiliate links and sponsored content. If you purchase a product or service through these links, we may earn a commission. We have no control over the content or policies of these sites and assume no responsibility for their accuracy, availability or any transactions conducted through them.
If we publish information about events or ticket sales, please note that we do not sell tickets either directly or via intermediaries. Our portal solely informs readers about events and purchasing opportunities through external sales platforms. We connect readers with partners offering ticket sales services, but do not guarantee their availability, prices or purchase conditions. All ticket information is obtained from third parties and may be subject to change without prior notice. We recommend that you thoroughly check the sales conditions with the selected partner before any purchase, as the Karlobag.eu portal does not assume responsibility for transactions or ticket sale conditions.
All information on our portal is subject to change without prior notice. By using this portal, you agree to read the content at your own risk.