Wednesday, June 07, 2023

 A I



ChatGPT designs its first robot with TU Delft researchers


What are the opportunities and risks? The result of this partnership between humans and AI have been published in Nature Machine Learning

Peer-Reviewed Publication

DELFT UNIVERSITY OF TECHNOLOGY

Tomato picker robotic arm designed with ChatGPT 

IMAGE: THE TOMATO PICKER ROBOT DESIGNED WITH CHATGPT BY RESEARCHERS FROM TU DELFT AND EPFL MOVES THROUGH A TESTING ENVIRONMENT view more 

CREDIT: © ADRIEN BUTTIER / EPFL



Poems, essays and even books – is there anything the open AI platform ChatGPT can’t handle? These new AI developments have inspired researchers at TU Delft and the Swiss technical university EPFL to dig a little deeper: For instance, can ChatGPT also design a robot? And is this a good thing for the design process, or are there risks? The researchers published their findings in Nature Machine Intelligence.

What are the greatest future challenges for humanity? This was the first question that Cosimo Della Santina, assistant professor, and PhD student Francesco Stella, both from TU Delft, and Josie Hughes from EPFL, asked ChatGPT. “We wanted ChatGPT to design not just a robot, but one that is actually useful,” says Della Santina. In the end, they chose food supply as their challenge, and as they chatted with ChatGPT, they came up with the idea of creating a tomato-harvesting robot.

Helpful suggestions
The researchers followed all of ChatGPT’s design decisions. The input proved particularly valuable in the conceptual phase, according to Stella. “ChatGPT extends the designer's knowledge to other areas of expertise. For example, the chat robot taught us which crop would be most economically valuable to automate.” But ChatGPT also came up with useful suggestions during the implementation phase: “Make the gripper out of silicone or rubber to avoid crushing tomatoes” and “a Dynamixel motor is the best way to drive the robot”. The result of this partnership between humans and AI is a robotic arm that can harvest tomatoes.

ChatGPT as a researcher
The researchers found the collaborative design process to be positive and enriching. “However, we did find that our role as engineers shifted towards performing more technical tasks,” says Stella. In Nature Machine Intelligence, the researchers explore the varying degrees of cooperation between humans and Large Language Models (LLM), of which ChatGPT is one. In the most extreme scenario, AI provides all the input to the robot design, and the human blindly follows it. In this case, the LLM acts as the researcher and engineer, while the human acts as the manager, in charge of specifying the design objectives.

Risk of misinformation
Such an extreme scenario is not yet possible with today’s LLMs. And the question is whether it is desirable. “In fact, LLM output can be misleading if it is not verified or validated. AI bots are designed to generate the ‘most probable’ answer to a question, so there is a risk of misinformation and bias in the robotic field,” Della Santina says. Working with LLMs also raises other important issues, such as plagiarism, traceability and intellectual property.

Della Santina, Stella and Hughes will continue to use the tomato-harvesting robot in their research on robotics. They are also continuing their study of LLMs to design new robots. Specifically, they are looking at the autonomy of AIs in designing their own bodies. “Ultimately an open question for the future of our field is how LLMs can be used to assist robot developers without limiting the creativity and innovation needed for robotics to rise to the challenges of the 21st century,” Stella concludes.

A tomato picker robot designed by ChatGPT and researchers from TU Delft and EPFL in a field test together with a researcher

A robot tomato picker arm designed by ChatGPT and researchers from TU Delft and EPFL "looks' at the camera

CREDIT

© Adrien Buttier / EPFL

JOURNAL

DOI

METHOD OF RESEARCH

SUBJECT OF RESEARCH

ARTICLE TITLE

ARTICLE PUBLICATION DATE

Computational model mimics humans’ ability to predict emotions 


Using insights into how people intuit others’ emotions, researchers have designed a model that approximates this aspect of human social intelligence.

Peer-Reviewed Publication

MASSACHUSETTS INSTITUTE OF TECHNOLOGY




CAMBRIDGE, MA -- When interacting with another person, you likely spend part of your time trying to anticipate how they will feel about what you’re saying or doing. This task requires a cognitive skill called theory of mind, which helps us to infer other people’s beliefs, desires, intentions, and emotions.

MIT neuroscientists have now designed a computational model that can predict other people’s emotions — including joy, gratitude, confusion, regret, and embarrassment — approximating human observers’ social intelligence. The model was designed to predict the emotions of people involved in a situation based on the prisoner’s dilemma, a classic game theory scenario in which two people must decide whether to cooperate with their partner or betray them. 

To build the model, the researchers incorporated several factors that have been hypothesized to influence people’s emotional reactions, including that person’s desires, their expectations in a particular situation, and whether anyone was watching their actions.

“These are very common, basic intuitions, and what we said is, we can take that very basic grammar and make a model that will learn to predict emotions from those features,” says Rebecca Saxe, the John W. Jarve Professor of Brain and Cognitive Sciences, a member of MIT’s McGovern Institute for Brain Research, and the senior author of the study.

Sean Dae Houlihan PhD ’22, a postdoc at the Neukom Institute for Computational Science at Dartmouth College, is the lead author of the paper, which appears in Philosophical Transactions A. Other authors include Max Kleiman-Weiner PhD ’18, a postdoc at MIT and Harvard University; Luke Hewitt PhD ’22, a visiting scholar at Stanford University; and Joshua Tenenbaum, a professor of computational cognitive science at MIT and a member of the Center for Brains, Minds, and Machines and MIT’s Computer Science and Artificial Intelligence Laboratory (CSAIL).

Predicting emotions

While a great deal of research has gone into training computer models to infer someone’s emotional state based on their facial expression, that is not the most important aspect of human emotional intelligence, Saxe says. Much more important is the ability to predict someone’s emotional response to events before they occur.

“The most important thing about what it is to understand other people's emotions is to anticipate what other people will feel before the thing has happened,” she says. “If all of our emotional intelligence was reactive, that would be a catastrophe.”

To try to model how human observers make these predictions, the researchers used scenarios taken from a British game show called “Golden Balls.” On the show, contestants are paired up with a pot of $100,000 at stake. After negotiating with their partner, each contestant decides, secretly, whether to split the pool or try to steal it. If both decide to split, they each receive $50,000. If one splits and one steals, the stealer gets the entire pot. If both try to steal, no one gets anything.

Depending on the outcome, contestants may experience a range of emotions — joy and relief if both contestants split, surprise and fury if one’s opponent steals the pot, and perhaps guilt mingled with excitement if one successfully steals.

To create a computational model that can predict these emotions, the researchers designed three separate modules. The first module is trained to infer a person’s preferences and beliefs based on their action, through a process called inverse planning.

“This is an idea that says if you see just a little bit of somebody's behavior, you can probabilistically infer things about what they wanted and expected in that situation,” Saxe says.

Using this approach, the first module can predict contestants’ motivations based on their actions in the game. For example, if someone decides to split in an attempt to share the pot, it can be inferred that they also expected the other person to split. If someone decides to steal, they may have expected the other person to steal, and didn’t want to be cheated. Or, they may have expected the other person to split and decided to try to take advantage of them.

The model can also integrate knowledge about specific players, such as the contestant’s occupation, to help it infer the players’ most likely motivation.

The second module compares the outcome of the game with what each player wanted and expected to happen. Then, a third module predicts what emotions the contestants may be feeling, based on the outcome and what was known about their expectations. This third module was trained to predict emotions based on predictions from human observers about how contestants would feel after a particular outcome. The authors emphasize that this is a model of human social intelligence, designed to mimic how observers causally reason about each other’s emotions, not a model of how people actually feel.

“From the data, the model learns that what it means, for example, to feel a lot of joy in this situation, is to get what you wanted, to do it by being fair, and to do it without taking advantage,” Saxe says.

Core intuitions

Once the three modules were up and running, the researchers used them on a new dataset from the game show to determine how the models’ emotion predictions compared with the predictions made by human observers. This model performed much better at that task than any previous model of emotion prediction.

The model’s success stems from its incorporation of key factors that the human brain also uses when predicting how someone else will react to a given situation, Saxe says. Those include computations of how a person will evaluate and emotionally react to a situation, based on their desires and expectations, which relate to not only material gain but also how they are viewed by others.

“Our model has those core intuitions, that the mental states underlying emotion are about what you wanted, what you expected, what happened, and who saw. And what people want is not just stuff. They don’t just want money; they want to be fair, but also not to be the sucker, not to be cheated,” she says.

In future work, the researchers hope to adapt the model so that it can perform more general predictions based on situations other than the game-show scenario used in this study. They are also working on creating models that can predict what happened in the game based solely on the expression on the faces of the contestants after the results were announced.

###

The research was funded by the McGovern Institute; the Paul E. and Lilah Newton Brain Science Award; the Center for Brains, Minds, and Machines; the MIT-IBM Watson AI Lab; and the Multidisciplinary University Research Initiative.

No comments: