Can I just say I was equal parts thrilled and creeped out by Boston Dynamics’ latest video of SpotMinis cooperating to open doors?

While the entire video was quite interesting, I loved the part where one of the SpotMinis acts as if it realises that the door is closed. It then patiently waits for the other SpotMini, the one with a hand-like contraption, to come and open the door. Did the robots just communicate and work together to open the door and get out of the room? I strongly believe that part was entirely scripted for the purpose of the video. Still, it’s not too hard to speculate that someone is working on the problem of large-scale cooperation between AI agents and human beings.

I struggled to find any published papers from Boston Dynamics about how they do what they do. In fact, notoriously little is known about the actual tech and learning process that Boston Dynamics (BD) use in their robots. I came across a Reddit post some time ago where some insiders claimed that BD robots are mostly meticulously hand-engineered rather than built to rely heavily on machine learning and artificial intelligence.

Cooperation and the general urge to help out are innate human behaviours, and this has been shown several times in research where even 18-month-old babies try to “comfort others in distress, participate in household tasks, and help adults by bringing or pointing to out-of-reach objects”. You can read about one such fantastic study here. We haven’t fully understood what goes on inside our minds that makes us cooperate so flexibly and instinctively. I think it’s crucial we figure this out soon and work towards building empathetic AI agents and robots.

Cooperation between AI and machine learning agents, however, has only recently begun to be studied by research groups. OpenAI (one of the top AI research groups, famously co-founded by Elon Musk) recently released a paper about cooperation strategies in a multi-agent setting. Another top research group, DeepMind (now owned by Google), used deep multi-agent reinforcement learning models to understand complex behaviours of cooperation and competition. DeepMind’s research more closely modelled game-theoretic simulations like the Prisoner’s Dilemma, which features interactions and strategies in a closed environment with, for example, two players. Interestingly, in one of their games, “Gathering”, the AI agents became less cooperative and more aggressive both when they were resource-starved and when agent strategies became more complex.
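To make the game-theoretic framing a bit more concrete, here is a minimal iterated Prisoner’s Dilemma sketch in Python. To be clear, the payoff values and the tit-for-tat / always-defect / random strategies below are my own illustrative choices, not the environments or learned agents from the OpenAI or DeepMind papers.

```python
# Toy iterated Prisoner's Dilemma: illustrative only, not the setup from
# the OpenAI or DeepMind work referenced above.
import random

# Payoff matrix: (my_payoff, their_payoff) indexed by (my_move, their_move),
# where "C" = cooperate and "D" = defect. Values are conventional textbook ones.
PAYOFFS = {
    ("C", "C"): (3, 3),  # mutual cooperation
    ("C", "D"): (0, 5),  # I get exploited
    ("D", "C"): (5, 0),  # I exploit
    ("D", "D"): (1, 1),  # mutual defection
}

def tit_for_tat(history):
    """Cooperate first, then copy the opponent's previous move."""
    return "C" if not history else history[-1][1]

def always_defect(history):
    """The maximally aggressive strategy."""
    return "D"

def random_agent(history):
    """Cooperate or defect with equal probability."""
    return random.choice(["C", "D"])

def play(strategy_a, strategy_b, rounds=100):
    """Play an iterated game and return cumulative payoffs for both players."""
    history_a, history_b = [], []  # each entry is (my_move, their_move)
    score_a = score_b = 0
    for _ in range(rounds):
        move_a = strategy_a(history_a)
        move_b = strategy_b(history_b)
        pay_a, pay_b = PAYOFFS[(move_a, move_b)]
        score_a += pay_a
        score_b += pay_b
        history_a.append((move_a, move_b))
        history_b.append((move_b, move_a))
    return score_a, score_b

if __name__ == "__main__":
    print("tit-for-tat vs tit-for-tat: ", play(tit_for_tat, tit_for_tat))
    print("tit-for-tat vs always-defect:", play(tit_for_tat, always_defect))
    print("random vs always-defect:     ", play(random_agent, always_defect))
```

Even in this tiny simulation you can see the flavour of the result above: two tit-for-tat players sustain cooperation indefinitely, while against an always-defect opponent the game quickly collapses into mutual defection, the closed-form analogue of agents turning less cooperative under pressure.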

In a social context, however, you’re interacting with a much larger number of people, over a longer time scale, and with different value functions. These are the kinds of models that social scientists and macroeconomists deal with, and it would certainly be interesting to model them at that scale for AI.

