After receiving a natural language voice prompt, the robot visually assesses its environment and then performs the task. Figure offers examples like, “Hand the bag of cookies to the robot on ...