Planning for Human-Robot Interaction Using Time-State Aggregated POMDPs

15 years 5 months ago

Download www.cs.cmu.edu

In order to interact successfully in social situations, a robot must be able to observe others' actions and base its own behavior on its beliefs about their intentions. Many interactions take place in dynamic environments, and the outcomes of people's or the robot's actions may be time-dependent. In this paper, such interactions are modeled as a POMDP with a time index as part of the state, resulting in a fully Markov model with a potentially very large state space. The complexity of finding even an approximate solution often limits POMDP's practical applicability for large problems. This difficulty is addressed through the development of an algorithm for aggregating states in POMDPs with a time-indexed state space. States that represent the same physical configuration of the environment at different times are chosen to be combined using reward-based metrics, preserving the structure of the original model while producing a smaller model that is faster to solve. We ...

Frank Broz, Illah R. Nourbakhsh, Reid G. Simmons

Real-time Traffic

AAAI 2008 | Intelligent Agents | Original Model | POMDP's Practical Applicability | State Space |

claim paper

Post Info
More Details (n/a)

Added	02 Oct 2010
Updated	02 Oct 2010
Type	Conference
Year	2008
Where	AAAI
Authors	Frank Broz, Illah R. Nourbakhsh, Reid G. Simmons

Comments (0)

Sciweavers

Planning for Human-Robot Interaction Using Time-State Aggregated POMDPs

AAAI 2008 | Intelligent Agents | Original Model | POMDP's Practical Applicability | State Space |

Explore & Download

Productivity Tools

Sciweavers