Planning for Human-robot Interaction: Representing Time and Human Intention

ProQuest, 2008 - 172 pages
0 Avis
The approach to human-robot social interaction taken in this thesis focuses on creating more accurate models of social tasks for planning. Because the human participants are modeled as a part of the environment, the world state in these problems is dynamic and partially observable. Human intention is represented as hidden state in a partially observable Markov decision process (POMDP), and the time-dependence of action outcomes are explicitly modeled. A model structure designed by a human expert is combined with human task performance data. The resulting models are large and complex. State aggregation over the time dimension of the state space is used to trade off between the accuracy of the representation and its size in order to find sufficiently expressive models that can also be solved tractably. The utility of this approach is demonstrated by implementing a controller for a mobile robot that rides elevators with people and an agent in a driving simulator that performs the Pittsburgh left with human drivers. Performance is evaluated by comparing the policies obtained using the proposed modeling technique to policies developed using less expressive representations. In an interactions with human participants, the policies for time-dependent POMDP models with human intention as hidden state outperform the other policies, achieving both higher rewards and more positive evaluations for naturalness and social propriety of behavior.

Avis des internautes - Rédiger un commentaire

Aucun commentaire n'a été trouvé aux emplacements habituels.

Table des matières

Related Work
Technical Approach
The Pittsburgh Left Driving Simulation Task
Model Design
TimeState Aggregation for POMDPs
Human Subject Experiment
APPENDIX A Domain Descriptions
Droits d'auteur

Expressions et termes fréquents

Informations bibliographiques