TY  - RPRT
U1  - Research report
A1  - Abdelrahman, Ahmed Faisal
T1  - Incorporating Contextual Knowledge Into Human-Robot Collaborative Task Execution
N2  - An essential measure of autonomy in service robots designed to assist humans is adaptivity to the various contexts of human-oriented tasks. These robots may have to frequently execute the same action, but subject to subtle variations in task parameters that determine optimal behaviour. Such actions are traditionally executed by robots using pre-determined, generic motions, but a better approach could utilize robot arm maneuverability to learn and execute different trajectories that work best in each context.
In this project, we explore a robot skill acquisition procedure that allows a robot to incorporate contextual knowledge, adjust its executions according to context, and improve through experience, as a step towards more adaptive service robots. We propose an apprenticeship learning approach to achieving context-aware action generalisation on the task of robot-to-human object hand-over. The procedure combines learning from demonstration, with which a robot learns to imitate a demonstrator’s execution of the task, with a reinforcement learning strategy, which enables subsequent experiential learning of contextualized policies, guided by information about context that is integrated into the learning process. By extending the initial, static hand-over policy to a contextually adaptive one, the robot derives and executes variants of the demonstrated action that best suit the current context. We use dynamic movement primitives (DMPs) as compact motion representations, and a model-based Contextual Relative Entropy Policy Search (C-REPS) algorithm for learning policies that can specify hand-over position, trajectory shape, and execution speed, conditioned on context variables. Policies are learned in simulated task executions before being transferred to the robot, where the emergent behaviours are evaluated.
We demonstrate the algorithm’s ability to learn context-dependent hand-over positions and new trajectories when guided by suitable reward functions, and show that the current DMP implementation limits the learning of context-dependent execution speeds. We additionally conduct a user study in which participants assume different postures and receive an object from the robot, which executes hand-overs either by exclusively imitating a demonstrated motion, or by selecting hand-over positions based on learned contextual policies and adapting its motion accordingly. The results confirm the hypothesized improvements in the robot’s perceived behaviour when it is context-aware and adaptive, and provide useful insights that can inform future developments.
T3  - Technical Report / Hochschule Bonn-Rhein-Sieg University of Applied Sciences. Department of Computer Science - 01-2020 
KW  - Apprenticeship Learning
KW  - Learning and Adaptive Systems
KW  - Human-Centered Robotics
KW  - Domestic Robots
UN  - https://nbn-resolving.org/urn:nbn:de:hbz:1044-opus-48287
SN  - 1869-5272
SS  - 1869-5272
SN  - 978-3-96043-080-3
SB  - 978-3-96043-080-3
U6  - https://doi.org/10.18418/978-3-96043-080-3
DO  - https://doi.org/10.18418/978-3-96043-080-3
SP  - x, 166
S1  - x, 166
ER  -