A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction

Zehfroosh, Ashkan; Tanner, Herbert G.

A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction

Author(s)	Zehfroosh, Ashkan
Author(s)	Tanner, Herbert G.
Date Accessioned	2022-06-09T18:45:23Z
Date Available	2022-06-09T18:45:23Z
Publication Date	2022-03-09
Description	This article was originally published in Frontiers in Robotics and AI. The version of record is available at: https://doi.org/10.3389/frobt.2022.797213	en_US
Abstract	This paper offers a new hybrid probably approximately correct (PAC) reinforcement learning (RL) algorithm for Markov decision processes (MDPs) that intelligently maintains favorable features of both model-based and model-free methodologies. The designed algorithm, referred to as the Dyna-Delayed Q-learning (DDQ) algorithm, combines model-free Delayed Q-learning and model-based R-max algorithms while outperforming both in most cases. The paper includes a PAC analysis of the DDQ algorithm and a derivation of its sample complexity. Numerical results are provided to support the claim regarding the new algorithm’s sample efficiency compared to its parents as well as the best known PAC model-free and model-based algorithms in application. A real-world experimental implementation of DDQ in the context of pediatric motor rehabilitation facilitated by infant-robot interaction highlights the potential benefits of the reported method.	en_US
Sponsor	This work has been supported by NIH R01HD87133-01 and NSF 2014264 to BT.	en_US
Citation	Zehfroosh, Ashkan, and Herbert G. Tanner. 2022. “A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction.” Frontiers in Robotics and AI 9 (March): 797213. https://doi.org/10.3389/frobt.2022.797213.	en_US
ISSN	2296-9144
URL	https://udspace.udel.edu/handle/19716/30973
Language	en_US	en_US
Publisher	Frontiers in Robotics and AI	en_US
Keywords	reinforcement learning	en_US
Keywords	probably approximately correct	en_US
Keywords	markov decision process	en_US
Keywords	human-robot interaction	en_US
Keywords	sample complexity	en_US
Title	A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction	en_US
Type	Article	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: A Hybrid PAC Reinforcement Learning.pdf
Size:: 1.98 MB
Format:: Adobe Portable Document Format
Description:: Main article

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 2.22 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Open Access Publications