conference

Reducing Sample Complexity in Reinforcement Learning by Transferring Transition and Reward Probabilities.

Kouta Oguni, Kazuyuki Narisawa, Ayumi Shinohara

ICAART 2014 - Proceedings of the 6th International Conference on Agents and Artificial Intelligence, Volume 1, ESEO, Angers, Loire Valley, France, 6-8 March, 2014, , pp.632-638 (2014), [peer-reviewed]
Event Date: March 6-8, 2014

DOI External Link

Abstract / 概要

Most existing reinforcement learning algorithms require many trials until they obtain optimal policies. In this study, we apply transfer learning to reinforcement learning to realize greater efficiency. We propose a new algorithm called TR-MAX, based on the R-MAX algorithm. TR-MAX transfers the transition and reward probabilities from a source task to a target task as prior knowledge. We theoretically analyze the sample complexity of TR-MAX. Moreover, we show that TR-MAX performs much better in practice than R-MAX in maze tasks.

DOI: 10.5220/0004915606320638