True Online Temporal-Difference Learning

Published by Harm van Seijen in 2015 in Informatics Engineering. Language: English.

Abstract in English

The temporal-difference methods TD($\lambda$) and Sarsa($\lambda$) form a core part of modern reinforcement learning. Their appeal comes from their good performance, low computational cost, and their simple interpretation, given by their forward view. Recently, n
