Reinforcement Learning Adaptive Control and Explicit Criterion Maximization
1996 (English)Report (Other academic)
This paper reviews an existing algorithm for adaptive control based on explicit criterion maximization (ECM) and presents an extended version suited for reinforcement learning tasks. Furthermore, assumptions under which the algorithm convergences to a local maxima of a long term utility function are given. Such convergence theorems are very rare for reinforcement learning algorithms working with continuous state and action spaces. A number of similar algorithms, previously suggested to the reinforcement learning community, are briefly surveyed in order to give the presented algorithm a place in the field. The relations between the different algorithms is exemplified by checking their consistency on a simple problem of linear quadratic regulation (LQR).
Place, publisher, year, edition, pages
Linköping, Sweden: Linköping University, Department of Electrical Engineering , 1996. , 8 p.
LiTH-ISY-R, ISSN 1400-3902 ; 1829
lReinforcement learning, Adaptive control
Engineering and Technology
IdentifiersURN: urn:nbn:se:liu:diva-53328ISRN: LiTH-ISY-R-1829OAI: oai:DiVA.org:liu-53328DiVA: diva2:288584