Reinforcement Learning of Locomotion based on Central Pattern Generators
2014 (English)Doctoral thesis, comprehensive summary (Other academic)
Locomotion learning for robotics is an interesting and challenging area in which the movement capabilities of animals have been deeply investigated and acquired knowledge has been transferred into modelling locomotion on robots. What modellers are required to understand is what structure can represent locomotor systems in different animals and how such animals develop various and dexterous locomotion capabilities. Notwithstanding the depth of research in the area, modelling locomotion requires a deep rethinking.
In this thesis, based on the umbrella of embodied cognition, a neural-body-environment interaction is emphasised and regarded as the solution to locomotion learning/development. Central pattern generators (CPGs) are introduced in the first part (Chapter 2) to generally interpret the mechanism of locomotor systems in animals. With a deep investigation on the structure of CPGs and inspiration from human infant development, a layered CPG architecture with baseline motion generation and dynamics adaptation interfaces are proposed. In the second part, reinforcement learning (RL) is elucidated as a good method for dealing with locomotion learning from the perspectives of psychology, neuroscience and robotics (Chapter 4). Several continuous-space RL techniques (e.g. episodic natural actor critic, policy learning by weighting explorations with returns, continuous action space learning automaton are introduced for practical use (Chapter 3). With the knowledge of CPGs and RL, the architecture and concept of CPG-Actor-Critic is constructed. Finally, experimental work based on published papers is highlighted in a path of my PhD research (Chapter 5). This includes the implementation of CPGs and the learning on the NAO robot for crawling and walking. The implementation is also extended to test the generalizability to different morphologies (the ghostdog robot). The contribution of this thesis is discussed from two angles: the investigation of the CPG architecture and the implementation (Chapter 6).
Place, publisher, year, edition, pages
Linköping: Linköping University Electronic Press, 2014. , 71 p.
Linköping Studies in Science and Technology. Dissertations, ISSN 0345-7524 ; 1602
IdentifiersURN: urn:nbn:se:liu:diva-105884ISBN: 978-91-7519-313-7 (print)OAI: oai:DiVA.org:liu-105884DiVA: diva2:712601
2014-06-04, G110, hus G, Högskolan i Skövde, Skövde, 12:30 (English)
Ekeberg, Örjan, Professor
Ziemke, Tom, ProfessorLowe, Robert, Dr.
List of papers