liu.seSearch for publications in DiVA
Change search
ReferencesLink to record
Permanent link

Direct link
Self Organising Maps for Value Estimation to Solve Reinforcement Learning Tasks
2000 (English)In: Proc. of the 2nd International Conference on Enterprise Information Systems (ICEIS 2000), 2000, 74-83Conference paper (Refereed)
Abstract [en]

Reinforcement learning has been applied recently more and more for the optimisation of agent behaviours. This approach became popular due to its adaptive and unsupervised learning process. One of the key ideas of this approach is to estimate the value of agent states. For huge state spaces however, it is difficult to implement this approach. As a result, various models were proposed which make use of function approximators, such as neural networks, to solve this problem. This paper focuses on an implementation of value estimation with a particular class of neural networks, known as self organizing maps. Experiments with an agent moving in a gridworld and the autonomous robot Khepera have been carried out to show the benefit of our approach. The results clearly show that the conventional approach, done by an implementation of a look-up table to represent the value function, can be out performed in terms of memory usage and convergence speed.

National Category
Computer Systems
Identifiers
urn:nbn:se:liu:diva-72563 (URN)oai:DiVA.org:liu-72563 (OAI)diva2:460024 (DiVA)
Available from2011-11-28 Created:2011-11-28 Last updated:2011-12-06

Open Access in DiVA

fulltext(401 kB)49 downloads
File information
File name FULLTEXT01.pdfFile size 401 kBChecksum SHA-512
13205cf21b28f66031627dd1c34b45aa215cec64210319ccb5e266918cfc7300894f3dbde5e04fbb4d9de7141a7a5240adc14db39fb04b00c4e28ef18c9d558f
Type fulltextMimetype application/pdf

Search in DiVA

By author/editor
Kleiner, AlexanderSharp, BernadetteBittel, Oliver
Computer Systems

Search outside of DiVA

GoogleGoogle Scholar
Total: 49 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 14 hits
ReferencesLink to record
Permanent link

Direct link