liu.seSearch for publications in DiVA
Change search
ReferencesLink to record
Permanent link

Direct link
Self Organising Maps for Value Estimation to Solve Reinforcement Learning Tasks
Staffordshire University.
Staffordshire University.
University of Applied Sciences, Konstanz.
2000 (English)In: Proc. of the 2nd International Conference on Enterprise Information Systems (ICEIS 2000), 2000, 74-83 p.Conference paper (Refereed)
Abstract [en]

Reinforcement learning has been applied recently more and more for the optimisation of agent behaviours. This approach became popular due to its adaptive and unsupervised learning process. One of the key ideas of this approach is to estimate the value of agent states. For huge state spaces however, it is difficult to implement this approach. As a result, various models were proposed which make use of function approximators, such as neural networks, to solve this problem. This paper focuses on an implementation of value estimation with a particular class of neural networks, known as self organizing maps. Experiments with an agent moving in a gridworld and the autonomous robot Khepera have been carried out to show the benefit of our approach. The results clearly show that the conventional approach, done by an implementation of a look-up table to represent the value function, can be out performed in terms of memory usage and convergence speed.

Place, publisher, year, edition, pages
2000. 74-83 p.
National Category
Computer Systems
URN: urn:nbn:se:liu:diva-72563OAI: diva2:460024
Available from: 2011-11-28 Created: 2011-11-28 Last updated: 2011-12-06

Open Access in DiVA

fulltext(401 kB)74 downloads
File information
File name FULLTEXT01.pdfFile size 401 kBChecksum SHA-512
Type fulltextMimetype application/pdf

Search in DiVA

By author/editor
Kleiner, Alexander
Computer Systems

Search outside of DiVA

GoogleGoogle Scholar
Total: 74 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 109 hits
ReferencesLink to record
Permanent link

Direct link