liu.seSearch for publications in DiVA
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Using Reinforcement Learning for Model-free Linear Quadratic Control with Process and Measurement Noises
Linköping University, Department of Electrical Engineering, Automatic Control. Linköping University, Faculty of Science & Engineering.ORCID iD: 0000-0002-6665-5881
Linköping University, Department of Electrical Engineering, Automatic Control. Linköping University, Faculty of Science & Engineering.
2019 (English)In: 2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), IEEE , 2019, p. 6510-6517Conference paper, Published paper (Refereed)
Abstract [en]

In this paper, we analyze a Linear Quadratic (LQ) control problem in terms of the average cost and the structure of the value function. We develop a completely model-free reinforcement learning algorithm to solve the LQ problem. Our algorithm is an off-policy routine where each policy is greedy with respect to all previous value functions. We prove that the algorithm produces stable policies given that the estimation errors remain small. Empirically, our algorithm outperforms the classical Q and off-policy learning routines.

Place, publisher, year, edition, pages
IEEE , 2019. p. 6510-6517
Series
IEEE Conference on Decision and Control, ISSN 0743-1546
National Category
Control Engineering
Identifiers
URN: urn:nbn:se:liu:diva-169303DOI: 10.1109/CDC40024.2019.9029904ISI: 000560779005155ISBN: 978-1-7281-1398-2 (electronic)ISBN: 978-1-7281-1399-9 (print)OAI: oai:DiVA.org:liu-169303DiVA, id: diva2:1466580
Conference
58th IEEE Conference on Decision and Control (CDC), Nice, FRANCE, dec 11-13, 2019
Note

Funding Agencies|Vinnova Competence Center LINK-SIC; Wallenberg Artificial Intelligence, Autonomous Systems and Software Program (WASP)

Available from: 2020-09-12 Created: 2020-09-12 Last updated: 2021-04-20

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full text

Search in DiVA

By author/editor
Adib Yaghmaie, FarnazGustafsson, Fredrik
By organisation
Automatic ControlFaculty of Science & Engineering
Control Engineering

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 724 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf