liu.seSearch for publications in DiVA
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Enhancing Safety via Deep Reinforcement Learning in Trajectory Planning for Agile Flights in Unknown Environments
Univ Fed Sao Carlos, Brazil.
Univ Fed Sao Paulo, Brazil.
Linköping University, Department of Computer and Information Science, Artificial Intelligence and Integrated Computer Systems. Linköping University, Faculty of Science & Engineering.ORCID iD: 0000-0002-9595-2471
Linköping University, Department of Computer and Information Science, Artificial Intelligence and Integrated Computer Systems. Linköping University, Faculty of Science & Engineering.ORCID iD: 0000-0002-8546-4431
Show others and affiliations
2024 (English)In: 2024 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS 2024, IEEE , 2024, p. 3076-3083Conference paper, Published paper (Refereed)
Abstract [en]

Unmanned aerial vehicles (UAVs), known for their agile flight capabilities, require safe trajectory planning to achieve high-speed flights. This is necessary to swiftly evade obstacles and adapt trajectories under hard real-time constraints. These adjustments are essential to generate viable paths that prevent collisions while maintaining high speeds with minimal tracking errors. This paper addresses the challenge of enhancing the safety of agile trajectory planning. The proposed method combines a supervised learning approach, as teacher policy, with deep reinforcement learning (DRL), as student policy. Initially, we train the teacher policy using a path planning algorithm that prioritizes safety while minimizing jerk and flight time. Then, we use this policy to guide the learning of the student policy in various unknown environments. Testing in simulation demonstrates noteworthy advancements, including an 80% reduction in tracking error, a 31% decrease in flight time, a 19% increase in high-speed duration, and a success rate improvement from 50% to 100%, as compared to baseline methods.

Place, publisher, year, edition, pages
IEEE , 2024. p. 3076-3083
Series
IEEE International Conference on Intelligent Robots and Systems, ISSN 2153-0858, E-ISSN 2153-0866
National Category
Robotics and automation
Identifiers
URN: urn:nbn:se:liu:diva-212760DOI: 10.1109/IROS58592.2024.10801910ISI: 001411890000326Scopus ID: 2-s2.0-85216502840ISBN: 9798350377712 (print)ISBN: 9798350377705 (electronic)OAI: oai:DiVA.org:liu-212760DiVA, id: diva2:1949246
Conference
2024 International Conference on Intelligent Robots and Systems, Abu Dhabi, U ARAB EMIRATES, oct 14-18, 2024
Note

Funding Agencies|Federal Agency for Support and Evaluation of Graduate Education (CAPES); National Council for Scientific and Technological Development (CNPQ); Swedish-Brazilian Research and Innovation Center (CISB) [200056/2022-0, 200051/2022-9]

Available from: 2025-04-02 Created: 2025-04-02 Last updated: 2025-04-02

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Search in DiVA

By author/editor
Heintz, FredrikTiger, Mattias
By organisation
Artificial Intelligence and Integrated Computer SystemsFaculty of Science & Engineering
Robotics and automation

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 73 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf