liu.seSearch for publications in DiVA
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Adaptive Supervision Online Learning for Vision Based Autonomous Systems
Linköping University, Department of Electrical Engineering, Computer Vision. Linköping University, Faculty of Science & Engineering.
2016 (English)Doctoral thesis, monograph (Other academic)
Abstract [en]

Driver assistance systems in modern cars now show clear steps towards autonomous driving and improvements are presented in a steady pace. The total number of sensors has also decreased from the vehicles of the initial DARPA challenge, more resembling a pile of sensors with a car underneath. Still, anyone driving a tele-operated toy using a video link is a demonstration that a single camera provides enough information about the surronding world.  

Most lane assist systems are developed for highway use and depend on visible lane markers. However, lane markers may not be visible due to snow or wear, and there are roads without lane markers. With a slightly different approach, autonomous road following can be obtained on almost any kind of road. Using realtime online machine learning, a human driver can demonstrate driving on a road type unknown to the system and after some training, the system can seamlessly take over. The demonstrator system presented in this work has shown capability of learning to follow different types of roads as well as learning to follow a person. The system is based solely on vision, mapping camera images directly to control signals.  

Such systems need the ability to handle multiple-hypothesis outputs as there may be several plausible options in similar situations. If there is an obstacle in the middle of the road, the obstacle can be avoided by going on either side. However the average action, going straight ahead, is not a viable option. Similarly, at an intersection, the system should follow one road, not the average of all roads.  

To this end, an online machine learning framework is presented where inputs and outputs are represented using the channel representation. The learning system is structurally simple and computationally light, based on neuropsychological ideas presented by Donald Hebb over 60 years ago. Nonetheless the system has shown a cabability to learn advanced tasks. Furthermore, the structure of the system permits a statistical interpretation where a non-parametric representation of the joint distribution of input and output is generated. Prediction generates the conditional distribution of the output, given the input.  

The statistical interpretation motivates the introduction of priors. In cases with multiple options, such as at intersections, a prior can select one mode in the multimodal distribution of possible actions. In addition to the ability to learn from demonstration, a possibility for immediate reinforcement feedback is presented. This allows for a system where the teacher can choose the most appropriate way of training the system, at any time and at her own discretion.  

The theoretical contributions include a deeper analysis of the channel representation. A geometrical analysis illustrates the cause of decoding bias commonly present in neurologically inspired representations, and measures to counteract it. Confidence values are analyzed and interpreted as evidence and coherence. Further, the use of the truncated cosine basis function is motivated.  

Finally, a selection of applications is presented, such as autonomous road following by online learning and head pose estimation. A method founded on the same basic principles is used for visual tracking, where the probabilistic representation of target pixel values allows for changes in target appearance.

Place, publisher, year, edition, pages
Linköping: Linköping University Electronic Press, 2016. , 176 p.
Series
Linköping Studies in Science and Technology. Dissertations, ISSN 0345-7524 ; 1749
National Category
Computer Vision and Robotics (Autonomous Systems)
Identifiers
URN: urn:nbn:se:liu:diva-125916DOI: 10.3384/diss.diva-125916ISBN: 978-91-7685-815-8 (print)OAI: oai:DiVA.org:liu-125916DiVA: diva2:916645
Public defence
2016-05-20, Visionen, B-building, Campus Valla, Linköping, 09:00 (English)
Opponent
Supervisors
Funder
EU, FP7, Seventh Framework ProgrammeSwedish Research Council
Available from: 2016-04-19 Created: 2016-03-08 Last updated: 2016-05-16Bibliographically approved

Open Access in DiVA

fulltext(5398 kB)472 downloads
File information
File name FULLTEXT02.pdfFile size 5398 kBChecksum SHA-512
50bbd88b0dc56e8820cb50c9a8a25e466217b795fa543b1d61c1a779c3505f7761c978d3f2011dbffc45503c9854cd8bf23837e1d219e494e2d9d11c34314b7a
Type fulltextMimetype application/pdf
omslag(361 kB)46 downloads
File information
File name COVER01.pdfFile size 361 kBChecksum SHA-512
5af1613f62e4591b02904c7365896feaa55905da035f67fb1c49a7919e46e149498915ab43292a95ffe650b46ade3eb44f88f1901c9477e872dbf2e6bac6e5d3
Type coverMimetype application/pdf
Supplementary files with videos(62 kB)23 downloads
File information
File name ATTACHMENT03.pdfFile size 62 kBChecksum SHA-512
1eed3c69434a9e2694fd01e95b83ed0cb495866b0f54ef2daad7533a7cdba5411e2534286560898cea5856695c4ab636fa45d19691de5f8a46aa5757dadcdda7
Type attachmentMimetype application/pdf
Channel vector curves, Four channels, 3D space(5573 kB)77 downloads
File information
File name MOVIE01.mp4File size 5573 kBChecksum SHA-512
ddb6bd937d01e46d234c344753ba0e481d038440a251ddc4d6e04d73827b163f04e6140fe5d0c8a48e5676ec2e8a2d0cb3e8f518126edff0722971c6f48a4087
Type movieMimetype video/mp4
Channel vector curves, Five channels, 4D space(3862 kB)39 downloads
File information
File name MOVIE02.mp4File size 3862 kBChecksum SHA-512
ac83d61c0fbfdda0059b5dd116027a7d7efa4f3a80706c46084171f26a78ba12aaf1de986b116d395e266bcf471dc5ce72c3ad437858eb444dfd1535aad313f9
Type movieMimetype video/mp4
Channel vector curves, Seven channels, 6D space(6091 kB)33 downloads
File information
File name MOVIE03.mp4File size 6091 kBChecksum SHA-512
59715ae071d2c0fc0c71519f5c5f020f831f25f912903b5b6226d2920a1bc023e234589f00147fc05e38f3bd4cb4f8cdb03db802527f8f2679eb7484c4379604
Type movieMimetype video/mp4
Cone, Three channels, 3D space(11233 kB)29 downloads
File information
File name MOVIE04.mp4File size 11233 kBChecksum SHA-512
ae4aafb4563b5cb079c8e526c274682306db669fea421b38b1052448bb848d876933224eecbd0000605feebceea6b3d095737d7e8c968e4728c3a4758e257b66
Type movieMimetype video/mp4
Cone, Four channels, 4D space(21066 kB)41 downloads
File information
File name MOVIE05.mp4File size 21066 kBChecksum SHA-512
5339c565420dd8109986651711362a1063a56ea774e120cb0044c1cb0936888c8a590ceaedfde1c8c0987edf1f59b7d95db093849e7570534833491d1d5212c2
Type movieMimetype video/mp4
Cone, Seven channels, 7D space(37349 kB)37 downloads
File information
File name MOVIE06.mp4File size 37349 kBChecksum SHA-512
37e0ab435981553023739a4e494ab27e6c7a59eb4b44ed915838e8c21a91b1cea331c8e7f6c61f77d1e79e466acc3d3072ee1849e348cff44c03d39303b1304e
Type movieMimetype video/mp4
Associative learning illustration(2842 kB)31 downloads
File information
File name MOVIE07.mp4File size 2842 kBChecksum SHA-512
857a3df7676ccd43674592f5937d50bb95e85f0980ddb98809c58c931857e948d9f85bbc26286efeab7e31ca66c1b750531583502936ea9f62fcfc6f56d85375
Type movieMimetype video/mp4
Decoding of five pixels in a sequence(26451 kB)35 downloads
File information
File name MOVIE17.mp4File size 26451 kBChecksum SHA-512
2b1a6a5a6eee7e4ded10dddb4fa736c01b35dbecfbb12af490ebe2dad0f4a3f469a4b34133dcc8cf2aca10d9c15aec93897cae9a427faccdc80b940fe8d362b4
Type movieMimetype video/mp4
Sequence with translating cameraman image(742 kB)23 downloads
File information
File name MOVIE14.mp4File size 742 kBChecksum SHA-512
e326c2316ef99e50f5af6d6a565741078f4c9e40bc2075210e2bca90d06ec7cc6f43f618704b18b496d5fbb5bb264cc28728e9e5a714c1c685fe92caaa46aecd
Type movieMimetype video/mp4
Video from UAV(8388 kB)36 downloads
File information
File name MOVIE15.mp4File size 8388 kBChecksum SHA-512
f77d3b93b71089dcee36a0e1b76b545b1b0d5ada2e94400fc60bb6bab7ef08e4061d67c59512260902c28a08ba53737c168a8f60932cb633d650d6ee7c67c508
Type movieMimetype video/mp4
Original video from the UAV(16423 kB)31 downloads
File information
File name MOVIE16.mp4File size 16423 kBChecksum SHA-512
7bc1c4b91b77b162c9cde2a393626824cd2a988c6a103b7a3210901bf0093262486497eb9a88c4fd41c7d9152cef34878b9a10ab2fd5d53c4fe1e390e13a82d0
Type movieMimetype video/mp4
Autonomous Road Following Application, Use case demo(270078 kB)50 downloads
File information
File name MOVIE12.mp4File size 270078 kBChecksum SHA-512
35fb150aa7eeafa6238ef917b4207cd437ae106cfadd63c9068650ddbe94abbc0d368a2e8133b7483e01e69c005992d3ee30f596e15bb4a9aea489e87d5e63a2
Type movieMimetype video/mp4
Autonomous Road Following Application, Demonstrator system(159031 kB)100 downloads
File information
File name MOVIE13.mp4File size 159031 kBChecksum SHA-512
e738f3ee1e2f102da7db238a4875052b8e1676b7b85ba0a4cadb75437e39b397b8126d1a46c0f12b02f838379fc535af3945152fdfcbe62e720eab616841df29
Type movieMimetype video/mp4

Other links

Publisher's full text

Authority records BETA

Öfjäll, Kristoffer

Search in DiVA

By author/editor
Öfjäll, Kristoffer
By organisation
Computer VisionFaculty of Science & Engineering
Computer Vision and Robotics (Autonomous Systems)

Search outside of DiVA

GoogleGoogle Scholar
Total: 478 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 5381 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf