liu.seSök publikationer i DiVA
Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Shape Based Recognition – Cognitive Vision Systems in Traffic Safety Applications
Linköpings universitet, Institutionen för systemteknik, Datorseende. Linköpings universitet, Tekniska högskolan.
2011 (Engelska)Doktorsavhandling, sammanläggning (Övrigt vetenskapligt)
Abstract [en]

Traffic accidents are globally the number one cause of death for people 15-29 years old and is among the top three causes for all age groups 5-44 years. Much of the work within this thesis has been carried out in projects aiming for (cognitive) driver assistance systems and hopefully represents a step towards improving traffic safety.

The main contributions are within the area of Computer Vision, and more specifically, within the areas of shape matching, Bayesian tracking, and visual servoing with the main focus being on shape matching and applications thereof. The different methods have been demonstrated in traffic safety applications, such as  bicycle tracking, car tracking, and traffic sign recognition, as well as for pose estimation and robot control.

One of the core contributions is a new method for recognizing closed contours, based on complex correlation of Fourier descriptors. It is shown that keeping the phase of Fourier descriptors is important. Neglecting the phase can result in perfect matches between intrinsically different shapes. Another benefit of keeping the phase is that rotation covariant or invariant matching is achieved in the same way. The only difference is to either consider the magnitude, for rotation invariant matching, or just the real value, for rotation covariant matching, of the complex valued correlation.

The shape matching method has further been used in combination with an implicit star-shaped object model for traffic sign recognition. The presented method works fully automatically on query images with no need for regions-of-interests. It is shown that the presented method performs well for traffic signs that contain multiple distinct contours, while some improvement still is needed for signs defined by a single contour. The presented methodology is general enough to be used for arbitrary objects, as long as they can be defined by a number of regions.

Another contribution has been the extension of a framework for learning based Bayesian tracking called channel based tracking. Compared to earlier work, the multi-dimensional case has been reformulated in a sound probabilistic way and the learning algorithm itself has been extended. The framework is evaluated in car tracking scenarios and is shown to give competitive tracking performance, compared to standard approaches, but with the advantage of being fully learnable.

The last contribution has been in the field of (cognitive) robot control. The presented method achieves sufficient accuracy for simple assembly tasks by combining autonomous recognition with visual servoing, based on a learned mapping between percepts and actions. The method demonstrates that limitations of inexpensive hardware, such as web cameras and low-cost robotic arms, can be overcome using powerful algorithms.

All in all, the methods developed and presented in this thesis can all be used for different components in a system guided by visual information, and hopefully represents a step towards improving traffic safety.

Ort, förlag, år, upplaga, sidor
Linköping: Linköping University Electronic Press , 2011. , s. 49
Serie
Linköping Studies in Science and Technology. Dissertations, ISSN 0345-7524 ; 1395
Nationell ämneskategori
Datorseende och robotik (autonoma system)
Identifikatorer
URN: urn:nbn:se:liu:diva-71664ISBN: 978-91-7393-074-1 (tryckt)OAI: oai:DiVA.org:liu-71664DiVA, id: diva2:452207
Disputation
2011-11-25, Vallfarten, hus Vallfarten, Campus Valla, Linköpings universitet, Linköping, 09:15 (Engelska)
Opponent
Handledare
Tillgänglig från: 2011-10-28 Skapad: 2011-10-28 Senast uppdaterad: 2018-01-12Bibliografiskt granskad
Delarbeten
1. Torchlight Navigation
Öppna denna publikation i ny flik eller fönster >>Torchlight Navigation
Visa övriga...
2010 (Engelska)Ingår i: Proceedings of the 20th International Conferenceon Pattern Recognition, 2010, s. 302-306Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

A common computer vision task is navigation and mapping. Many indoor navigation tasks require depth knowledge of flat, unstructured surfaces (walls, floor, ceiling). With passive illumination only, this is an ill-posed problem. Inspired by small children using a torchlight, we use a spotlight for active illumination. Using our torchlight approach, depth and orientation estimation of unstructured, flat surfaces boils down to estimation of ellipse parameters. The extraction of ellipses is very robust and requires little computational effort.

Serie
International Conference on Pattern Recognition, ISSN 1051-4651
Nyckelord
Torchlight, Pose estimation, Active illumination, Plane estimation, Ellipses
Nationell ämneskategori
Reglerteknik
Identifikatorer
urn:nbn:se:liu:diva-60597 (URN)10.1109/ICPR.2010.83 (DOI)978-1-4244-7542-1 (ISBN)978-0-7695-4109-9 (ISBN)
Konferens
20th International Conference on Pattern Recognition, Istanbul, Turkey, 23-26 August, 2010
Projekt
DIPLECSGARNICSELLIITCADICS
Forskningsfinansiär
Stiftelsen för strategisk forskning (SSF)
Tillgänglig från: 2010-10-20 Skapad: 2010-10-20 Senast uppdaterad: 2016-05-04Bibliografiskt granskad
2. Bicycle Tracking Using Ellipse Extraction
Öppna denna publikation i ny flik eller fönster >>Bicycle Tracking Using Ellipse Extraction
Visa övriga...
2011 (Engelska)Ingår i: Proceedings of the 14thInternational Conference on Information Fusion, 2011, IEEE , 2011, s. 1-8Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

A new approach to track bicycles from imagery sensor data is proposed. It is based on detecting ellipsoids in the images, and treat these pair-wise using a dynamic bicycle model. One important application area is in automotive collision avoidance systems, where no dedicated systems for bicyclists yet exist and where very few theoretical studies have been published.

Possible conflicts can be predicted from the position and velocity state in the model, but also from the steering wheel articulation and roll angle that indicate yaw changes before the velocity vector changes. An algorithm is proposed which consists of an ellipsoid detection and estimation algorithm and a particle filter.

A simulation study of three critical single target scenarios is presented, and the algorithm is shown to produce excellent state estimates. An experiment using a stationary camera and the particle filter for state estimation is performed and has shown encouraging results.

Ort, förlag, år, upplaga, sidor
IEEE, 2011
Nyckelord
Tracking, Particle Filter, Computer Vision, Ellipse Extraction, Bicycle
Nationell ämneskategori
Teknik och teknologier
Identifikatorer
urn:nbn:se:liu:diva-69672 (URN)978-1-4577-0267-9 (ISBN)
Konferens
The 14th International Conference on Information Fusion, 5-8 July 2011, Chicago, IL, USA
Tillgänglig från: 2011-07-13 Skapad: 2011-07-13 Senast uppdaterad: 2016-05-04Bibliografiskt granskad
3. Correlating Fourier descriptors of local patches for road sign recognition
Öppna denna publikation i ny flik eller fönster >>Correlating Fourier descriptors of local patches for road sign recognition
2011 (Engelska)Ingår i: IET Computer Vision, ISSN 1751-9632, E-ISSN 1751-9640, Vol. 5, nr 4, s. 244-254Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

The Fourier descriptors (FDs) is a classical but still popular method for contour matching. The key idea is to apply the Fourier transform to a periodic representation of the contour, which results in a shape descriptor in the frequency domain. FDs are most commonly used to compare object silhouettes and object contours; the authors instead use this well-established machinery to describe local regions to be used in an object-recognition framework. Many approaches to matching FDs are based on the magnitude of each FD component, thus ignoring the information contained in the phase. Keeping the phase information requires us to take into account the global rotation of the contour and shifting of the contour samples. The authors show that the sum-of-squared differences of FDs can be computed without explicitly de-rotating the contours. The authors compare correlation-based matching against affine-invariant Fourier descriptors (AFDs) and WARP-matched FDs and demonstrate that correlation-based approach outperforms AFDs and WARP on real data. As a practical application the authors demonstrate the proposed correlation-based matching on a road sign recognition task.

Ort, förlag, år, upplaga, sidor
IET, 2011
Nationell ämneskategori
Teknik och teknologier
Identifikatorer
urn:nbn:se:liu:diva-65621 (URN)10.1049/iet-cvi.2010.0040 (DOI)000291385900007 ()
Projekt
DIPLECS, GARNICS, ELLIIT
Anmärkning
This paper is a postprint of a paper submitted to and accepted for publication in IET Computer Vision and is subject to Institution of Engineering and Technology Copyright. The copy of record is available at IET Digital Library Fredrik Larsson, Michael Felsberg and Per-Erik Forssen, Correlating Fourier descriptors of local patches for road sign recognition, 2011, IET Computer Vision, (5), 4, 244-254. http://dx.doi.org/10.1049/iet-cvi.2010.0040 Copyright: Iet http://www.theiet.org/ Tillgänglig från: 2011-02-14 Skapad: 2011-02-14 Senast uppdaterad: 2017-12-11Bibliografiskt granskad
4. Using Fourier Descriptors and Spatial Models for Traffic Sign Recognition
Öppna denna publikation i ny flik eller fönster >>Using Fourier Descriptors and Spatial Models for Traffic Sign Recognition
2011 (Engelska)Ingår i: Image Analysis: 17th Scandinavian Conference, SCIA 2011, Ystad, Sweden, May 2011. Proceedings / [ed] Anders Heyden, Fredrik Kahl, Springer Berlin/Heidelberg, 2011, s. 238-249Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

Traffic sign recognition is important for the development of driver assistance systems and fully autonomous vehicles. Even though GPS navigator systems works well for most of the time, there will always be situations when they fail. In these cases, robust vision based systems are required. Traffic signs are designed to have distinct colored fields separated by sharp boundaries. We propose to use locally segmented contours combined with an implicit star-shaped object model as prototypes for the different sign classes. The contours are described by Fourier descriptors. Matching of a query image to the sign prototype database is done by exhaustive search. This is done efficiently by using the correlation based matching scheme for Fourier descriptors and a fast cascaded matching scheme for enforcing the spatial requirements. We demonstrated on a publicly available database state of the art performance.

Ort, förlag, år, upplaga, sidor
Springer Berlin/Heidelberg, 2011
Serie
Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349 ; 6688
Nyckelord
Traffic sign recognition – Fourier descriptors – spatial models – traffic sign dataset
Nationell ämneskategori
Datavetenskap (datalogi)
Identifikatorer
urn:nbn:se:liu:diva-69521 (URN)10.1007/978-3-642-21227-7_23 (DOI)000308543900023 ()978-3-642-21226-0 (ISBN)978-3-642-21227-7 (ISBN)
Konferens
17th Scandinavian Conference on Image Analysis (SCIA), Ystad, Sweden, May 23-27, 2011
Anmärkning

Original Publication: Fredrik Larsson and Michael Felsberg, Using Fourier Descriptors and Spatial Models for Traffic Sign Recognition, SCIA konferens, 23-27 May 2011, Ystad Sweden, 2011, Lecture Notes in Computer Science, Image Analysis, 238-249. http://dx.doi.org/10.1007/978-3-642-21227-7_23 Copyright: Springer

Tillgänglig från: 2011-06-30 Skapad: 2011-06-30 Senast uppdaterad: 2018-01-30Bibliografiskt granskad
5. Learning Higher-Order Markov Models for ObjectTracking in Image Sequences
Öppna denna publikation i ny flik eller fönster >>Learning Higher-Order Markov Models for ObjectTracking in Image Sequences
2009 (Engelska)Ingår i: Proceedings of the 5th International Symposium on Advances in Visual Computing: Part II, Berlin, Heidelberg: Springer-Verlag , 2009, s. 184-195Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

This work presents a novel object tracking approach, where the motion model is learned from sets of frame-wise detections with unknown associations. We employ a higher-order Markov model on position space instead of a first-order Markov model on a high-dimensional state-space of object dynamics. Compared to the latter, our approach allows the use of marginal rather than joint distributions, which results in a significant reduction of computation complexity. Densities are represented using a grid-based approach, where the rectangular windows are replaced with estimated smooth Parzen windows sampled at the grid points. This method performs as accurately as particle filter methods with the additional advantage that the prediction and update steps can be learned from empirical data. Our method is compared against standard techniques on image sequences obtained from an RC car following scenario. We show that our approach performs best in most of the sequences. Other potential applications are surveillance from cheap or uncalibrated cameras and image sequence analysis.

Ort, förlag, år, upplaga, sidor
Berlin, Heidelberg: Springer-Verlag, 2009
Serie
Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349 ; 5876
Nationell ämneskategori
Teknik och teknologier
Identifikatorer
urn:nbn:se:liu:diva-50495 (URN)10.1007/978-3-642-10520-3_17 (DOI)000279247100017 ()978-3-642-10519-7 (ISBN)
Konferens
The 5th International Symposium on Advances in Visual Computing (ISVC), Las Vegas, USA, December
Projekt
DIPLECS
Tillgänglig från: 2009-10-12 Skapad: 2009-10-12 Senast uppdaterad: 2018-01-31Bibliografiskt granskad
6. Simultaneously learning to recognize and control a low-cost robotic arm
Öppna denna publikation i ny flik eller fönster >>Simultaneously learning to recognize and control a low-cost robotic arm
2009 (Engelska)Ingår i: Image and Vision Computing, ISSN 0262-8856, E-ISSN 1872-8138, Vol. 27, nr 11, s. 1729-1739Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

In this paper, we present a visual servoing method based on a learned mapping between feature space and control space. Using a suitable recognition algorithm, we present and evaluate a complete method that simultaneously learns the appearance and control of a low-cost robotic arm. The recognition part is trained using an action precedes perception approach. The novelty of this paper, apart from the visual servoing method per se, is the combination of visual servoing with gripper recognition. We show that we can achieve high precision positioning without knowing in advance what the robotic arm looks like or how it is controlled.

Nyckelord
Gripper recognition; Jacobian estimation; LWPR; Visual servoing
Nationell ämneskategori
Teknik och teknologier
Identifikatorer
urn:nbn:se:liu:diva-21195 (URN)10.1016/j.imavis.2009.04.003 (DOI)
Anmärkning
Original Publication: Fredrik Larsson, Erik Jonsson and Michael Felsberg, Simultaneously learning to recognize and control a low-cost robotic arm, 2009, Image and Vision Computing, (27), 11, 1729-1739. http://dx.doi.org/10.1016/j.imavis.2009.04.003 Copyright: Elsevier Science B.V., Amsterdam. http://www.elsevier.com/ Tillgänglig från: 2009-09-30 Skapad: 2009-09-30 Senast uppdaterad: 2017-12-13Bibliografiskt granskad

Open Access i DiVA

Shape Based Recognition: Cognitive Vision Systems in Trac Safety Applications(948 kB)2058 nedladdningar
Filinformation
Filnamn FULLTEXT01.pdfFilstorlek 948 kBChecksumma SHA-512
ccd7e9746526bd8d9a4a6fe23ed082eaacf27f98250c9e6ce26c148171c7a4a61fb1bbaa817a0ab30d1db3a8a619121c1bfe17cb885110cfc1b182b4c18b6442
Typ fulltextMimetyp application/pdf
omslag(73 kB)62 nedladdningar
Filinformation
Filnamn COVER01.pdfFilstorlek 73 kBChecksumma SHA-512
94423b02e42e69a3ee4455e3736cc061a9af87a919d1d2421e4aa37f33af20c0174267d91b76b1a39c959e72b8ac3322d24d0e95982912e46f967ac0b9a8c93d
Typ coverMimetyp application/pdf

Personposter BETA

Larsson, Fredrik

Sök vidare i DiVA

Av författaren/redaktören
Larsson, Fredrik
Av organisationen
DatorseendeTekniska högskolan
Datorseende och robotik (autonoma system)

Sök vidare utanför DiVA

GoogleGoogle Scholar
Totalt: 2058 nedladdningar
Antalet nedladdningar är summan av nedladdningar för alla fulltexter. Det kan inkludera t.ex tidigare versioner som nu inte längre är tillgängliga.

isbn
urn-nbn

Altmetricpoäng

isbn
urn-nbn
Totalt: 2386 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf