liu.seSearch for publications in DiVA
Change search
Link to record
Permanent link

Direct link
Moe, Anders
Publications (10 of 22) Show all publications
Viksten, F., Forssén, P.-E., Johansson, B. & Moe, A. (2010). Local Image Descriptors for Full 6 Degree-of-Freedom Object Pose Estimation and Recognition.
Open this publication in new window or tab >>Local Image Descriptors for Full 6 Degree-of-Freedom Object Pose Estimation and Recognition
2010 (English)Article in journal (Refereed) Submitted
Abstract [en]

Recent years have seen advances in the estimation of full 6 degree-of-freedom object pose from a single 2D image. These advances have often been presented as a result of, or together with, a new local image feature type. This paper examines how the pose accuracy and recognition robustness for such a system varies with choice of feature type. This is done by evaluating a full 6 degree-of-freedom pose estimation system for 17 different combinations of local descriptors and detectors. The evaluation is done on data sets with photos of challenging 3D objects with simple and complex backgrounds and varying illumination conditions. We examine the performance of the system under varying levels of object occlusion and we find that many features allow considerable object occlusion. From the experiments we can conclude that duplet features, that use pairs of interest points, improve pose estimation accuracy, compared to single point features. Interestingly, we can also show that many features previously used for recognition and wide-baseline stereo are unsuitable for pose estimation, one notable example are the affine covariant features that have been proven quite successful in other applications. The data sets and their ground truths are available on the web to allow future comparison with novel algorithms.

Keywords
bin picking, pose estimation, local features
National Category
Computer graphics and computer vision
Identifiers
urn:nbn:se:liu:diva-57330 (URN)
Note
This is an extension of http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-44894Available from: 2010-06-16 Created: 2010-06-16 Last updated: 2025-02-07
Forssen, P.-E. & Moe, A. (2009). View matching with blob features. Image and Vision Computing, 27(1-2), 99-107
Open this publication in new window or tab >>View matching with blob features
2009 (English)In: Image and Vision Computing, ISSN 0262-8856, E-ISSN 1872-8138, Vol. 27, no 1-2, p. 99-107Article in journal (Refereed) Published
Abstract [en]

This article introduces a new region based feature for object recognition and image matching. In contrast to many other region based features, this one makes use of colour in the feature extraction stage. We perform experiments on the repeatability rate of the features across scale and inclination angle changes, and show that avoiding to merge regions connected by only a few pixels improves the repeatability. We introduce two voting schemes that allow us to find correspondences automatically, and compare them with respect to the number of valid correspondences they give, and their inlier ratios. We also demonstrate how the matching procedure can be applied to colour correction.

Keywords
Wide-baseline matching, Colour, Region feature, Conic, Colour correction, Homography
National Category
Engineering and Technology
Identifiers
urn:nbn:se:liu:diva-16258 (URN)10.1016/j.imavis.2006.10.005 (DOI)
Available from: 2009-01-12 Created: 2009-01-09 Last updated: 2017-12-14Bibliographically approved
Felsberg, M., Wiklund, J., Jonsson, E., Moe, A. & Granlund, G. (2007). Exploratory Learning Strucutre in Artificial Cognitive Systems. In: International Cognitive Vision Workshop. Paper presented at The 5th International Conference on Computer Vision Systems, 2007, 21-24 March, Bielefeld University, Germany. Bielefeld: eCollections
Open this publication in new window or tab >>Exploratory Learning Strucutre in Artificial Cognitive Systems
Show others...
2007 (English)In: International Cognitive Vision Workshop, Bielefeld: eCollections , 2007Conference paper, Published paper (Other academic)
Abstract [en]

One major goal of the COSPAL project is to develop an artificial cognitive system architecture with the capability of exploratory learning. Exploratory learning is a strategy that allows to apply generalization on a conceptual level, resulting in an extension of competences. Whereas classical learning methods aim at best possible generalization, i.e., concluding from a number of samples of a problem class to the problem class itself, exploration aims at applying acquired competences to a new problem class. Incremental or online learning is an inherent requirement to perform exploratory learning.

Exploratory learning requires new theoretic tools and new algorithms. In the COSPAL project, we mainly investigate reinforcement-type learning methods for exploratory learning and in this paper we focus on its algorithmic aspect. Learning is performed in terms of four nested loops, where the outermost loop reflects the user-reinforcement-feedback loop, the intermediate two loops switch between different solution modes at symbolic respectively sub-symbolic level, and the innermost loop performs the acquired competences in terms of perception-action cycles. We present a system diagram which explains this process in more detail.

We discuss the learning strategy in terms of learning scenarios provided by the user. This interaction between user ('teacher') and system is a major difference to most existing systems where the system designer places his world model into the system. We believe that this is the key to extendable robust system behavior and successful interaction of humans and artificial cognitive systems.

We furthermore address the issue of bootstrapping the system, and, in particular, the visual recognition module. We give some more in-depth details about our recognition method and how feedback from higher levels is implemented. The described system is however work in progress and no final results are available yet. The available preliminary results that we have achieved so far, clearly point towards a successful proof of the architecture concept.

Place, publisher, year, edition, pages
Bielefeld: eCollections, 2007
Keywords
artificial cognitive system, perception action learning, exploratory learning, cognitive bootstrapping
National Category
Engineering and Technology
Identifiers
urn:nbn:se:liu:diva-39511 (URN)10.2390/biecoll-icvs2007-173 (DOI)49069 (Local ID)49069 (Archive number)49069 (OAI)
Conference
The 5th International Conference on Computer Vision Systems, 2007, 21-24 March, Bielefeld University, Germany
Projects
COSPAL
Available from: 2009-10-10 Created: 2009-10-10 Last updated: 2016-05-04Bibliographically approved
Källhammer, J.-E., Eriksson, D., Granlund, G., Felsberg, M., Moe, A., Johansson, B., . . . Forssén, P.-E. (2007). Near Zone Pedestrian Detection using a Low-Resolution FIR Sensor. In: Intelligent Vehicles Symposium, 2007 IEEE: . Istanbul, Turkey: IEEE
Open this publication in new window or tab >>Near Zone Pedestrian Detection using a Low-Resolution FIR Sensor
Show others...
2007 (English)In: Intelligent Vehicles Symposium, 2007 IEEE, Istanbul, Turkey: IEEE , 2007, , p. 339-345Conference paper, Published paper (Refereed)
Abstract [en]

This paper explores the possibility to use a single low-resolution FIR camera for detection of pedestrians in the near zone in front of a vehicle. A low resolution sensor reduces the cost of the system, as well as the amount of data that needs to be processed in each frame.

We present a system that makes use of hot-spots and image positions of a near constant bearing to detect potential pedestrians. These detections provide seeds for an energy minimization algorithm that fits a pedestrian model to the detection. Since false alarms are hard to tolerate, the pedestrian model is then tracked, and the distance-to-collision (DTC) is measured by integrating size change measurements at sub-pixel accuracy, and the car velocity. The system should only engage braking for detections on a collision course, with a reliably measured DTC.

Preliminary experiments on a number of recorded near collision sequences indicate that our method may be useful for ranges up to about 10m using an 80x60 sensor, and somewhat more using a 160x120 sensor. We also analyze the robustness of the evaluated algorithm with respect to dead pixels, a potential problem for low-resolution sensors.

Place, publisher, year, edition, pages
Istanbul, Turkey: IEEE, 2007. p. 339-345
Series
Intelligent Vehicles Symposium, ISSN 1931-0587
National Category
Engineering and Technology
Identifiers
urn:nbn:se:liu:diva-39510 (URN)10.1109/IVS.2007.4290137 (DOI)49068 (Local ID)1-4244-1067-3 (ISBN)49068 (Archive number)49068 (OAI)
Available from: 2009-10-10 Created: 2009-10-10 Last updated: 2016-05-04
Merino, L., Caballero, F., Forssén, P.-E., Wiklund, J., Ferruz, J., Martinez-de Dios, J. R., . . . Ollero, A. (2007). Single and Multi-UAV Relative Position Estimation Based on Natural Landmarks. In: Kimon P. Valavanis (Ed.), Advances in Unmanned Aerial Vehicles: State of the Art and the Road to Autonomy (pp. 267-307). Netherlands: Springer
Open this publication in new window or tab >>Single and Multi-UAV Relative Position Estimation Based on Natural Landmarks
Show others...
2007 (English)In: Advances in Unmanned Aerial Vehicles: State of the Art and the Road to Autonomy / [ed] Kimon P. Valavanis, Netherlands: Springer , 2007, p. 267-307Chapter in book (Other (popular science, discussion, etc.))
Abstract [en]

This Chapter presents a vision-based method for unmanned aerial vehicle (UAV) motion estimation that uses as input an image motion field obtained from matches of point-like features. The Chapter enhances visionbased techniques developed for single UAV localization and demonstrates how they can be modified to deal with the problem of multi-UAV relative position estimation. The proposed approach is built upon the assumption that if different UAVs identify, using their cameras, common objects in a scene, the relative pose displacement between the UAVs can be computed from these correspondences under certain assumptions. However, although point-like features are suitable for local UAV motion estimation, finding matches between images collected using different cameras is a difficult task that may be overcome using blob features. Results justify the proposed approach.

Place, publisher, year, edition, pages
Netherlands: Springer, 2007
Series
Microprocessor-Based and Intelligent Systems Engineering ; 33
National Category
Engineering and Technology
Identifiers
urn:nbn:se:liu:diva-51244 (URN)10.1007/978-1-4020-6114-1_9 (DOI)978-1-4020-6113-4 (ISBN)978-1-4020-6114-1 (ISBN)
Projects
COMETS: “Real-time coordination and control of multiple heterogeneous unmanned aerial vehicles”, IST-2001-34304
Available from: 2009-10-23 Created: 2009-10-23 Last updated: 2015-12-10
Forssén, P.-E. & Moe, A. (2006). Autonomous Learning of Object Appearances using Colour Contour Frames. In: 3rd Canadian Conference on Computer and Robot Vision, CRV06, Québec City, Québec, Canada. Paper presented at 3rd Canadian Conference on Computer and Robot Vision, CRV06, June 07-June. Quebec City, 3rd Canadian Conference on Computer and Robot Vision Québec, Canada (pp. 3-3). Québec, Canada: IEEE Computer Society
Open this publication in new window or tab >>Autonomous Learning of Object Appearances using Colour Contour Frames
2006 (English)In: 3rd Canadian Conference on Computer and Robot Vision, CRV06, Québec City, Québec, Canada, Québec, Canada: IEEE Computer Society , 2006, p. 3-3Conference paper, Published paper (Refereed)
Abstract [en]

In this paper we make use of the idea that a robot can autonomously discover objects and learn their appearances by poking and prodding at interesting parts of a scene. In order to make the resultant object recognition ability more robust, and discriminative, we replace earlier used colour histogram features with an invariant texture-patch method. The texture patches are extracted in a similarity invariant frame which is constructed from short colour contour segments. We demonstrate the robustness of our invariant frames with a repeatability test under general homography transformations of a planar scene. Through the repeatability test, we find that defining the frame using using ellipse segments instead of lines where this is appropriate improves repeatability. We also apply the developed features to autonomous learning of object appearances, and show how the learned objects can be recognised under out-of-plane rotation and scale changes.

Place, publisher, year, edition, pages
Québec, Canada: IEEE Computer Society, 2006
National Category
Engineering and Technology
Identifiers
urn:nbn:se:liu:diva-37179 (URN)10.1109/CRV.2006.17 (DOI)33870 (Local ID)0-7695-2542-3 (ISBN)33870 (Archive number)33870 (OAI)
Conference
3rd Canadian Conference on Computer and Robot Vision, CRV06, June 07-June. Quebec City, 3rd Canadian Conference on Computer and Robot Vision Québec, Canada
Available from: 2009-10-10 Created: 2009-10-10 Last updated: 2015-12-10Bibliographically approved
Felsberg, M., Wiklund, J., Jonsson, E., Moe, A. & Granlund, G. (2006). Exploratory Learning Structure in Artificial Cognitive Systems. Linköping: Linköping University Electronic Press
Open this publication in new window or tab >>Exploratory Learning Structure in Artificial Cognitive Systems
Show others...
2006 (English)Report (Other academic)
Abstract [en]

One major goal of the COSPAL project is to develop an artificial cognitive system architecture with the capability of exploratory learning. Exploratory learning is a strategy that allows to apply generalization on a conceptual level, resulting in an extension of competences. Whereas classical learning methods aim at best possible generalization, i.e., concluding from a number of samples of a problem class to the problem class itself, exploration aims at applying acquired competences to a new problem class. Incremental or online learning is an inherent requirement to perform exploratory learning.

Exploratory learning requires new theoretic tools and new algorithms. In the COSPAL project, we mainly investigate reinforcement-type learning methods for exploratory learning and in this paper we focus on its algorithmic aspect. Learning is performed in terms of four nested loops, where the outermost loop reflects the user-reinforcement-feedback loop, the intermediate two loops switch between different solution modes at symbolic respectively sub-symbolic level, and the innermost loop performs the acquired competences in terms of perception-action cycles. We present a system diagram which explains this process in more detail.

We discuss the learning strategy in terms of learning scenarios provided by the user. This interaction between user (’teacher’) and system is a major difference to most existing systems where the system designer places his world model into the system. We believe that this is the key to extendable robust system behavior and successful interaction of humans and artificial cognitive systems.

We furthermore address the issue of bootstrapping the system, and, in particular, the visual recognition module.We give some more in-depth details about our recognition method and how feedback from higher levels is implemented. The described system is however work in progress and no final results are available yet. The available preliminary results that we have achieved so far, clearly point towards a successful proof of the architecture concept.

Place, publisher, year, edition, pages
Linköping: Linköping University Electronic Press, 2006. p. 5
Series
LiTH-ISY-R, ISSN 1400-3902 ; 2738
Keywords
COSPAL project
National Category
Engineering and Technology
Identifiers
urn:nbn:se:liu:diva-54326 (URN)LiTH-ISY-R-2738 (ISRN)
Available from: 2010-03-09 Created: 2010-03-09 Last updated: 2016-05-04Bibliographically approved
Merino, L., Wiklund, J., Caballero, F., Moe, A., Martinez-de Dios, J. R., Forssén, P.-E., . . . Ollero, A. (2006). Vision-Based Multi-UAV Position Estimation. IEEE Robotics & Automation Magazine, 13(3), 53-62
Open this publication in new window or tab >>Vision-Based Multi-UAV Position Estimation
Show others...
2006 (English)In: IEEE Robotics & Automation Magazine, ISSN 1070-9932, Vol. 13, no 3, p. 53-62Article in journal (Refereed) Published
Abstract [en]

This paper describes a method for vision-based unmanned aerial vehicle (UAV) motion estimation from multiple planar homographies. The paper also describes the determination of the relative displacement between different UAVs employing techniques for blob feature extraction and matching. It then presents and shows experimental results of the application of the proposed technique to multi-UAV detection of forest fires.  

Keywords
Blob features, Multirobot localization, Multirobot systems, Unmanned aerial vehicles, Vision-based localization
National Category
Engineering and Technology
Identifiers
urn:nbn:se:liu:diva-41584 (URN)10.1109/MRA.2006.1678139 (DOI)57993 (Local ID)57993 (Archive number)57993 (OAI)
Available from: 2009-10-10 Created: 2009-10-10 Last updated: 2015-12-10
Felsberg, M., Forssén, P.-E., Moe, A. & Granlund, G. (2005). A COSPAL Subsystem: Solving a Shape-Sorter Puzzle. In: AAAI Fall Symposium: From Reactive to Anticipatory Cognitive Embedded Systems, FS-05-05 (pp. 65-69). AAAI Press
Open this publication in new window or tab >>A COSPAL Subsystem: Solving a Shape-Sorter Puzzle
2005 (English)In: AAAI Fall Symposium: From Reactive to Anticipatory Cognitive Embedded Systems, FS-05-05, AAAI Press , 2005, p. 65-69Conference paper, Published paper (Refereed)
Abstract [en]

 To program a robot to solve a simple shape-sorter puzzle is trivial. To devise a Cognitive System Architecture, which allows the system to find out by itself how to go about a solution, is less than trivial. The development of such an architecture is one of the aims of the COSPAL project, leading to new techniques in vision based Artificial Cognitive Systems, which allow the development of robust systems for real dynamic environments. The systems developed under the project itself remain however in simplified scenarios, likewise the shape-sorter problem described in the present paper. The key property of the described system is its robustness. Since we apply association strategies of local features, the system behaves robustly under a wide range of distortions, as occlusion, colour and intensity changes. The segmentation step which is applied in many systems known from literature is replaced with local associations and view-based hypothesis validation. The hypotheses used in our system are based on the anticipated state of the visual percepts. This state replaces explicit modeling of shapes. The current state is chosen by a voting system and verified against the true visual percepts. The anticipated state is obtained from the association to the manipulator actions, where reinforcement learning replaces the explicit calculation of actions. These three differences to classical schemes allow the design of a much more generic and flexible system with a high level of robustness. On the technical side, the channel representation of information and associative learning in terms of the channel learning architecture are essential ingredients for the system. It is the properties of locality, smoothness, and non-negativity which make these techniques suitable for this kind of application. The paper gives brief descriptions of how different system parts have been implemented and show some examples from our tests.

Place, publisher, year, edition, pages
AAAI Press, 2005
National Category
Engineering and Technology
Identifiers
urn:nbn:se:liu:diva-37187 (URN)33895 (Local ID)33895 (Archive number)33895 (OAI)
Available from: 2009-10-10 Created: 2009-10-10 Last updated: 2016-05-04
Forssen, P.-E. & Moe, A. (2005). Contour Descriptors for View-Based Object Recognition (ed.). Linköping, Sweden: Linköping University, Department of Electrical Engineering
Open this publication in new window or tab >>Contour Descriptors for View-Based Object Recognition
2005 (English)Report (Other academic)
Abstract [en]

This report introduces a robust contour descriptor for view-based object recognition. In recent years great progress has been made in the field of view based object recognition mainly due to the introduction of texture based features such as SIFT and MSER. Although these are remarkably successful for textured objects, they have problems with man-made objects with little or no texture. For such objects, either explicit geometrical models, or contour and shading based features are also needed. This report introduces a robust contour descriptor which we hope can be combined with texture based features to obtain object recognition systems that work in a wider range of situations. Each detected contour is described as a sequence of line and ellipse segments, both which have well defined geometrical transformations to other views. The feature detector is also quite fast, this is mainly due to the idea of first detecting chains of contour points, these chains are then split into line segments, which are later either kept, grouped into ellipses or discarded. We demonstrate the robustness of the feature detector with a repeatability test under general homography transformations of a planar scene. Through the repeatability test, we find that using ellipse segments instead of lines, where this is appropriate improves repeatability. We also apply the features in a robotic setting where object appearances are learned by manipulating the objects.

Place, publisher, year, edition, pages
Linköping, Sweden: Linköping University, Department of Electrical Engineering, 2005
Series
LiTH-ISY-R, ISSN 1400-3902 ; 2706
National Category
Engineering and Technology
Identifiers
urn:nbn:se:liu:diva-53329 (URN)
Available from: 2010-01-21 Created: 2010-01-20 Last updated: 2015-12-10
Organisations

Search in DiVA

Show all publications