liu.seSearch for publications in DiVA
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Temporally-consistent 3D Reconstruction of Birds
Linköping University, Department of Electrical Engineering, Computer Vision. Linköping University, Faculty of Science & Engineering.ORCID iD: 0009-0003-1150-3412
Swedish University of Agricultural Sciences.ORCID iD: 0000-0002-3201-9262
Linköping University, Department of Electrical Engineering, Computer Vision. Linköping University, Faculty of Science & Engineering.ORCID iD: 0009-0002-1203-1093
2024 (English)In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR): CV4Animals: Computer Vision for Animal Behavior, 2024Conference paper, Poster (with or without abstract) (Other academic)
Abstract [en]

This paper deals with 3D reconstruction of seabirds which recently came into focus of environmental scientists as valuable bio-indicators for environmental change. Such 3D information is beneficial for analyzing the bird's behavior and physiological shape, for example by tracking motion, shape, and appearance changes. From a computer vision perspective birds are especially challenging due to their rapid and oftentimes non-rigid motions. We propose an approach to reconstruct the 3D pose and shape from monocular videos of a specific breed of seabird - the common murre. Our approach comprises a full pipeline of detection, tracking, segmentation, and temporally consistent 3D reconstruction. Additionally, we propose a temporal loss that extends current single-image 3D bird pose estimators to the temporal domain. Moreover, we provide a real-world dataset of 10000 frames of video observations on average capture nine birds simultaneously, comprising a large variety of motions and interactions, including a smaller test set with bird-specific keypoint labels. Using our temporal optimization, we achieve state-of-the-art performance for the challenging sequences in our dataset. 

Place, publisher, year, edition, pages
2024.
Keywords [en]
pose estimation, 3D reconstruction, articulated mesh, bird, common murre, temporal
National Category
Computer graphics and computer vision
Identifiers
URN: urn:nbn:se:liu:diva-208815OAI: oai:DiVA.org:liu-208815DiVA, id: diva2:1908705
Conference
IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 17-21, 2024
Funder
Wallenberg AI, Autonomous Systems and Software Program (WASP)
Note

This work was partially supported by the Wallenberg AI, Autonomous Systems and Software Program (WASP) funded by the Knut and Alice Wallenberg Foundation.

The computations were enabled by the Berzelius resource provided by the Knut and Alice Wallenberg Foundation at the National Supercomputer Centre.

Available from: 2024-10-28 Created: 2024-10-28 Last updated: 2025-02-07Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Länk till arxiv / Link to arxivLänk till konferensen / Link to conference

Authority records

Hägerlind, JohannesWandt, Bastian

Search in DiVA

By author/editor
Hägerlind, JohannesHentati-Sundberg, JonasWandt, Bastian
By organisation
Computer VisionFaculty of Science & Engineering
Computer graphics and computer vision

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 166 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf