liu.seSearch for publications in DiVA
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Procedural modeling and physically based rendering for synthetic data generation in automotive applications
Linköping University, Department of Science and Technology, Media and Information Technology. Linköping University, Faculty of Science & Engineering.ORCID iD: 0000-0003-0298-937X
Linköping University, Department of Science and Technology, Media and Information Technology. Linköping University, Faculty of Science & Engineering.ORCID iD: 0000-0002-6071-2507
7D Labs.
Linköping University, Department of Science and Technology, Media and Information Technology. Linköping University, Faculty of Science & Engineering.ORCID iD: 0000-0002-7765-1747
2017 (English)Other (Other academic)
Abstract [en]

We present an overview and evaluation of a new, systematic approach for generation of highly realistic, annotated synthetic data for training of deep neural networks in computer vision tasks. The main contribution is a procedural world modeling approach enabling high variability coupled with physically accurate image synthesis, and is a departure from the hand-modeled virtual worlds and approximate image synthesis methods used in real-time applications. The benefits of our approach include flexible, physically accurate and scalable image synthesis, implicit wide coverage of classes and features, and complete data introspection for annotations, which all contribute to quality and cost efficiency. To evaluate our approach and the efficacy of the resulting data, we use semantic segmentation for autonomous vehicles and robotic navigation as the main application, and we train multiple deep learning architectures using synthetic data with and without fine tuning on organic (i.e. real-world) data. The evaluation shows that our approach improves the neural network’s performance and that even modest implementation efforts produce state-of-the-art results.

Place, publisher, year, pages
2017. , p. 13
Series
arXiv.org ; 1710.06270
National Category
Other Engineering and Technologies
Identifiers
URN: urn:nbn:se:liu:diva-165751Libris ID: fr27pc2pcqpkjddkOAI: oai:DiVA.org:liu-165751DiVA, id: diva2:1431174
Available from: 2020-05-19 Created: 2020-05-19 Last updated: 2025-02-18Bibliographically approved
In thesis
1. Synthetic data for visual machine learning: A data-centric approach
Open this publication in new window or tab >>Synthetic data for visual machine learning: A data-centric approach
2022 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Deep learning allows computers to learn from observations, or else training data. Successful application development requires skills in neural network design, adequate computational resources, and a training data distribution that covers the application do-main. We are currently witnessing an artificial intelligence (AI) outbreak with enough computational power to train very deep networks and build models that achieve similar or better than human performance. The crucial factor for the algorithms to succeed has proven to be the training data fed to the learning process. Too little or low quality or out-of-the-target distribution data will lead to poorly performing models no matter the capacity and the data regularization methods.

This thesis is a data-centric approach to AI and presents a set of contributions related to synthesizing images for training supervised visual machine learning. It is motivated by the profound potential of synthetic data in cases of low availability of captured data, expensive acquisition and annotation, and privacy and ethical issues. The presented work aims to generate images similar to samples drawn from the target distribution and evaluate the generated data as the sole training data source and in conjunction with captured imagery. For this, two synthesis methods are explored: computer graphics and generative modeling. Computer graphics-based generation methods and synthetic datasets for computer vision tasks are thoroughly reviewed. In the same context, a system employing procedural modeling and physically-based rendering is introduced for data generation for urban scene understanding. The scheme is flexible, easily scalable, and produces complex and diverse images with pixel-perfect annotations at no cost. Generative Adversarial Networks (GANs) are also used to generate images for small data scenarios augmentation. The strategy advances the model’s performance and robustness. Finally, ensembles of independently trained GANs investigate ways to improve images’ diversity and create synthetic data to serve as the only training source.

The application areas of the presented contributions relate to two image modalities, natural and histopathology images, to cover different aspects in the generation methods and the tasks’ characteristics and requirements. There are showcased synthesized examples of natural images for automotive applications and weather classification, and histopathology images for breast cancer and colon adenocarcinoma metastasis detection. This thesis, as a whole, promotes data-centric supervised deep learning development by highlighting the potential of synthetic data as a training data resource. It emphasizes the control over the formation process, the ability of multi-modality formats, and the automatic generation of annotations.

Place, publisher, year, edition, pages
Linköping: Linköping University Electronic Press, 2022. p. 115
Series
Linköping Studies in Science and Technology. Dissertations, ISSN 0345-7524 ; 2202
Keywords
Training data, Synthetic images, Computer graphics, Generative modeling, Natural images, Histopathology, Digital pathology, Machine learning, Deep learning
National Category
Medical Imaging
Identifiers
urn:nbn:se:liu:diva-182336 (URN)10.3384/9789179291754 (DOI)9789179291747 (ISBN)9789179291754 (ISBN)
Public defence
2022-02-14, Domteatern, Visualiseringscenter C, Kungsgatan 54, Norrköping, 09:15 (English)
Opponent
Supervisors
Note

ISBN for PDF has been added in the PDF-version.

Available from: 2022-01-17 Created: 2022-01-17 Last updated: 2025-02-09Bibliographically approved

Open Access in DiVA

fulltext(29167 kB)471 downloads
File information
File name FULLTEXT01.pdfFile size 29167 kBChecksum SHA-512
0c9f4c4ffe94506026f1a90722292fd580664301af6e7f406a74c27cb9c20782391db7c5841877cd9f6edc2a9bacd0161ccffa9f45dfa8ed9315bdbd06f4915d
Type fulltextMimetype application/pdf
image(2567 kB)0 downloads
File information
File name IMAGE01.pngFile size 2567 kBChecksum SHA-512
21569201ace11f3056f4e1c7447eca761253dc35391a39825d9459bed2d25f6f631f505c0e14dabdeae67e76495e1639e7cd035e6d2f58a3baf3ce3cf5860c22
Type imageMimetype image/png
image(2881 kB)0 downloads
File information
File name IMAGE02.pngFile size 2881 kBChecksum SHA-512
187fcddf5fb6132f64cd2bfc59d569cf26c3b20d833c79423e52de66c37db0c2941b6c72d8e6b015c9b19a81678301f7f16bfcf77efbfacee37e87f3514c3cfd
Type imageMimetype image/png
image(543 kB)0 downloads
File information
File name IMAGE03.pngFile size 543 kBChecksum SHA-512
c167006b58229b7d0075a678a4ad3b4dca30f434a090683c61c0b53f6c0c3afd9eeb9d79f74f2a322dc5e70552229b4f02ada217ba4a453ae33c68aeb3eb69c4
Type imageMimetype image/png
image(2670 kB)0 downloads
File information
File name IMAGE04.pngFile size 2670 kBChecksum SHA-512
774674bc24a2f53205811a970ff836d09a24ed52f50f31992b0cfed50a407d67ed32ea2039ba05bca06305d0019b266c096e4207215a82e41baf2f99c313bd60
Type imageMimetype image/png
image(2543 kB)0 downloads
File information
File name IMAGE05.pngFile size 2543 kBChecksum SHA-512
d46b563974d4070168ba02eddf32e13f0fb3bad2996849c92ad2194f97707f684ec34f5445c081e4bdf17bffce3d85100a4c5528d5462d502838809e39d801ef
Type imageMimetype image/png

Other links

http://arxiv.org/abs/1710.06270

Authority records

Tsirikoglou, ApostoliaKronander, JoelUnger, Jonas

Search in DiVA

By author/editor
Tsirikoglou, ApostoliaKronander, JoelUnger, Jonas
By organisation
Media and Information TechnologyFaculty of Science & Engineering
Other Engineering and Technologies

Search outside of DiVA

GoogleGoogle Scholar
Total: 472 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 482 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf