liu.seSök publikationer i DiVA
Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Optimizing Text Recognition in Mechanical Drawings: A Comprehensive Approach
Linköpings universitet, Institutionen för ekonomisk och industriell utveckling, Produktrealisering. Linköpings universitet, Tekniska fakulteten.ORCID-id: 0000-0002-5950-4962
Linköpings universitet, Institutionen för ekonomisk och industriell utveckling, Produktrealisering. Linköpings universitet, Tekniska fakulteten.
2025 (Engelska)Ingår i: Machines, E-ISSN 2075-1702, Vol. 13, nr 3, artikel-id 254Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

The digitalization of engineering drawings is a pivotal step toward automating and improving the efficiency of product design and manufacturing systems (PDMSs). This study presents eDOCr2, a framework that combines traditional OCR and image processing to extract structured information from mechanical drawings. It segments drawings into key elements-such as information blocks, dimensions, and feature control frames-achieving a text recall of 93.75% and a character error rate (CER) below 1% in a benchmark with drawings from different sources. To improve semantic understanding and reasoning, eDOCr2 integrates Vision Language models (Qwen2-VL-7B and GPT-4o) after segmentation to verify, filter, or retrieve information. This integration enables PDMS applications such as automated design validation, quality control, or manufacturing assessment. The code is available on Github.

Ort, förlag, år, upplaga, sidor
MDPI , 2025. Vol. 13, nr 3, artikel-id 254
Nyckelord [en]
mechanical drawings; optical character recognition; intelligent document processing; quality control; vision language models
Nationell ämneskategori
Produktionsteknik, arbetsvetenskap och ergonomi
Identifikatorer
URN: urn:nbn:se:liu:diva-212838DOI: 10.3390/machines13030254ISI: 001452775200001Scopus ID: 2-s2.0-105001120622OAI: oai:DiVA.org:liu-212838DiVA, id: diva2:1950437
Anmärkning

Funding Agencies|Vinnova; DART project; [2021-02481]; [2024-01420]

Tillgänglig från: 2025-04-07 Skapad: 2025-04-07 Senast uppdaterad: 2026-02-06
Ingår i avhandling
1. Learning-Based Methods for Visual Understanding in Engineering and Production Workflows
Öppna denna publikation i ny flik eller fönster >>Learning-Based Methods for Visual Understanding in Engineering and Production Workflows
2026 (Engelska)Doktorsavhandling, sammanläggning (Övrigt vetenskapligt)
Abstract [en]

Artificial intelligence (AI) has advanced rapidly across engineering and industrial domains, yet its adoption in production environments is often constrained by the need for higher system reliability, limited availability of high-quality data, and the challenge of embedding tacit engineering knowledge into learning-based models. These limitations hinder the broader industrial push toward flexible, data-driven automation capable of handling increasing product variability and shorter production cycles.

This thesis investigates how learning-based methods can be designed and integrated as re-liable perception components within production workflows. Through four complementary case studies, the work demonstrates how visual representations—ranging from Computer-Aided Design (CAD)-derived images and engineering drawings to point clouds and RGB-D images—can be leveraged to address concrete industrial challenges across multiple stages of the manufacturing pipeline.

The first case study predicts fixturing clamp configurations for welding operations in automotive manufacturing by learning geometric patterns from CAD-derived representations. The second applies optical character recognition to engineering drawings to accelerate quality-control and documentation workflows. The third examines scene reconstruction and 3D object detection from point clouds, using synthetic data generation to mitigate data scarcity. The fourth develops a fast, zero-shot pose estimation approach for robotic manipulation, enabling reliable object localization in dynamic industrial environments.

Taken together, these studies show how AI methods informed by structured engineering knowledge can increase process efficiency, reduce manual workload, and help resolve persistent automation bottlenecks in modern manufacturing.

Ort, förlag, år, upplaga, sidor
Linköping: Linköping University Electronic Press, 2026. s. 62
Serie
Linköping Studies in Science and Technology. Dissertations, ISSN 0345-7524 ; 2506
Nationell ämneskategori
Produktionsteknik, arbetsvetenskap och ergonomi
Identifikatorer
urn:nbn:se:liu:diva-221069 (URN)10.3384/9789181184648 (DOI)9789181184631 (ISBN)9789181184648 (ISBN)
Disputation
2026-03-05, ACAS, A-building, Campus Valla, Linköping, 13:15 (Engelska)
Opponent
Handledare
Forskningsfinansiär
Vinnova, 2021-02481Vinnova, 2020-02974Vinnova, 2023-02694
Tillgänglig från: 2026-02-06 Skapad: 2026-02-06 Senast uppdaterad: 2026-02-06Bibliografiskt granskad

Open Access i DiVA

fulltext(7045 kB)1016 nedladdningar
Filinformation
Filnamn FULLTEXT01.pdfFilstorlek 7045 kBChecksumma SHA-512
3e5de136649316abbe45d840b8637c69b31ebec172d575e4e08ad6ad777414211a52292abaf6046d90772d414fb1040f8e7e235c754b1be9c60ca69bb206585a
Typ fulltextMimetyp application/pdf

Övriga länkar

Förlagets fulltextScopus

Sök vidare i DiVA

Av författaren/redaktören
Villena Toro, JavierTarkian, Mehdi
Av organisationen
ProduktrealiseringTekniska fakulteten
I samma tidskrift
Machines
Produktionsteknik, arbetsvetenskap och ergonomi

Sök vidare utanför DiVA

GoogleGoogle Scholar
Totalt: 1016 nedladdningar
Antalet nedladdningar är summan av nedladdningar för alla fulltexter. Det kan inkludera t.ex tidigare versioner som nu inte längre är tillgängliga.

doi
urn-nbn

Altmetricpoäng

doi
urn-nbn
Totalt: 396 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf