TUHH Open Research
Help
  • Log In
    New user? Click here to register.Have you forgotten your password?
  • English
  • Deutsch
  • Communities & Collections
  • Publications
  • Research Data
  • People
  • Institutions
  • Projects
  • Statistics
  1. Home
  2. TUHH
  3. Publications
  4. An edge Is all you need: Cracking the code for generating synthetic datasets for robust crack detection models
 
Options

An edge Is all you need: Cracking the code for generating synthetic datasets for robust crack detection models

Citation Link: https://doi.org/10.15480/882.16805
Publikationstyp
Conference Paper
Date Issued
2026
Sprache
English
Author(s)
Holst, Dirk  orcid-logo
Flugzeug-Produktionstechnik M-23  
Schmedemann, Ole  orcid-logo
Flugzeug-Produktionstechnik M-23  
Schüppstuhl, Thorsten  orcid-logo
Flugzeug-Produktionstechnik M-23  
TORE-DOI
10.15480/882.16805
TORE-URI
https://hdl.handle.net/11420/61839
Journal
Procedia CIRP  
Volume
138
Start Page
1097
End Page
1102
Citation
18th CIRP Conference on Intelligent Computation in Manufacturing Engineering, CIRP ICME 2024 (2026)
Contribution to Conference
18th CIRP Conference on Intelligent Computation in Manufacturing Engineering, CIRP ICME 2024  
Publisher DOI
10.1016/j.procir.2026.01.189
Scopus ID
2-s2.0-105030652021
Publisher
Elsevier
Generating synthetic datasets for training crack detection models remains a challenge due to the variability of crack appearances and the need for pixel-level annotations. As a promising alternative to manual labeling, synthetic data generation using Perlin Noise has gained attention. However, the relationship between Perlin Noise parameters and the physical characteristics of cracks is not well-established, leading to potential dataset bias and suboptimal model performance. This paper presents a novel approach that maps Perlin Noise generated crack parameters to crack characteristics such as width, length, curvature, and bifurcations, enabling the generation of diverse and realistic crack patterns. We employ a broad domain randomization technique by projecting the generated cracks onto randomly selected background images from the ImageNet database to reduce dataset bias and enhance model robustness. Using the Partitioning Around Medoids (PAM) algorithm, we create six distinct datasets capturing a comprehensive range of crack parameter variations. We demonstrate the effectiveness of our approach by fine-tuning the state-of-the-art SegFormer 5b model on our synthetic datasets and benchmarking its performance on the Crack500 dataset. Through linear regression analysis, we can identify the key crack parameters that influence model performance. Our results show that synthetic cracks have to be long, wide, straight, and should have few bifurcations to maximize evaluation metrics such as Intersection over Union (IoU), precision, recall, and F1.
Subjects
Crack Detection
Dataset Analyses
Pavement Cracks
Perlin Noise
Synthetic Data
DDC Class
006.3: Artificial Intelligence
004: Computer Sciences
Lizenz
https://creativecommons.org/licenses/by-nc-nd/4.0/
Publication version
publishedVersion
Loading...
Thumbnail Image
Name

1-s2.0-S2212827126001903-main.pdf

Type

Main Article

Size

728.03 KB

Format

Adobe PDF

TUHH
Weiterführende Links
  • Contact
  • Send Feedback
  • Cookie settings
  • Privacy policy
  • Impress
DSpace Software

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science
Design by effective webwork GmbH

  • Deutsche NationalbibliothekDeutsche Nationalbibliothek
  • ORCiD Member OrganizationORCiD Member Organization
  • DataCiteDataCite
  • Re3DataRe3Data
  • OpenDOAROpenDOAR
  • OpenAireOpenAire
  • BASE Bielefeld Academic Search EngineBASE Bielefeld Academic Search Engine
Feedback