Holst, DirkDirkHolstSchmedemann, OleOleSchmedemannSchüppstuhl, ThorstenThorstenSchüppstuhl2026-03-042026-03-04202618th CIRP Conference on Intelligent Computation in Manufacturing Engineering, CIRP ICME 2024 (2026)https://hdl.handle.net/11420/61839Generating synthetic datasets for training crack detection models remains a challenge due to the variability of crack appearances and the need for pixel-level annotations. As a promising alternative to manual labeling, synthetic data generation using Perlin Noise has gained attention. However, the relationship between Perlin Noise parameters and the physical characteristics of cracks is not well-established, leading to potential dataset bias and suboptimal model performance. This paper presents a novel approach that maps Perlin Noise generated crack parameters to crack characteristics such as width, length, curvature, and bifurcations, enabling the generation of diverse and realistic crack patterns. We employ a broad domain randomization technique by projecting the generated cracks onto randomly selected background images from the ImageNet database to reduce dataset bias and enhance model robustness. Using the Partitioning Around Medoids (PAM) algorithm, we create six distinct datasets capturing a comprehensive range of crack parameter variations. We demonstrate the effectiveness of our approach by fine-tuning the state-of-the-art SegFormer 5b model on our synthetic datasets and benchmarking its performance on the Crack500 dataset. Through linear regression analysis, we can identify the key crack parameters that influence model performance. Our results show that synthetic cracks have to be long, wide, straight, and should have few bifurcations to maximize evaluation metrics such as Intersection over Union (IoU), precision, recall, and F1.en2212-8271Procedia CIRP202610971102Elsevierhttps://creativecommons.org/licenses/by-nc-nd/4.0/Crack DetectionDataset AnalysesPavement CracksPerlin NoiseSynthetic DataComputer Science, Information and General Works::006: Special computer methods::006.3: Artificial IntelligenceComputer Science, Information and General Works::004: Computer SciencesAn edge Is all you need: Cracking the code for generating synthetic datasets for robust crack detection modelsConference Paperhttps://doi.org/10.15480/882.1680510.1016/j.procir.2026.01.18910.15480/882.16805Conference Paper