Defects in the YOLOv5 model include inadequate feature information extraction, transmission loss, and poor expression capabilities. Based on the NEU-DET dataset’s features, which include tiny, uneven, and low-quality defect targets on steel surfaces, an improved YOLOv5 steel surface defect identification method is presented. Replace the conventional convolution in YOLOv5’s Neck with the deformable convolution network version 2 (DCNv2) to increase the model’s capacity to distinguish irregular tiny target characteristics. To improve the model’s information loss problem during information transmission, a residual network (Res) is proposed to be incorporated into the spatial pyramid pooling fast (SPPF) structure to form the SPPF_Res module, which will replace the original module and introduce the Neck part of YOLOv5. Improve information loss in the model. The spatial attention (SA) and coordinate attention (CA) are simultaneous in CAS attention modules, allowing the upgraded module to pay attention to diverse channel connections, and employ spatial information learning to improve the model’s feature representation capacity. After training the new model on the NEU-DET data set, the better model has a detection accuracy of 77.4%, which is 7.4% higher than the original YOLOv5 model, demonstrating that the enhanced approach is successful.
YU Longfei, LIU Yefeng, HE Zengpeng, MA Yihang.
A Method for Improving YOLOv5 Strip Surface Defect Detection[J]. Control Engineering of China, 2025, 32(2): 377-384