Applied Sciences, Vol. 15, Pages 11296: SANet: A Pure Vision Strip-Aware Network with PSSCA and Multistage Fusion for Weld Seam Detection


Applied Sciences, Vol. 15, Pages 11296: SANet: A Pure Vision Strip-Aware Network with PSSCA and Multistage Fusion for Weld Seam Detection

Applied Sciences doi: 10.3390/app152011296

Authors:
Zhijian Zhu
Haoran Gu
Zhao Yang
Lijie Zhao
Guoli Song
Qinghui Wang

Weld seam detection is a fundamental prerequisite for robotic welding automation, yet it remains challenging due to the elongated shape of welds, weak contrast against metallic backgrounds, and significant environmental interference in industrial scenarios. To address these challenges, we propose a novel deep neural network architecture termed SANet (Strip-Aware Network). The model is constructed upon a U-shaped backbone and integrates strip-aware feature modeling with multistage supervision. It mainly consists of two complementary modules: the Paralleled Strip and Spatial Context-Aware (PSSCA) module and the Multistage Fusion (MF) module. The PSSCA module enhances the extraction of elongated strip-like features by combining parallel strip perception with spatial context modeling, thereby improving fine-grained weld seam representation. In addition, SANet integrates the StripPooling attention mechanism as an auxiliary component to enlarge the receptive field along strip directions and enhance feature discrimination under complex backgrounds. Meanwhile, the MF module performs cross-stage feature fusion by aggregating encoder and decoder features at multiple levels, ensuring accurate boundary recovery and robust global-to-local interaction. The weld seam detection task is formulated as a two-dimensional segmentation problem and evaluated on a self-built dataset consisting of over 4000 weld seam images covering diverse industrial scenarios such as pipe joints, trusses, elbows, and furnace structures. Experimental results show that SANet achieves an IoU of 96.23% and a Dice coefficient of 98.07%, surpassing all compared models and demonstrating its superior performance in weld seam detection. These findings validate the effectiveness of the proposed architecture and highlight its potential as a low-cost, flexible, and reliable pure vision solution for intelligent welding applications.



Source link

Zhijian Zhu www.mdpi.com