In this work, we delve into the implementation approaches of two different classic attention mechanisms and the effects of two distinct methods for adding attention mechanisms in U-Net image segmentation tasks. Based on experiments, we further conclude that in the context of image segmentation tasks in computer vision, attention mechanisms, while achieving weighted channel information, can have a negative impact on the backbone network of conventional convolutional feature extraction. However, they can effectively compensate for the shortcomings when it comes to encoding high-dimensional semantic information in the encoder-decoder model structure.
|