Paper
15 March 2024 Edge-guided nonlinear dynamic convolution network for lightweight semantic segmentation
Chunyu Zhang, Fang Xu, Chengdong Wu, Chenghao Zhang
Author Affiliations +
Proceedings Volume 13075, Second International Conference on Physics, Photonics, and Optical Engineering (ICPPOE 2023); 1307502 (2024) https://doi.org/10.1117/12.3025962
Event: Second International Conference on Physics, Photonics, and Optical Engineering (ICPPOE 2023), 2023, Kunming, China
Abstract
As the demand for autonomous driving and robot vision arises, semantic segmentation has developed rapidly. Semantic segmentation provides the means for machines to understand the environment. However, current methods face a balance issue between segmentation quality and computational resources. Our proposed edge-guided non-linear dynamic convolutional network (ENNet) achieves real-time and accurate semantic segmentation. The core of our network is a novel non-linear, dynamic combination (Nd-conv) that allows for a non-linear, dynamic combination (Nd-conv) of multiple convolution weights, improving the convolution's encoding ability while maintaining a low computational burden. The Non-linear Dynamic Convolutional Modules (NDCM) is introduced to enhance the segmentation accuracy of the network while maintaining efficiency. The Multi-stage Feature Fusion Module (MFFM) is also introduced to fuse low-level details and high-level semantic information to improve the segmentation accuracy of the network. Experiments conducted on a 1080Ti GPU show that our model achieves 74.6 mIOU on the Cityscapes dataset while being able to perform inference at 202 FPS. Our network design has achieved an optimal balance between speed and accuracy.
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Chunyu Zhang, Fang Xu, Chengdong Wu, and Chenghao Zhang "Edge-guided nonlinear dynamic convolution network for lightweight semantic segmentation", Proc. SPIE 13075, Second International Conference on Physics, Photonics, and Optical Engineering (ICPPOE 2023), 1307502 (15 March 2024); https://doi.org/10.1117/12.3025962
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Feature fusion

Image segmentation

Computer vision technology

Back to Top