Deep image matting with cross-layer contextual information propagation

Zhou, Fenfen; Tian, Yingjie; Zhu, Siyu

doi:10.1007/s00521-024-09431-5

Deep image matting with cross-layer contextual information propagation

Original Article
Published: 20 February 2024

Volume 36, pages 6809–6825, (2024)
Cite this article

Neural Computing and Applications Aims and scope Submit manuscript

Fenfen Zhou^1,2,
Yingjie Tian ORCID: orcid.org/0000-0003-3837-1782^3,4,5 &
Siyu Zhu⁶

152 Accesses
Explore all metrics

Abstract

Natural image matting focuses on accurately estimating the opacity of the foreground object in an arbitrary background. Recently, deep learning-based approaches made significant progress in the matting task benefit from their powerful learning ability for semantic features. However, artifacts, blurry structures, and miscalculated pixels still often appear in some difficult regions with background interference and complex details. To address the above issues, we propose a cross-layer contextual information propagation mechanism (CCIP) that can explicitly model the long-range correlations between global and unknown regions. Specifically, we first calculate region affinity at high-level features with rich structure and semantic information; then reconstruct the adjacent low-level features by propagating information from the global region to the unknown region under the guidance of the affinity matrix; finally, transfer the reconstructed information to the corresponding decoder stage to further improve the feature distinctiveness. In addition, we design a simple and effective supervision strategy in a deep-to-shallow manner to gradually optimize the edges and details of the foreground object. We conducted extensive experiments on the common dataset Composition-1k, the alphamatting.com benchmark, and some real-world images. Compared with previous methods, the proposed method achieves competitive performance on the Composition-1k dataset (30.3 on SAD, 6.8 on MSE, 13.3 on Grad, and 26.7 on Con) and alphamatting.com benchmark (17 on average SAD rank and 16.8 on average Grad rank), while simultaneously yielding high-quality matting results on real-world images.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A survey on Image Data Augmentation for Deep Learning

Article Open access 06 July 2019

Perceptual Losses for Real-Time Style Transfer and Super-Resolution

Learning a Deep Convolutional Network for Image Super-Resolution

Data availibility

The alphamattting.com dataset is available in http://www.alphamatting.com/datasets. The composition-1k dataset is available in https://sites.google.com/view/deepimagematting.

Notes

References

Zhao H, Li H, Cheng L (2020) Improving retinal vessel segmentation with joint local loss by matting. Patt Recognit 98:107068
Article Google Scholar
Chen J, Li X, Luo L, Ma J (2022) Multi-focus image fusion based on multi-scale gradients and image matting. IEEE Trans Multimed 24:655–667
Article ADS Google Scholar
Ma Z, Kim D, Shin Y (2020) Shape-from-focus reconstruction using nonlocal matting Laplacian prior followed by MRF-based refinement. Patt Recognit 103:107302
Article Google Scholar
Lin S, Ryabtsev A, Sengupta S, Curless BL, Seitz SM, Kemelmacher-Shlizerman I (2021) Real-time high-resolution background matting. In: Computer vision and pattern recognition, pp 8762–8771
Xu N, Price BL, Cohen S, Huang TS (2017) Deep image matting. In: Computer vision and pattern recognition, pp 311–320
Hou Q, Cheng M, Hu X, Borji A, Tu Z, Torr PHS (2019) Deeply supervised salient object detection with short connections. IEEE Trans Patt Anal Mach Intell 41(4):815–828
Article Google Scholar
Zeng Y, Fu J, Chao H, Guo B (2019) Learning pyramid-context encoder network for high-quality image inpainting. In: Computer vision and pattern recognition, pp 1486–1494
Liu Y, Cheng M, Fan D, Zhang L, Bian J, Tao D (2022) Semantic edge detection with diverse deep supervision. Int J Comput Vis 130(1):179–198
Article Google Scholar
Gastal ESL, Oliveira MM (2010) Shared sampling for real-time alpha matting. Comput Graph Forum 29(2):575–584
Article Google Scholar
Feng X, Liang X, Zhang Z (2016) A cluster sampling method for image matting via sparse coding. In: European conference on computer vision, pp 204–219
Tang J, Aksoy Y, Öztireli C, Gross MH, Aydin TO (2019) Learning-based sampling for natural image matting. In: Computer vision and pattern recognition, pp 3055–3063
Levin A, Lischinski D, Weiss Y (2008) A closed-form solution to natural image matting. IEEE Trans Patt Anal Mach Intell 30(2):228–242
Article Google Scholar
Chen Q, Li D, Tang C (2013) KNN matting. IEEE Trans Patt Anal Mach Intell 35(9):2175–2188
Article Google Scholar
Hu W, Hsu J (2013) Automatic spectral video matting. Patt Recognit 46(4):1183–1194
Article ADS Google Scholar
Aksoy Y, Aydin TO, Pollefeys M (2017) Designing effective inter-pixel information flow for natural image matting. In: Computer vision and pattern recognition, pp 228–236
Lutz S, Amplianitis K, Smolic A (2018) Alphagan: Generative adversarial networks for natural image matting. In: British machine vision conference, p 259
Lu H, Dai Y, Shen C, Xu S (2019) Indices matter: learning to index for deep image matting. In: International conference on computer vision, pp 3265–3274
Cai S, Zhang X, Fan H, Huang H, Liu J, Liu J, Liu J, Wang J, Sun J (2019) Disentangled image matting. In: International conference on computer vision, pp 8818–8827
Hou Q, Liu F (2019) Context-aware image matting for simultaneous foreground and alpha estimation. In: International conference on computer vision, pp 4129–4138
Yang X, Qiao Y, Chen S, He S, Yin B, Zhang Q, Wei X, Lau RWH (2021) Smart scribbles for image matting. ACM Trans Multimed Comput Commun Appl 16(4):1–21
Article CAS Google Scholar
Li Y, Lu H (2020) Natural image matting via guided contextual attention. In: AAAI association for the advancement of artificial intelligence, pp 11450–11457
Qiao Y, Liu Y, Yang X, Zhou D, Xu M, Zhang Q, Wei X (2020) Attention-guided hierarchical structure aggregation for image matting. In: Computer vision and pattern recognition, pp 13673–13682
Dai Y, Lu H, Shen C (2021) Learning affinity-aware upsampling for deep image matting. In: Computer vision and pattern recognition, pp 6841–6850
Liu Y, Xie J, Qiao Y, Tang Y, Yang X (2022) Prior-induced information alignment for image matting. IEEE Trans Multimed 24:2727–2738
Article Google Scholar
Yu Q, Zhang J, Zhang H, Wang Y, Lin Z, Xu N, Bai Y, Yuille AL (2021) Mask guided matting via progressive refinement network. In: Computer vision and pattern recognition, pp 1154–1163
Wei T, Chen D, Zhou W, Liao J, Zhao H, Zhang W, Yu N (2021) Improved image matting via real-time user clicks and uncertainty estimation. In: Computer vision and pattern recognition, pp 15374–15383
Ding H, Zhang H, Liu C, Jiang X (2022) Deep interactive image matting with feature propagation. IEEE Trans Image Process 31:2421–2432
Article ADS PubMed Google Scholar
Qiao Y, Liu Y, Wei Z, Wang Y, Cai Q, Zhang G, Yang X (2023) Hierarchical and progressive image matting. ACM Trans Multimed Comput Commun Appl 19(2):1–23
Article Google Scholar
Dai Y, Price B, Zhang H, Shen C (2022) Boosting robustness of image matting with context assembling and strong data augmentation. In: Computer vision and pattern recognition, pp 11697–11706
Park G, Son S, Yoo J, Kim S, Kwak N (2022) Matteformer: transformer-based image matting via prior-tokens. In: Computer vision and pattern recognition, pp 11686–11696
Cai H, Xue F, Xu L, Guo L (2022) Transmatting: enhancing transparent objects matting with transformers. In: European conference on computer vision, vol 13689, pp 253–269
Li J, Zhang J, Tao D (2023) Referring image matting. In: Computer vision and pattern recognition, pp 22448–22457
Yi Z, Tang Q, Azizi S, Jang D, Xu Z (2020) Contextual residual aggregation for ultra high-resolution image inpainting. In: Computer vision and pattern recognition, pp 7505–7514
Ding H, Jiang X, Shuai B, Liu AQ, Wang G (2020) Semantic segmentation with context encoding and multi-path decoding. IEEE Trans Image Process 29:3520–3533
Article ADS Google Scholar
Grady L, Schiwietz T, Aharon S, Westermann R (2005) Random walks for interactive alpha-matting, pp 423–429
Chen L, Zhang H, Xiao J, Nie L, Shao J, Liu W, Chua T (2017) SCA-CNN: spatial and channel-wise attention in convolutional networks for image captioning. In: Computer vision and pattern recognition, pp 6298–6306
Rhemann C, Rother C, Wang J, Gelautz M, Kohli P, Rott P (2009) A perceptually motivated online benchmark for image matting. In: Computer vision and pattern recognition, pp 1826–1833
Lin T, Maire M, Belongie SJ, Hays J, Perona P, Ramanan D, Dollár P, Zitnick CL (2014) Microsoft COCO: common objects in context. In: European conference on computer vision, pp 740–755
Kingma, DP, Ba J (2015) Adam: a method for stochastic optimization. In: Bengio Y, LeCun Y (eds) International conference on learning representations
Everingham M, Gool LV, Williams CKI, Winn JM, Zisserman A (2010) The pascal visual object classes (VOC) challenge. Int J Comput Vis 88(2):303–338
Article Google Scholar
Shahrian E, Rajan D, Price BL, Cohen S (2013) Improving image matting using comprehensive sampling sets. In: Computer vision and pattern recognition, pp 636–643
Cho D, Tai Y, Kweon I (2016) Natural image matting using deep convolutional neural networks. In: European conference on computer vision, pp 626–643
Yu H, Xu N, Huang Z, Zhou Y, Shi H (2021) High-resolution deep image matting. In: AAAI association for the advancement of artificial intelligence, pp 3217–3224

Download references

Acknowledgements

This work has been partially supported by grants from: National Natural Science Foundation of China (No.12071458, 71731009).

Author information

Authors and Affiliations

Beijing Key Laboratory of Information Service Engineering, Beijing Union University, Beijing, 100101, China
Fenfen Zhou
College of Robotics, Beijing Union University, Beijing, 100027, China
Fenfen Zhou
School of Economics and Management, UCAS, Beijing, 100190, China
Yingjie Tian
Research Center on Fictitious Economy and Data Science, CAS, Beijing, 100190, China
Yingjie Tian
Key Laboratory of Big Data Mining and Knowledge Management, CAS, Beijing, 100190, China
Yingjie Tian
Computer School, Beijing Information Science and Technology University, Beijing, 100192, China
Siyu Zhu

Authors

Fenfen Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Yingjie Tian
View author publications
You can also search for this author in PubMed Google Scholar
Siyu Zhu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yingjie Tian.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Zhou, F., Tian, Y. & Zhu, S. Deep image matting with cross-layer contextual information propagation. Neural Comput & Applic 36, 6809–6825 (2024). https://doi.org/10.1007/s00521-024-09431-5

Download citation

Received: 06 June 2023
Accepted: 15 January 2024
Published: 20 February 2024
Issue Date: April 2024
DOI: https://doi.org/10.1007/s00521-024-09431-5

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Deep image matting with cross-layer contextual information propagation

Abstract

Access this article

Similar content being viewed by others

A survey on Image Data Augmentation for Deep Learning

Perceptual Losses for Real-Time Style Transfer and Super-Resolution

Learning a Deep Convolutional Network for Image Super-Resolution

Data availibility

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Deep image matting with cross-layer contextual information propagation

Abstract

Access this article

Similar content being viewed by others

A survey on Image Data Augmentation for Deep Learning

Perceptual Losses for Real-Time Style Transfer and Super-Resolution

Learning a Deep Convolutional Network for Image Super-Resolution

Data availibility

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation