BitNet: Learning-Based Bit-Depth Expansion

Byun, Junyoung; Shim, Kyujin; Kim, Changick

doi:10.1007/978-3-030-20890-5_5

Junyoung Byun¹⁸,
Kyujin Shim¹⁸ &
Changick Kim¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 11362))

Included in the following conference series:

Asian Conference on Computer Vision

2299 Accesses
12 Citations

Abstract

Bit-depth is the number of bits for each color channel of a pixel in an image. Although many modern displays support unprecedented higher bit-depth to show more realistic and natural colors with a high dynamic range, most media sources are still in bit-depth of 8 or lower. Since insufficient bit-depth may generate annoying false contours or lose detailed visual appearance, bit-depth expansion (BDE) from low bit-depth (LBD) images to high bit-depth (HBD) images becomes more and more important. In this paper, we adopt a learning-based approach for BDE and propose a novel CNN-based bit-depth expansion network (BitNet) that can effectively remove false contours and restore visual details at the same time. We have carefully designed our BitNet based on an encoder-decoder architecture with dilated convolutions and a novel multi-scale feature integration. We have performed various experiments with four different datasets including MIT-Adobe FiveK, Kodak, ESPL v2, and TESTIMAGES, and our proposed BitNet has achieved state-of-the-art performance in terms of PSNR and SSIM among other existing BDE methods and famous CNN-based image processing networks. Unlike previous methods that separately process each color channel, we treat all RGB channels at once and have greatly improved color restoration. In addition, our network has shown the fastest computational speed in near real-time.

J. Byun and K. Shim—Contributed equally, listed alphabetically.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

References

Abadi, M., et al.: TensorFlow: large-scale machine learning on heterogeneous systems (2015). https://www.tensorflow.org/. Software available from tensorflow.org
Asuni, N., Giachetti, A.: TESTIMAGES: a large-scale archive for testing visual devices and basic image processing algorithms. In: STAG - Smart Tools & Apps for Graphics Conference (2014). https://doi.org/10.2312/stag.20141242
Bychkovsky, V., Paris, S., Chan, E., Durand, F.: Learning photographic global tonal adjustment with a database of input/output image pairs. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2011
Google Scholar
Chen, Q., Xu, J., Koltun, V.: Fast image processing with fully-convolutional networks. In: The IEEE International Conference on Computer Vision (ICCV), October 2017
Google Scholar
Cheng, C.H., Au, O.C., Liu, C.H., Yip, K.Y.: Bit-depth expansion by contour region reconstruction. In: The IEEE International Symposium on Circuits and Systems, pp. 944–947, May 2009
Google Scholar
Chollet, F., et al.: Keras (2015). https://keras.io
Daly, S., Feng, X.: Decontouring: Prevention and removal of false contour artifacts. In: The International Society for Optical Engineering (SPIE). vol. 5292, pp. 130–149 (2004)
Google Scholar
Dong, C., Deng, Y., Loy, C.C., Tang, X.: Compression artifacts reduction by a deep convolutional network. In: The IEEE International Conference on Computer Vision (ICCV), pp. 576–584, December 2015
Google Scholar
Dumoulin, V., Visin, F.: A guide to convolution arithmetic for deep learning (2016), arXiv preprint arXiv:1603.07285
Glorot, X., Bengio, Y.: Understanding the difficulty of training deep feedforward neural networks. In: Teh, Y.W., Titterington, M. (eds.) Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, Proceedings of Machine Learning Research, vol. 9, pp. 249–256. PMLR, May 2010
Google Scholar
Guo, J., Chao, H.: One-to-many network for visually pleasing compression artifacts reduction. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 4867–4876, July 2017
Google Scholar
Hall, C., Lowe, M.: Mobile HDR: Dolby vision, HDR10 and mobile HDR premium explained (2018). https://www.pocket-lint.com/phones/news/dolby/138387-mobile-hdr-dolby-vision-hdr10-and-mobile-hdr-premium-explained. Accessed 23 June 2018
Keysers, D., Lampert, C.H., Breuel, T.M.: Color image dequantization by constrained diffusion. In: The International Society for Optical Engineering (SPIE), vol. 6058, p. 6058-1–6058-10, January 2006
Google Scholar
Kingma, D., Ba, J.: Adam: a method for stochastic optimization. In: International Conference on Learning Representations (ICLR), May 2015
Google Scholar
Kodak: Kodak lossless true color image suite (1999). http://r0k.us/graphics/kodak/. Accessed 23 June 2018
Kundu, D., Evans, B.L.: Full-reference visual quality assessment for synthetic images: a subjective study. In: The IEEE International Conference on Image Processing (ICIP), pp. 2374–2378, September 2015
Google Scholar
Lefkimmiatis, S.: Universal denoising networks: a novel CNN architecture for image denoising. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2018
Google Scholar
Liu, C.H., Au, O.C., Wong, P.H.W., Kung, M.C., Chao, S.C.: Bit-depth expansion by adaptive filter. In: IEEE International Symposium on Circuits and Systems, pp. 496–499, May 2008
Google Scholar
Liu, J., Zhai, G., Yang, X., Chen, C.W.: IPAD: intensity potential for adaptive de-quantization. IEEE Trans. Image Process. (2018)
Google Scholar
Liu, S., Pan, J., Yang, M.-H.: Learning recursive filters for low-level vision via a hybrid neural network. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9908, pp. 560–576. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46493-0_34
Chapter Google Scholar
Maas, A.L., Hannun, A.Y., Ng, A.Y.: Rectifier nonlinearities improve neural network acoustic models. In: ICML Workshop on Deep Learning for Audio, Speech and Language Processing (2013)
Google Scholar
Mantiuk, R., Krawczyk, G., Myszkowski, K., Seidel, H.P.: Perception-motivated high dynamic range video encoding. Special Issue ACM Transa. Graph. (SIGGRAPH) 23, 733–741 (2004)
Article Google Scholar
Mizuno, A., Ikebe, M.: Bit-depth expansion for noisy contour reduction in natural images. In: The IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1671–1675, March 2016
Google Scholar
Park, M.H., Lee, J.W., Park, R.H., Kim, J.S.: False contour reduction using neural networks and adaptive bi-directional smoothing. IEEE Trans. Consum. Electron. 56(2), 870–878 (2010)
Article Google Scholar
Seetzen, H., et al.: High dynamic range display systems. Special Issue ACM Trans. Graph. (SIGGRAPH), 760–768 (2004)
Article Google Scholar
Ulichney, R.A., Cheung, S.: Pixel bit-depth increase by bit replication. In: The International Society for Optical Engineering (SPIE), vol. 3300, pp. 3300-1–3300-10, January 1998
Google Scholar
Wan, P., Au, O.C., Tang, K., Guo, Y., Fang, L.: From 2D extrapolation to 1D interpolation: content adaptive image bit-depth expansion. In: The IEEE International Conference on Multimedia and Expo (ICME), pp. 170–175, July 2012
Google Scholar
Wan, P., Cheung, G., Florencio, D., Zhang, C., Au, O.C.: Image bit-depth enhancement via maximum a posteriori estimation of AC signal. IEEE Trans. Image Process. 25(6), 2896–2909 (2016)
Article MathSciNet Google Scholar
Wang, Z., Bovik, A.C., Sheikh, H.R., Simoncelli, E.P.: Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process. 13(4), 600–612 (2004)
Article Google Scholar
Zhang, K., Zuo, W., Zhang, L.: FFDNET: toward a fast and flexible solution for cnn-based image denoising. IEEE Trans. Image Process. 27(9), 4608–4622 (2018)
Article MathSciNet Google Scholar
Zhao, H., Gallo, O., Frosio, I., Kautz, J.: Loss functions for image restoration with neural networks. IEEE Trans. Comput. Imaging 3(1), 47–57 (2017)
Article Google Scholar

Download references

Acknowledgement

This work was supported by MCST (Ministry of Culture, Sports & Tourism)/KOCCA (KoreaCreativeContentAgency) (R2016030044 - Development of Centext-Based Sports Video Analysis, Summarization, and Retrieval Technologies). The authors would like to thank Jing Liu [19] for releasing source codes for various BDE methods.

Author information

Authors and Affiliations

School of Electrical Engineering, KAIST, Daejeon, Republic of Korea
Junyoung Byun, Kyujin Shim & Changick Kim

Authors

Junyoung Byun
View author publications
You can also search for this author in PubMed Google Scholar
Kyujin Shim
View author publications
You can also search for this author in PubMed Google Scholar
Changick Kim
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Changick Kim .

Editor information

Editors and Affiliations

IIIT Hyderabad, Hyderabad, India
C. V. Jawahar
ANU, Canberra, ACT, Australia
Hongdong Li
Simon Fraser University, Burnaby, BC, Canada
Greg Mori
ETH Zurich, Zurich, Zürich, Switzerland
Konrad Schindler

1 Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 62508 KB)

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Byun, J., Shim, K., Kim, C. (2019). BitNet: Learning-Based Bit-Depth Expansion. In: Jawahar, C., Li, H., Mori, G., Schindler, K. (eds) Computer Vision – ACCV 2018. ACCV 2018. Lecture Notes in Computer Science(), vol 11362. Springer, Cham. https://doi.org/10.1007/978-3-030-20890-5_5

Download citation

DOI: https://doi.org/10.1007/978-3-030-20890-5_5
Published: 02 June 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-20889-9
Online ISBN: 978-3-030-20890-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics