Skip to main content

BitNet: Learning-Based Bit-Depth Expansion

  • Conference paper
  • First Online:
Computer Vision – ACCV 2018 (ACCV 2018)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 11362))

Included in the following conference series:

Abstract

Bit-depth is the number of bits for each color channel of a pixel in an image. Although many modern displays support unprecedented higher bit-depth to show more realistic and natural colors with a high dynamic range, most media sources are still in bit-depth of 8 or lower. Since insufficient bit-depth may generate annoying false contours or lose detailed visual appearance, bit-depth expansion (BDE) from low bit-depth (LBD) images to high bit-depth (HBD) images becomes more and more important. In this paper, we adopt a learning-based approach for BDE and propose a novel CNN-based bit-depth expansion network (BitNet) that can effectively remove false contours and restore visual details at the same time. We have carefully designed our BitNet based on an encoder-decoder architecture with dilated convolutions and a novel multi-scale feature integration. We have performed various experiments with four different datasets including MIT-Adobe FiveK, Kodak, ESPL v2, and TESTIMAGES, and our proposed BitNet has achieved state-of-the-art performance in terms of PSNR and SSIM among other existing BDE methods and famous CNN-based image processing networks. Unlike previous methods that separately process each color channel, we treat all RGB channels at once and have greatly improved color restoration. In addition, our network has shown the fastest computational speed in near real-time.

J. Byun and K. Shim—Contributed equally, listed alphabetically.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://github.com/CQFIO/FastImageProcessing.

  2. 2.

    https://github.com/silverneko/Linear-RNN.

  3. 3.

    https://sites.google.com/site/jingliu198810/publication.

References

  1. Abadi, M., et al.: TensorFlow: large-scale machine learning on heterogeneous systems (2015). https://www.tensorflow.org/. Software available from tensorflow.org

  2. Asuni, N., Giachetti, A.: TESTIMAGES: a large-scale archive for testing visual devices and basic image processing algorithms. In: STAG - Smart Tools & Apps for Graphics Conference (2014). https://doi.org/10.2312/stag.20141242

  3. Bychkovsky, V., Paris, S., Chan, E., Durand, F.: Learning photographic global tonal adjustment with a database of input/output image pairs. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2011

    Google Scholar 

  4. Chen, Q., Xu, J., Koltun, V.: Fast image processing with fully-convolutional networks. In: The IEEE International Conference on Computer Vision (ICCV), October 2017

    Google Scholar 

  5. Cheng, C.H., Au, O.C., Liu, C.H., Yip, K.Y.: Bit-depth expansion by contour region reconstruction. In: The IEEE International Symposium on Circuits and Systems, pp. 944–947, May 2009

    Google Scholar 

  6. Chollet, F., et al.: Keras (2015). https://keras.io

  7. Daly, S., Feng, X.: Decontouring: Prevention and removal of false contour artifacts. In: The International Society for Optical Engineering (SPIE). vol. 5292, pp. 130–149 (2004)

    Google Scholar 

  8. Dong, C., Deng, Y., Loy, C.C., Tang, X.: Compression artifacts reduction by a deep convolutional network. In: The IEEE International Conference on Computer Vision (ICCV), pp. 576–584, December 2015

    Google Scholar 

  9. Dumoulin, V., Visin, F.: A guide to convolution arithmetic for deep learning (2016), arXiv preprint arXiv:1603.07285

  10. Glorot, X., Bengio, Y.: Understanding the difficulty of training deep feedforward neural networks. In: Teh, Y.W., Titterington, M. (eds.) Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, Proceedings of Machine Learning Research, vol. 9, pp. 249–256. PMLR, May 2010

    Google Scholar 

  11. Guo, J., Chao, H.: One-to-many network for visually pleasing compression artifacts reduction. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 4867–4876, July 2017

    Google Scholar 

  12. Hall, C., Lowe, M.: Mobile HDR: Dolby vision, HDR10 and mobile HDR premium explained (2018). https://www.pocket-lint.com/phones/news/dolby/138387-mobile-hdr-dolby-vision-hdr10-and-mobile-hdr-premium-explained. Accessed 23 June 2018

  13. Keysers, D., Lampert, C.H., Breuel, T.M.: Color image dequantization by constrained diffusion. In: The International Society for Optical Engineering (SPIE), vol. 6058, p. 6058-1–6058-10, January 2006

    Google Scholar 

  14. Kingma, D., Ba, J.: Adam: a method for stochastic optimization. In: International Conference on Learning Representations (ICLR), May 2015

    Google Scholar 

  15. Kodak: Kodak lossless true color image suite (1999). http://r0k.us/graphics/kodak/. Accessed 23 June 2018

  16. Kundu, D., Evans, B.L.: Full-reference visual quality assessment for synthetic images: a subjective study. In: The IEEE International Conference on Image Processing (ICIP), pp. 2374–2378, September 2015

    Google Scholar 

  17. Lefkimmiatis, S.: Universal denoising networks: a novel CNN architecture for image denoising. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2018

    Google Scholar 

  18. Liu, C.H., Au, O.C., Wong, P.H.W., Kung, M.C., Chao, S.C.: Bit-depth expansion by adaptive filter. In: IEEE International Symposium on Circuits and Systems, pp. 496–499, May 2008

    Google Scholar 

  19. Liu, J., Zhai, G., Yang, X., Chen, C.W.: IPAD: intensity potential for adaptive de-quantization. IEEE Trans. Image Process. (2018)

    Google Scholar 

  20. Liu, S., Pan, J., Yang, M.-H.: Learning recursive filters for low-level vision via a hybrid neural network. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9908, pp. 560–576. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46493-0_34

    Chapter  Google Scholar 

  21. Maas, A.L., Hannun, A.Y., Ng, A.Y.: Rectifier nonlinearities improve neural network acoustic models. In: ICML Workshop on Deep Learning for Audio, Speech and Language Processing (2013)

    Google Scholar 

  22. Mantiuk, R., Krawczyk, G., Myszkowski, K., Seidel, H.P.: Perception-motivated high dynamic range video encoding. Special Issue ACM Transa. Graph. (SIGGRAPH) 23, 733–741 (2004)

    Article  Google Scholar 

  23. Mizuno, A., Ikebe, M.: Bit-depth expansion for noisy contour reduction in natural images. In: The IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1671–1675, March 2016

    Google Scholar 

  24. Park, M.H., Lee, J.W., Park, R.H., Kim, J.S.: False contour reduction using neural networks and adaptive bi-directional smoothing. IEEE Trans. Consum. Electron. 56(2), 870–878 (2010)

    Article  Google Scholar 

  25. Seetzen, H., et al.: High dynamic range display systems. Special Issue ACM Trans. Graph. (SIGGRAPH), 760–768 (2004)

    Article  Google Scholar 

  26. Ulichney, R.A., Cheung, S.: Pixel bit-depth increase by bit replication. In: The International Society for Optical Engineering (SPIE), vol. 3300, pp. 3300-1–3300-10, January 1998

    Google Scholar 

  27. Wan, P., Au, O.C., Tang, K., Guo, Y., Fang, L.: From 2D extrapolation to 1D interpolation: content adaptive image bit-depth expansion. In: The IEEE International Conference on Multimedia and Expo (ICME), pp. 170–175, July 2012

    Google Scholar 

  28. Wan, P., Cheung, G., Florencio, D., Zhang, C., Au, O.C.: Image bit-depth enhancement via maximum a posteriori estimation of AC signal. IEEE Trans. Image Process. 25(6), 2896–2909 (2016)

    Article  MathSciNet  Google Scholar 

  29. Wang, Z., Bovik, A.C., Sheikh, H.R., Simoncelli, E.P.: Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process. 13(4), 600–612 (2004)

    Article  Google Scholar 

  30. Zhang, K., Zuo, W., Zhang, L.: FFDNET: toward a fast and flexible solution for cnn-based image denoising. IEEE Trans. Image Process. 27(9), 4608–4622 (2018)

    Article  MathSciNet  Google Scholar 

  31. Zhao, H., Gallo, O., Frosio, I., Kautz, J.: Loss functions for image restoration with neural networks. IEEE Trans. Comput. Imaging 3(1), 47–57 (2017)

    Article  Google Scholar 

Download references

Acknowledgement

This work was supported by MCST (Ministry of Culture, Sports & Tourism)/KOCCA (KoreaCreativeContentAgency) (R2016030044 - Development of Centext-Based Sports Video Analysis, Summarization, and Retrieval Technologies). The authors would like to thank Jing Liu [19] for releasing source codes for various BDE methods.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Changick Kim .

Editor information

Editors and Affiliations

1 Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 62508 KB)

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Byun, J., Shim, K., Kim, C. (2019). BitNet: Learning-Based Bit-Depth Expansion. In: Jawahar, C., Li, H., Mori, G., Schindler, K. (eds) Computer Vision – ACCV 2018. ACCV 2018. Lecture Notes in Computer Science(), vol 11362. Springer, Cham. https://doi.org/10.1007/978-3-030-20890-5_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-20890-5_5

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-20889-9

  • Online ISBN: 978-3-030-20890-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics