DOI: 10.1145/3394171.3413500
research-article

BS-MCVR: Binary-sensing based Mobile-cloud Visual Recognition

Published: 12 October 2020

ABSTRACT

Mobile-cloud visual recognition (MCVR) systems, in which low-end mobile sensors persistently collect visual data and transmit it to the cloud for analysis and recognition, are important for visual monitoring applications such as wildfire detection and wildlife monitoring. However, current MCVR systems are mostly human-perception-oriented: they consume substantial computation and energy for data sensing and substantial bandwidth for data transmission, which limits their large-scale deployment. In this work, we present a machine-perception-oriented MCVR system, called BS-MCVR, in which the mobile end efficiently senses highly compact and discriminative features directly from the scene, and the sensed features are analyzed on the cloud for recognition. In particular, the mobile end is designed to operate with completely binary operations and to generate fixed-point feature maps. Experiments on benchmark datasets show that our system needs to transmit only 1/200 of the original image data without much loss in recognition accuracy, while incurring minimal computational cost in the data sensing process. BS-MCVR provides a highly cost-effective solution for deploying MCVR systems at large scale.
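
As a rough illustration of what "completely binary operations" can look like, the sketch below computes a dot product between two sign-binarized vectors using XNOR and a bit count, a technique commonly used in binarized networks. The abstract does not specify the operations used in BS-MCVR, so this is an assumption and not the authors' method; the function names `pack_signs`, `binary_dot`, and `transmitted_bytes`, the 32x32x3 image size, and the 8-bits-per-sample raw format are all hypothetical. The last helper only restates the 1/200 transmission figure as arithmetic.

```python
def pack_signs(values):
    """Pack the signs of a sequence into an int: bit i is 1 when values[i] >= 0."""
    bits = 0
    for i, v in enumerate(values):
        if v >= 0:
            bits |= 1 << i
    return bits


def binary_dot(a_bits, b_bits, n):
    """Dot product of two {-1, +1} vectors of length n given their packed sign bits.

    XNOR marks the positions where the signs agree; each agreement
    contributes +1 and each disagreement -1, so the result is
    2 * agreements - n.
    """
    mask = (1 << n) - 1                      # keep only the n valid bit positions
    xnor = ~(a_bits ^ b_bits) & mask         # 1 where the two signs match
    matches = bin(xnor).count("1")           # population count
    return 2 * matches - n


def transmitted_bytes(height, width, channels, reduction=200):
    """Payload size if only 1/reduction of an 8-bit raw image is transmitted (assumed format)."""
    return height * width * channels / reduction


# Hypothetical inputs: a small signal window and a filter of the same length.
x = [0.5, -1.2, 3.0, -0.1]
w = [1.0, 2.0, -0.5, -4.0]
score = binary_dot(pack_signs(x), pack_signs(w), len(x))

# Hypothetical 32x32 RGB frame at 8 bits per sample.
payload = transmitted_bytes(32, 32, 3)
```

Packing signs into machine words lets a single XNOR and bit count replace many multiply-accumulate steps, which is one reason binary operations are attractive on low-end sensing hardware.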

Published in

MM '20: Proceedings of the 28th ACM International Conference on Multimedia
October 2020
4889 pages
ISBN: 9781450379885
DOI: 10.1145/3394171

Copyright © 2020 ACM

Publisher

Association for Computing Machinery, New York, NY, United States

Acceptance Rates

Overall acceptance rate: 995 of 4,171 submissions (24%)
