Is Tiny Deep Learning the New Deep Learning?

Roveri, Manuel

doi:10.1007/978-981-19-3391-2_2

Manuel Roveri ORCID: orcid.org/0000-0001-7828-7687⁶

Part of the book series: Lecture Notes on Data Engineering and Communications Technologies ((LNDECT,volume 142))

536 Accesses
4 Citations

Abstract

The computing everywhere paradigm is paving the way for the pervasive diffusion of tiny devices (such as Internet-of-Things or edge computing devices) endowed with intelligent abilities. Achieving this goal requires machine and deep learning solutions to be completely redesigned to fit the severe technological constraints on computation, memory, and power consumption typically characterizing these tiny devices. The aim of this paper is to explore tiny machine learning (TinyML) and introduce tiny deep learning (TinyDL) for the design, development, and deployment of machine and deep learning solutions for (an ecosystem of) tiny devices, hence supporting intelligent and pervasive applications following the computing everywhere paradigm.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Deep and Wide Tiny Machine Learning

TinyML: Tools, applications, challenges, and future research directions

Article 09 September 2023

Recent Trends in Deep Learning with Applications

Notes

1.
This issue will be addressed in Sect. 5 by introducing on-device learning mechanisms for ML and DL.
2.
What is described here can easily be extended to other families of DL solutions.
3.
Here, 242 Kb is the memory required to store all of the weights of all of the processing layers, whereas 37 Kb is the memory required to store the intermediate processing results.

References

Alippi C, Disabato S, Roveri M (2018) April) Moving convolutional neural networks to embedded systems: the AlexNet and VGG-16 Case. In: 17th ACM/IEEE International conference on information processing in sensor networks (IPSN). IEEE, Porto, pp 212–223
Google Scholar
Alippi C, Roveri M (2017) The (Not) far-away path to smart cyber-physical systems: an information-centric framework. Computer 50(4):38–47
Article Google Scholar
Cai H, Gan C, Zhu L, Han S (2020) Tiny transfer learning: towards memory-efficient on-device learning
Google Scholar
Canonaco G, Bergamasco A, Mongelluzzo A, Roveri M (2021) Adaptive federated learning in presence of concept drift. In: 2021 International joint conference on neural networks (IJCNN). IEEE, New York, pp 1–7
Google Scholar
Disabato S, Roveri M (2018) Reducing the computation load of convolutional neural networks through gate classification. In: 2018 International joint conference on neural networks (IJCNN). IEEE, New York, pp 1–8
Google Scholar
Disabato S, Roveri M (2020) Incremental on-device tiny machine learning. In: Proceedings of the 2nd International workshop on challenges in artificial intelligence and machine learning for internet of things, pp 7–13
Google Scholar
Disabato S, Roveri M, Alippi C (2021) Distributed deep convolutional neural networks for the internet-of-things. IEEE Trans Comput
Google Scholar
Ditzler G, Roveri M, Alippi C, Polikar R (2015) Learning in nonstationary environments: a survey. IEEE Comput Intell Maga 10(4):12–25
Article Google Scholar
Falbo V, Apicella T, Aurioso D, Danese L, Bellotti F, Berta R, Gloria AD (2019) Analyzing machine learning on mainstream microcontrollers. In: International conference on applications in electronics pervading industry, environment and society. Springer, Berlin, pp 103–108
Google Scholar
Frankle J, Carbin M (2018) The lottery ticket hypothesis: finding sparse, trainable neural networks. arXiv preprint arXiv:1803.03635
Fukushima K (1980) Neocognitron: a self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position. Biol Cybern 36:193–202
Article Google Scholar
Gholami A, Kim S, Dong Z, Yao Z, Mahoney MW, Keutzer K (2021) A survey of quantization methods for efficient neural network inference. arXiv preprint arXiv:2103.13630
Higginbotham S (2019) Machine learning on the edge-[internet of everything]. IEEE Spectrum 57(1):20
Article Google Scholar
Howard AG, Zhu M, Chen B, Kalenichenko D, Wang W, Weyand T, Andreetto M, Adam H (2017) Mobilenets: efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861
Hu D, Krishnamachari B (2020) Fast and accurate streaming CNN inference via communication compression on the edge. In: 2020 IEEE/ACM fifth international conference on internet-of-things design and implementation (IoTDI). IEEE, New York, pp 157–163
Google Scholar
Iandola FN, Han S, Moskewicz MW, Ashraf K, Dally WJ, Keutzer K (2016) Squeezenet: Alexnet-level accuracy with 50\(\times \) fewer parameters and <0.5 mb model size. arXiv preprint arXiv:1602.07360
Ivakhnenko AG (1971) Polynomial theory of complex systems. IEEE Trans Syst Man Cybern 4:364–378
Article MathSciNet Google Scholar
Kephart JO, Chess DM (2003) The vision of autonomic computing. Computer 36(1):41–50
Article MathSciNet Google Scholar
Khan LU, Saad W, Han Z, Hossain E, Hong CS (2021) Federated learning for internet of things: recent advances, taxonomy, and open challenges. IEEE Commun Surv Tutorials
Google Scholar
Kolda TG, Bader BW (2009) Tensor decompositions and applications. SIAM Review 51(3):455–500
Article MathSciNet Google Scholar
Konečnỳ J, McMahan B, Ramage D (2015) Federated optimization: distributed optimization beyond the datacenter. arXiv preprint arXiv:1511.03575
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. Adv Neural Inf Process Syst 25
Google Scholar
Lai L, Suda N, Chandra V (2018) CMSIS-NN: efficient neural network kernels for ARM Cortex-M CPUs. arXiv preprint arXiv:1801.06601
LeCun Y, Boser B, Denker JS, Henderson D, Howard RE, Hubbard W, Jackel LD (1989) Backpropagation applied to handwritten zip code recognition. Neural Comput 1(4):541–551
Article Google Scholar
LeCun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324
Article Google Scholar
Liang T, Glossner J, Wang L, Shi S, Zhang X (2021) Pruning and quantization for deep neural network acceleration: a survey. Neurocomputing 461:370–403
Article Google Scholar
Liu J, Tripathi S, Kurup U, Shah M (2020) Pruning algorithms to accelerate convolutional neural networks for edge applications: a survey. arXiv preprint arXiv:2005.04275
Liu W, Wang Z, Liu X, Zeng N, Liu Y, Alsaadi FE (2017) A survey of deep neural network architectures and their applications. Neurocomputing 234:11–26
Article Google Scholar
McMahan B, Moore E, Ramage D, Hampson S, Arcas BA (2017) Communication-efficient learning of deep networks from decentralized data. In: Artificial intelligence and statistics. PMLR, pp 1273–1282
Google Scholar
Nagel M, Fournarakis M, Amjad RA, Bondarenko Y, van Baalen M, Blankevoort T (2021) A white paper on neural network quantization. arXiv preprint arXiv:2106.08295
Pouyanfar S, Sadiq S, Yan Y, Tian H, Tao Y, Reyes MP, Shyu ML, Chen SC, Iyengar SS (2018) A survey on deep learning: algorithms, techniques, and applications. ACM Comput Surveys (CSUR) 51(5):1–36
Article Google Scholar
Ren H, Anicic D, Runkler TA (2021) Tinyol: Tinyml with online-learning on microcontrollers. In: 2021 International joint conference on neural networks (IJCNN). IEEE, New York, pp 1–8
Google Scholar
Reuther A, Michaleas P, Jones M, Gadepally V, Samsi S, Kepner J (2019) Survey and benchmarking of machine learning accelerators. In: 2019 IEEE high performance extreme computing conference (HPEC). IEEE, New York, pp 1–9
Google Scholar
Sanchez-Iborra R, Skarmeta AF (2020) TinyMLL-enabled frugal smart objects: challenges and opportunities. IEEE Circuits Syst Maga 20(3):4–18
Article Google Scholar
Scardapane S, Scarpiniti M, Baccarelli E, Uncini A (2020) Why should we add early exits to neural networks? Cogn Comput 12(5):954–966
Article Google Scholar
Shakarami A, Ghobaei-Arani M, Shahidinejad A (2020) A survey on the computation offloading approaches in mobile edge computing: a machine learning-based perspective. Comput Networks 182:107496
Article Google Scholar
STMicroelectronics: X-cube-ai (2021) https://www.st.com/en/embedded-software/x-cube-ai.html
Sze V, Chen YH, Yang TJ, Emer JS (2017) Efficient processing of deep neural networks: a tutorial and survey. Proc IEEE 105(12):2295–2329
Article Google Scholar
Szegedy C, Vanhoucke V, Ioffe S, Shlens J, Wojna Z (2016) Rethinking the inception architecture for computer vision. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2818–2826
Google Scholar
Teerapittayanon S, McDanel B, Kung HT (2017) Distributed deep neural networks over the cloud, the edge and end devices. In: 2017 IEEE 37th international conference on distributed computing systems (ICDCS). IEEE, New York, pp 328–339
Google Scholar
Verbraeken J, Wolting M, Katzy J, Kloppenburg J, Verbelen T, Rellermeyer JS (2020) A survey on distributed machine learning. ACM Comput Surveys (CSUR) 53(2):1–33
Article Google Scholar
Warden P, Situnayake D (2019) TinyML. O’Reilly Media (Incorporated)
Google Scholar
Xu R, Wunsch D (2005) Survey of clustering algorithms. IEEE Trans Neural Networks 16(3):645–678
Article Google Scholar
Yang Q, Liu Y, Chen T, Tong Y (2019) Federated machine learning: concept and applications. ACM Trans Intell Syst Technol (TIST) 10(2):1–19
Article Google Scholar

Download references

Author information

Authors and Affiliations

Dipartimento di Elettronica, Informazione e Bioingegneria Politecnico di Milano, Milan, Italy
Manuel Roveri

Authors

Manuel Roveri
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Manuel Roveri .

Editor information

Editors and Affiliations

Director of the Cloud Computing and Distributed Systems (CLOUDS) Laboratory, University of Melbourne, Melbourne, VIC, Australia
Rajkumar Buyya
Computer Science School (FI), Technical University of Madrid, Madrid, Spain
Susanna Munoz Hernandez
Department of Information Technology, Vasavi College of Engineering, Hyderabad, India
Ram Mohan Rao Kovvur
Department of Information Technology, Vasavi College of Engineering, Hyderabad, India
T. Hitendra Sarma

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Roveri, M. (2023). Is Tiny Deep Learning the New Deep Learning?. In: Buyya, R., Hernandez, S.M., Kovvur, R.M.R., Sarma, T.H. (eds) Computational Intelligence and Data Analytics. Lecture Notes on Data Engineering and Communications Technologies, vol 142. Springer, Singapore. https://doi.org/10.1007/978-981-19-3391-2_2

Download citation

DOI: https://doi.org/10.1007/978-981-19-3391-2_2
Published: 02 September 2022
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-3390-5
Online ISBN: 978-981-19-3391-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics

Is Tiny Deep Learning the New Deep Learning?

Abstract

Access this chapter

Similar content being viewed by others

Deep and Wide Tiny Machine Learning

TinyML: Tools, applications, challenges, and future research directions

Recent Trends in Deep Learning with Applications

Notes

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Is Tiny Deep Learning the New Deep Learning?

Abstract

Access this chapter

Similar content being viewed by others

Deep and Wide Tiny Machine Learning

TinyML: Tools, applications, challenges, and future research directions

Recent Trends in Deep Learning with Applications

Notes

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation