Abstract
In the high-performance computing (HPC) domain, federated learning has gained immense popularity. Especially in emotional and physical health analytics and experimental facilities. Federated learning is one of the most promising distributed machine learning frameworks because it supports data privacy and security by not sharing the clients’ data but instead sharing their local models. In federated learning, many clients explicitly train their machine learning/deep learning models (local training) before aggregating them as a global model at the global server. However, the FL framework is difficult to build and deploy across multiple distributed clients due to its heterogeneous nature. We developed Docker-enabled federated learning (DFL) by utilizing client-agnostic technologies like Docker containers to simplify the deployment of FL frameworks for data stream processing on the heterogeneous client. In the DFL, the clients and global servers are written using TensorFlow and lightweight message queuing telemetry transport protocol to communicate between clients and global servers in the IoT environment. Furthermore, the DFL’s effectiveness, efficiency, and scalability are evaluated in the test case scenario where real-time emotion state classification is done from distributed multi-modal physiological data streams under various practical configurations.
Similar content being viewed by others
Availability of data and materials
Publically available DEAP dataset [38].
Code availability
Notes
DFL’s source code: https://github.com/officialarijit/Fed-ReMECS-Docker.
DockerHub: https://hub.docker.com/.
DEAP dataset link: https://www.eecs.qmul.ac.uk/mmv/datasets/deap/.
References
Kim J, Kim D, Lee J (2021) Design and implementation of kubernetes enabled federated learning platform. In: 2021 international conference on information and communication technology convergence (ICTC), pp. 410–412. https://doi.org/10.1109/ICTC52510.2021.9620986
Shivadekar S, Mangalagiri J, Nguyen P, Chapman D, Halem M, Gite R (2021) An intelligent parallel distributed streaming framework for near real-time science sensors and high-resolution medical images. In: 50th international conference on parallel processing workshop. ICPP Workshops ’21. Association for computing machinery, New York, NY, USA. https://doi.org/10.1145/3458744.3474039
Chen Z, Liao W, Hua K, Lu C, Yu W (2021) Towards asynchronous federated learning for heterogeneous edge-powered internet of things. Digital Commun Netw 7(3):317–326. https://doi.org/10.1016/j.dcan.2021.04.001
Abreha HG, Hayajneh M, Serhani MA (2022) Federated learning in edge computing: a systematic survey. Sensors. https://doi.org/10.3390/s22020450
Wan X, Guan X, Wang T, Bai G, Choi B-Y (2018) Application deployment using microservice and docker containers: Framework and optimization. J Netw Comput Appl 119:97–109. https://doi.org/10.1016/j.jnca.2018.07.003
McMahan B, Moore E, Ramage D, Hampson S, Arcas BAy (2017) Communication-efficient learning of deep networks from decentralized data. In: Singh A, Zhu J (eds.) Proceedings of the 20th international conference on artificial intelligence and statistics. Proceedings of machine learning research, vol 54, pp 1273–1282. https://proceedings.mlr.press/v54/mcmahan17a.html
Nandi A, Xhafa F (2022) A federated learning method for real-time emotion state classification from multi-modal streaming. Methods 204:340–347. https://doi.org/10.1016/j.ymeth.2022.03.005
Novakouski M, Lewis G (2021) Operating at the edge. Carnegie Mellon University’s Software Engineering Institute Blog. Accessed 2023 Jan 24 (2021). http://insights.sei.cmu.edu/blog/operating-at-the-edge/
Pitstick K, Ratzlaff J (2022) Containerization at the Edge. Carnegie Mellon University’s Software Engineering Institute Blog. Accessed 24 Jan 2023 (2022). http://insights.sei.cmu.edu/blog/containerization-at-the-edge/
Damián Segrelles Quilis J, López-Huguet S, Lozano P, Blanquer I (2023) A federated cloud architecture for processing of cancer images on a distributed storage. Futur Gen Comput Syst 139:38–52. https://doi.org/10.1016/j.future.2022.09.019
Zou Z, Xie Y, Huang K, Xu G, Feng D, Long D (2022) A docker container anomaly monitoring system based on optimized isolation forest. IEEE Trans Cloud Comput 10(1):134–145. https://doi.org/10.1109/TCC.2019.2935724
Zhuang W, Gan X, Wen Y, Zhang S (2022) Easyfl: a low-code federated learning platform for dummies. IEEE Internet Things J 9(15):13740–13754. https://doi.org/10.1109/JIOT.2022.3143842
Caldas S, Duddu SMK, Wu P, Li T, Konečnỳ J, McMahan HB, Smith V, Talwalkar A (2018) Leaf: a benchmark for federated settings. arXiv preprint arXiv:1812.01097
Tensorflow Federated. https://www.tensorflow.org/federated
Ryffel T, Trask A, Dahl M, Wagner B, Mancuso J, Rueckert D, Passerat-Palmbach J (2018) A generic framework for privacy preserving deep learning. arXiv preprint arXiv:1811.04017
FederatedAI: Federatedai/Fate: An Industrial Grade Federated Learning Framework. https://github.com/FederatedAI/FATE
Ma Y, Yu D, Wu T, Wang H (2019) Paddlepaddle: an open-source deep learning platform from industrial practice. Front Data Comput 1(1):105–115
Nokleby M, Raja H, Bajwa WU (2020) Scaling-up distributed processing of data streams for machine learning. Proc IEEE 108(11):1984–2012. https://doi.org/10.1109/JPROC.2020.3021381
Bifet A, Holmes G, Kirkby R, Pfahringer B (2010) Moa: Massive online analysis. J Mach Learn Res 11:1601–1604
Gomes HM, Bifet A, Read J, Barddal JP, Enembreck F, Pfharinger B, Holmes G, Abdessalem T (2017) Adaptive random forests for evolving data stream classification. Mach Learn 106:1469–1495
Kolter JZ, Maloof MA (2007) Dynamic weighted majority: an ensemble method for drifting concepts. J Mach Learn Res 8(91):2755–2790
Elwell R, Polikar R (2011) Incremental learning of concept drift in nonstationary environments. IEEE Trans Neural Netw 22(10):1517–1531. https://doi.org/10.1109/TNN.2011.2160459
Polikar R, Upda L, Upda SS, Honavar V (2001) Learn++: an incremental learning algorithm for supervised neural networks. IEEE Trans Syst Man Cybernet Part C (Appl Rev) 31(4):497–508. https://doi.org/10.1109/5326.983933
Gomes HM, Read J, Bifet A (2019) Streaming random patches for evolving data stream classification. In: 2019 IEEE international conference on data mining (ICDM), pp 240–249. https://doi.org/10.1109/ICDM.2019.00034
Haddadpour F, Kamani MM, Mokhtari A, Mahdavi M (2020) Federated learning with compression: unified analysis and sharp guarantees. arXiv preprint arXiv:2007.01154
He C, Li S, So J, Zhang M, Wang H, Wang X, Vepakomma P, Singh A, Qiu H, Shen L, Zhao P, Kang Y, Liu Y, Raskar R, Yang Q, Annavaram M, Avestimehr S (2020) Fedml: a research library and benchmark for federated machine learning. arXiv:2007.13518
Abdulrahman S, Tout H, Ould-Slimane H, Mourad A, Talhi C, Guizani M (2021) A survey on federated learning: the journey from centralized to distributed on-site learning and beyond. IEEE Internet Things J 8(7):5476–5497. https://doi.org/10.1109/JIOT.2020.3030072
Arafeh M, Otrok H, Ould-Slimane H, Mourad A, Talhi C, Damiani E (2023) Modularfed: leveraging modularity in federated learning frameworks. Internet of Things 22:100694. https://doi.org/10.1016/j.iot.2023.100694
Ismail BI, Mostajeran Goortani E, Ab Karim MB, Ming Tat W, Setapa S, Luke JY, Hong Hoe O (2015) Evaluation of docker as edge computing platform. In: 2015 IEEE conference on open systems (ICOS), pp 130–135. https://doi.org/10.1109/ICOS.2015.7377291
Anderson C (2015) Docker [software engineering]. IEEE Softw 32(3):102–3. https://doi.org/10.1109/MS.2015.62
Ismail BI, Jagadisan D, Khalid MF (2011) Determining overhead, variance & isolation metrics in virtualization for iaas cloud. In: Lin SC, Yen E (eds) Data driven e-Science. Springer, New York, NY, pp 315–330
Felter W, Ferreira A, Rajamony R, Rubio J (2015) An updated performance comparison of virtual machines and linux containers. In: 2015 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pp. 171–172. https://doi.org/10.1109/ISPASS.2015.7095802
Nandi A, Xhafa F, Subirats L, Fort S (2021) Real-time multimodal emotion classification system in e-learning context. In: Proceedings of the 22nd engineering applications of neural networks conference, pp 423–435
Wan Z, Zhang Z, Yin R, Yu G (2022) Kfiml: Kubernetes-based fog computing iot platform for online machine learning. IEEE Internet Things J 9(19):19463–19476. https://doi.org/10.1109/JIOT.2022.3168085
Zhang Y, Jiang C, Yue B, Wan J, Guizani M (2022) Information fusion for edge intelligence: a survey. Inf Fusion 81:171–186
Zawad S, Yan F, Anwar A (2022) In: Ludwig, H., Baracaldo, N. (eds.) Introduction to federated learning systems, pp. 195–212. Springer, Cham. https://doi.org/10.1007/978-3-030-96896-0_9
Chahoud M, Otoum S, Mourad A (2023) On the feasibility of federated learning towards on-demand client deployment at the edge. Inf Process Manag 60(1):103150. https://doi.org/10.1016/j.ipm.2022.103150
Koelstra S, Muhl C, Soleymani M, Lee J-S, Yazdani A, Ebrahimi T, Pun T, Nijholt A, Patras I (2012) Deap: a database for emotion analysis;using physiological signals. IEEE Trans Affect Comput 3(1):18–31
Ayata D, Yaslan Y, Kamaşak M (2016) Emotion recognition via random forest and galvanic skin response: comparison of time based feature sets, window sizes and wavelet approaches. In: Medical technologies national congress, pp 1–4
Candra H, Yuwono M, Chai R, Handojoseno A, Elamvazuthi I, Nguyen HT, Su S (2015) Investigation of window size in classification of EEg-emotion signal with wavelet entropy and support vector machine. In: 37th annual international conference of the IEEE EMBS, pp 7250–7253
Nandi A, Jana ND, Das S (2020) Improving the performance of neural networks with an ensemble of activation functions. In: 2020 international joint conference on neural networks (IJCNN), pp 1–7. https://doi.org/10.1109/IJCNN48605.2020.9207277
Kang D-H, Kim D-H (2022) 1d convolutional autoencoder-based ppg and gsr signals for real-time emotion classification. IEEE Access 10:91332–91345. https://doi.org/10.1109/ACCESS.2022.3201342
Ayata D, Yaslan Y, Kamasak EM (2020) Emotion recognition from multimodal physiological signals for emotion aware healthcare systems. J Med Biol Eng 149–157
Ayata D, Yaslan Y, Kamasak ME (2018) Emotion based music recommendation system using wearable physiological sensors. IEEE Trans Consum Electron 64(2):196–203. https://doi.org/10.1109/TCE.2018.2844736
Hassan MM, Alam MGR, Uddin MZ, Huda S, Almogren A, Fortino G (2019) Human emotion recognition using deep belief network architecture. Inf Fusion 51:10–18. https://doi.org/10.1016/j.inffus.2018.10.009
Fu Z, Zhang B, He X, Li Y, Wang H, Huang J (2022) Emotion recognition based on multi-modal physiological signals and transfer learning. Front Neurosci. https://doi.org/10.3389/fnins.2022.1000716
Acknowledgements
Arijit Nandi is a fellow of Eurecat’s "Vicente López" PhD grant program. This study has been partially funded by ACCIÓ, Spain (Pla d’Actuació de Centres Tecnológics 2021) under the project TutorIA. We would like to thank the authors of DEAP dataset [38] for sharing with us.
Funding
Project TutorIA, ACCIÓ, Generalitat de Catalunya, Spain.
Author information
Authors and Affiliations
Contributions
All authors contributed to designing the model and the computational framework, implementation, analysis of the results and writing of the manuscript.
Corresponding authors
Ethics declarations
Conflict of interest
Not applicable.
Ethics approval
Not applicable.
Consent to participate
Not applicable.
Consent for publication
All authors have agreed on the publication.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Nandi, A., Xhafa, F. & Kumar, R. A Docker-based federated learning framework design and deployment for multi-modal data stream classification. Computing 105, 2195–2229 (2023). https://doi.org/10.1007/s00607-023-01179-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00607-023-01179-5
Keywords
- Federated learning
- High performance computing
- Multi-modal data streaming
- Docker-container
- Real-time emotion classification