Skip to main content

Advertisement

Log in

A Docker-based federated learning framework design and deployment for multi-modal data stream classification

  • Regular Paper
  • Published:
Computing Aims and scope Submit manuscript

Abstract

In the high-performance computing (HPC) domain, federated learning has gained immense popularity. Especially in emotional and physical health analytics and experimental facilities. Federated learning is one of the most promising distributed machine learning frameworks because it supports data privacy and security by not sharing the clients’ data but instead sharing their local models. In federated learning, many clients explicitly train their machine learning/deep learning models (local training) before aggregating them as a global model at the global server. However, the FL framework is difficult to build and deploy across multiple distributed clients due to its heterogeneous nature. We developed Docker-enabled federated learning (DFL) by utilizing client-agnostic technologies like Docker containers to simplify the deployment of FL frameworks for data stream processing on the heterogeneous client. In the DFL, the clients and global servers are written using TensorFlow and lightweight message queuing telemetry transport protocol to communicate between clients and global servers in the IoT environment. Furthermore, the DFL’s effectiveness, efficiency, and scalability are evaluated in the test case scenario where real-time emotion state classification is done from distributed multi-modal physiological data streams under various practical configurations.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13

Similar content being viewed by others

Availability of data and materials

Publically available DEAP dataset [38].

Code availability

https://github.com/officialarijit/Fed-ReMECS-Docker.

Notes

  1. DFL’s source code: https://github.com/officialarijit/Fed-ReMECS-Docker.

  2. DockerHub: https://hub.docker.com/.

  3. Fed-clients: https://hub.docker.com/repository/docker/arijitnandi/fedclient.

  4. Fed-server: https://hub.docker.com/repository/docker/arijitnandi/fedserver.

  5. https://www.tensorflow.org/.

  6. https://github.com/emqx/emqx.

  7. https://www.emqx.io/.

  8. DEAP dataset link: https://www.eecs.qmul.ac.uk/mmv/datasets/deap/.

References

  1. Kim J, Kim D, Lee J (2021) Design and implementation of kubernetes enabled federated learning platform. In: 2021 international conference on information and communication technology convergence (ICTC), pp. 410–412. https://doi.org/10.1109/ICTC52510.2021.9620986

  2. Shivadekar S, Mangalagiri J, Nguyen P, Chapman D, Halem M, Gite R (2021) An intelligent parallel distributed streaming framework for near real-time science sensors and high-resolution medical images. In: 50th international conference on parallel processing workshop. ICPP Workshops ’21. Association for computing machinery, New York, NY, USA. https://doi.org/10.1145/3458744.3474039

  3. Chen Z, Liao W, Hua K, Lu C, Yu W (2021) Towards asynchronous federated learning for heterogeneous edge-powered internet of things. Digital Commun Netw 7(3):317–326. https://doi.org/10.1016/j.dcan.2021.04.001

    Article  Google Scholar 

  4. Abreha HG, Hayajneh M, Serhani MA (2022) Federated learning in edge computing: a systematic survey. Sensors. https://doi.org/10.3390/s22020450

    Article  Google Scholar 

  5. Wan X, Guan X, Wang T, Bai G, Choi B-Y (2018) Application deployment using microservice and docker containers: Framework and optimization. J Netw Comput Appl 119:97–109. https://doi.org/10.1016/j.jnca.2018.07.003

    Article  Google Scholar 

  6. McMahan B, Moore E, Ramage D, Hampson S, Arcas BAy (2017) Communication-efficient learning of deep networks from decentralized data. In: Singh A, Zhu J (eds.) Proceedings of the 20th international conference on artificial intelligence and statistics. Proceedings of machine learning research, vol 54, pp 1273–1282. https://proceedings.mlr.press/v54/mcmahan17a.html

  7. Nandi A, Xhafa F (2022) A federated learning method for real-time emotion state classification from multi-modal streaming. Methods 204:340–347. https://doi.org/10.1016/j.ymeth.2022.03.005

    Article  Google Scholar 

  8. Novakouski M, Lewis G (2021) Operating at the edge. Carnegie Mellon University’s Software Engineering Institute Blog. Accessed 2023 Jan 24 (2021). http://insights.sei.cmu.edu/blog/operating-at-the-edge/

  9. Pitstick K, Ratzlaff J (2022) Containerization at the Edge. Carnegie Mellon University’s Software Engineering Institute Blog. Accessed 24 Jan 2023 (2022). http://insights.sei.cmu.edu/blog/containerization-at-the-edge/

  10. Damián Segrelles Quilis J, López-Huguet S, Lozano P, Blanquer I (2023) A federated cloud architecture for processing of cancer images on a distributed storage. Futur Gen Comput Syst 139:38–52. https://doi.org/10.1016/j.future.2022.09.019

    Article  Google Scholar 

  11. Zou Z, Xie Y, Huang K, Xu G, Feng D, Long D (2022) A docker container anomaly monitoring system based on optimized isolation forest. IEEE Trans Cloud Comput 10(1):134–145. https://doi.org/10.1109/TCC.2019.2935724

    Article  Google Scholar 

  12. Zhuang W, Gan X, Wen Y, Zhang S (2022) Easyfl: a low-code federated learning platform for dummies. IEEE Internet Things J 9(15):13740–13754. https://doi.org/10.1109/JIOT.2022.3143842

    Article  Google Scholar 

  13. Caldas S, Duddu SMK, Wu P, Li T, Konečnỳ J, McMahan HB, Smith V, Talwalkar A (2018) Leaf: a benchmark for federated settings. arXiv preprint arXiv:1812.01097

  14. Tensorflow Federated. https://www.tensorflow.org/federated

  15. Ryffel T, Trask A, Dahl M, Wagner B, Mancuso J, Rueckert D, Passerat-Palmbach J (2018) A generic framework for privacy preserving deep learning. arXiv preprint arXiv:1811.04017

  16. FederatedAI: Federatedai/Fate: An Industrial Grade Federated Learning Framework. https://github.com/FederatedAI/FATE

  17. Ma Y, Yu D, Wu T, Wang H (2019) Paddlepaddle: an open-source deep learning platform from industrial practice. Front Data Comput 1(1):105–115

    Google Scholar 

  18. Nokleby M, Raja H, Bajwa WU (2020) Scaling-up distributed processing of data streams for machine learning. Proc IEEE 108(11):1984–2012. https://doi.org/10.1109/JPROC.2020.3021381

    Article  Google Scholar 

  19. Bifet A, Holmes G, Kirkby R, Pfahringer B (2010) Moa: Massive online analysis. J Mach Learn Res 11:1601–1604

    Google Scholar 

  20. Gomes HM, Bifet A, Read J, Barddal JP, Enembreck F, Pfharinger B, Holmes G, Abdessalem T (2017) Adaptive random forests for evolving data stream classification. Mach Learn 106:1469–1495

    Article  MathSciNet  Google Scholar 

  21. Kolter JZ, Maloof MA (2007) Dynamic weighted majority: an ensemble method for drifting concepts. J Mach Learn Res 8(91):2755–2790

    MATH  Google Scholar 

  22. Elwell R, Polikar R (2011) Incremental learning of concept drift in nonstationary environments. IEEE Trans Neural Netw 22(10):1517–1531. https://doi.org/10.1109/TNN.2011.2160459

    Article  Google Scholar 

  23. Polikar R, Upda L, Upda SS, Honavar V (2001) Learn++: an incremental learning algorithm for supervised neural networks. IEEE Trans Syst Man Cybernet Part C (Appl Rev) 31(4):497–508. https://doi.org/10.1109/5326.983933

    Article  Google Scholar 

  24. Gomes HM, Read J, Bifet A (2019) Streaming random patches for evolving data stream classification. In: 2019 IEEE international conference on data mining (ICDM), pp 240–249. https://doi.org/10.1109/ICDM.2019.00034

  25. Haddadpour F, Kamani MM, Mokhtari A, Mahdavi M (2020) Federated learning with compression: unified analysis and sharp guarantees. arXiv preprint arXiv:2007.01154

  26. He C, Li S, So J, Zhang M, Wang H, Wang X, Vepakomma P, Singh A, Qiu H, Shen L, Zhao P, Kang Y, Liu Y, Raskar R, Yang Q, Annavaram M, Avestimehr S (2020) Fedml: a research library and benchmark for federated machine learning. arXiv:2007.13518

  27. Abdulrahman S, Tout H, Ould-Slimane H, Mourad A, Talhi C, Guizani M (2021) A survey on federated learning: the journey from centralized to distributed on-site learning and beyond. IEEE Internet Things J 8(7):5476–5497. https://doi.org/10.1109/JIOT.2020.3030072

    Article  Google Scholar 

  28. Arafeh M, Otrok H, Ould-Slimane H, Mourad A, Talhi C, Damiani E (2023) Modularfed: leveraging modularity in federated learning frameworks. Internet of Things 22:100694. https://doi.org/10.1016/j.iot.2023.100694

    Article  Google Scholar 

  29. Ismail BI, Mostajeran Goortani E, Ab Karim MB, Ming Tat W, Setapa S, Luke JY, Hong Hoe O (2015) Evaluation of docker as edge computing platform. In: 2015 IEEE conference on open systems (ICOS), pp 130–135. https://doi.org/10.1109/ICOS.2015.7377291

  30. Anderson C (2015) Docker [software engineering]. IEEE Softw 32(3):102–3. https://doi.org/10.1109/MS.2015.62

    Article  Google Scholar 

  31. Ismail BI, Jagadisan D, Khalid MF (2011) Determining overhead, variance & isolation metrics in virtualization for iaas cloud. In: Lin SC, Yen E (eds) Data driven e-Science. Springer, New York, NY, pp 315–330

    Chapter  Google Scholar 

  32. Felter W, Ferreira A, Rajamony R, Rubio J (2015) An updated performance comparison of virtual machines and linux containers. In: 2015 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pp. 171–172. https://doi.org/10.1109/ISPASS.2015.7095802

  33. Nandi A, Xhafa F, Subirats L, Fort S (2021) Real-time multimodal emotion classification system in e-learning context. In: Proceedings of the 22nd engineering applications of neural networks conference, pp 423–435

  34. Wan Z, Zhang Z, Yin R, Yu G (2022) Kfiml: Kubernetes-based fog computing iot platform for online machine learning. IEEE Internet Things J 9(19):19463–19476. https://doi.org/10.1109/JIOT.2022.3168085

    Article  Google Scholar 

  35. Zhang Y, Jiang C, Yue B, Wan J, Guizani M (2022) Information fusion for edge intelligence: a survey. Inf Fusion 81:171–186

    Article  Google Scholar 

  36. Zawad S, Yan F, Anwar A (2022) In: Ludwig, H., Baracaldo, N. (eds.) Introduction to federated learning systems, pp. 195–212. Springer, Cham. https://doi.org/10.1007/978-3-030-96896-0_9

  37. Chahoud M, Otoum S, Mourad A (2023) On the feasibility of federated learning towards on-demand client deployment at the edge. Inf Process Manag 60(1):103150. https://doi.org/10.1016/j.ipm.2022.103150

    Article  Google Scholar 

  38. Koelstra S, Muhl C, Soleymani M, Lee J-S, Yazdani A, Ebrahimi T, Pun T, Nijholt A, Patras I (2012) Deap: a database for emotion analysis;using physiological signals. IEEE Trans Affect Comput 3(1):18–31

    Article  Google Scholar 

  39. Ayata D, Yaslan Y, Kamaşak M (2016) Emotion recognition via random forest and galvanic skin response: comparison of time based feature sets, window sizes and wavelet approaches. In: Medical technologies national congress, pp 1–4

  40. Candra H, Yuwono M, Chai R, Handojoseno A, Elamvazuthi I, Nguyen HT, Su S (2015) Investigation of window size in classification of EEg-emotion signal with wavelet entropy and support vector machine. In: 37th annual international conference of the IEEE EMBS, pp 7250–7253

  41. Nandi A, Jana ND, Das S (2020) Improving the performance of neural networks with an ensemble of activation functions. In: 2020 international joint conference on neural networks (IJCNN), pp 1–7. https://doi.org/10.1109/IJCNN48605.2020.9207277

  42. Kang D-H, Kim D-H (2022) 1d convolutional autoencoder-based ppg and gsr signals for real-time emotion classification. IEEE Access 10:91332–91345. https://doi.org/10.1109/ACCESS.2022.3201342

    Article  Google Scholar 

  43. Ayata D, Yaslan Y, Kamasak EM (2020) Emotion recognition from multimodal physiological signals for emotion aware healthcare systems. J Med Biol Eng 149–157

  44. Ayata D, Yaslan Y, Kamasak ME (2018) Emotion based music recommendation system using wearable physiological sensors. IEEE Trans Consum Electron 64(2):196–203. https://doi.org/10.1109/TCE.2018.2844736

    Article  Google Scholar 

  45. Hassan MM, Alam MGR, Uddin MZ, Huda S, Almogren A, Fortino G (2019) Human emotion recognition using deep belief network architecture. Inf Fusion 51:10–18. https://doi.org/10.1016/j.inffus.2018.10.009

    Article  Google Scholar 

  46. Fu Z, Zhang B, He X, Li Y, Wang H, Huang J (2022) Emotion recognition based on multi-modal physiological signals and transfer learning. Front Neurosci. https://doi.org/10.3389/fnins.2022.1000716

    Article  Google Scholar 

Download references

Acknowledgements

Arijit Nandi is a fellow of Eurecat’s "Vicente López" PhD grant program. This study has been partially funded by ACCIÓ, Spain (Pla d’Actuació de Centres Tecnológics 2021) under the project TutorIA. We would like to thank the authors of DEAP dataset [38] for sharing with us.

Funding

Project TutorIA, ACCIÓ, Generalitat de Catalunya, Spain.

Author information

Authors and Affiliations

Authors

Contributions

All authors contributed to designing the model and the computational framework, implementation, analysis of the results and writing of the manuscript.

Corresponding authors

Correspondence to Arijit Nandi or Fatos Xhafa.

Ethics declarations

Conflict of interest

Not applicable.

Ethics approval

Not applicable.

Consent to participate

Not applicable.

Consent for publication

All authors have agreed on the publication.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Nandi, A., Xhafa, F. & Kumar, R. A Docker-based federated learning framework design and deployment for multi-modal data stream classification. Computing 105, 2195–2229 (2023). https://doi.org/10.1007/s00607-023-01179-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00607-023-01179-5

Keywords

Mathematics Subject Classification

Navigation