Computer Science and Information Systems 2022 Volume 19, Issue 1, Pages: 47-63
https://doi.org/10.2298/CSIS201221043N
Full text ( 938 KB)
Cited by
Comparative analysis of HAR datasets using classification algorithms
Nayak Suvra (Department of Computer Science, Rama Devi Women’s University, Bhubaneswar, India), suvra.nayak24@gmail.com
Panigrahi Chhabi Rani (Department of Computer Science, Rama Devi Women’s University, Bhubaneswar, India), panigrahichhabi@gmail.com
Pati Bibudhendu (Department of Computer Science, Rama Devi Women’s University, Bhubaneswar, India), patibibudhendu@gmail.com
Nanda Sarmistha (Department of Computer Science, Rama Devi Women’s University, Bhubaneswar, India), sarmisthananda@gmail.com
Hsieh Meng-Yen (Department of Computer Science & Information Engineering, Providence University, Taiwan), mengyen@gm.pu.edu.tw
In the current research and development era, Human Activity Recognition (HAR) plays a vital role in analyzing the movements and activities of a human being. The main objective of HAR is to infer the current behaviour by extracting previous information. Now-a-days, the continuous improvement of living condition of human beings changes human society dramatically. To detect the activities of human beings, various devices, such as smartphones and smart watches, use different types of sensors, such as multi modal sensors, non-video based and video-based sensors, and so on. Among the entire machine learning approaches, tasks in different applications adopt extensively classification techniques, in terms of smart homes by active and assisted living, healthcare, security and surveillance, making decisions, tele-immersion, forecasting weather, official tasks, and prediction of risk analysis in society. In this paper, we perform three classification algorithms, Sequential Minimal Optimization (SMO), Random Forest (RF), and Simple Logistic (SL) with the two HAR datasets, UCI HAR and WISDM, downloaded from the UCI repository. The experiment described in this paper uses the WEKA tool to evaluate performance with the matrices, Kappa statistics, relative absolute error, mean absolute error, ROC Area, and PRC Area by 10-fold cross validation technique. We also provide a comparative analysis of the classification algorithms with the two determined datasets by calculating the accuracy with precision, recall, and F-measure metrics. In the experimental results, all the three algorithms with the UCI HAR datasets achieve nearly the same accuracy of 98%.The RF algorithm with the WISDM dataset has the accuracy of 90.69%,better than the others.
Keywords: Machine Learning, Human Activity Recognition, WEKA, Classifier, Classification algorithms
Show references
Sarle, W. S. Neural networks and statistical models, Proceedings of the Nineteenth Annual SAS Users Group International Conference, Cary, NC: SAS Institute, USA, pp. 1538-1550. (1994).
Ann, O. C., and Theng, L. B. Human activity recognition: a review. In 2014 IEEE international conference on the control system, computing, and engineering (ICCSCE 2014) (pp. 389-393). IEEE. (2014).
Oikonomopoulos, A., and Pantic, M. Human activity recognition using hierarchically-mined feature constellations. In International Symposium on Visual Computing, Springer, Berlin, Heidelberg, pp. 150-159. (2013).
Roshtkhari, M. J., and Levine, M. D. Human activity recognition in videos using a single example. Image and Vision Computing, 31(11), pp. 864-876. (2013).
Anguita, D., Ghio, A., Oneto, L., Parra, X., and Reyes-Ortiz, J. L. A public domain dataset for human activity recognition using smartphones. In ESANN proceedings, European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning, Bruges (Belgium), 24-26 April 2013, ISBN 978-2-87419-081-0. (2013).
Weiss, G. M., Yoneda, K., and Hayajneh, T. Smartphone and Smart watch-Based Biometrics Using Activities of Daily Living. IEEE Access, 7, pp. 133190-133202. (2019).
Ho, T. K. Random decision forests. In Proceedings of 3rd international conference on document analysis and recognition, Vol. 1, pp. 278-282. IEEE. (1995).
Ho, T. K. The random subspace method for constructing decision forests. IEEE transactions on pattern analysis and machine intelligence, 20(8), 832-844. (1998).
Chang, C. C., and Lin, C. J. LIBSVM: A library for support vector machines. ACM transactions on intelligent systems and technology (TIST), 2(3), pp. 1-27. (2011).
Zanni, L., Serafini, T., and Zanghirati, G. Parallel software for training large scale support vector machines on multiprocessor systems. Journal of Machine Learning Research, 7(Jul), pp. 1467-1492. (2006).
Tolles, J.,andMeurer, W. J. Logistic regression: relating patient characteristics to outcomes. Jama, 316(5), pp. 533-534. (2016).
Singaravelan, S., Arun, R., Arun Shunmugam, D., Ruba Soundar, K., Mayakrishnan, R., and Murugan, D. Analysis of classification algorithms on different datasets. Review of Innovation and Competitiveness: A Journal of Economic and Social Research, 4(2), pp. 41-54. (2018).
Ranasinghe, S., Al Machot, F., and Mayr, H. C. A review of applications of activity recognition systems with regard to performance and evaluation. International Journal of Distributed Sensor Networks, 12(8), 1550147716665520. (2016).
Krishnan, N. C., Juillard, C., Colbry, D., and Panchanathan, S. Recognition of hand movements using wearable accelerometers. Journal of Ambient Intelligence and Smart Environments, 1(2), pp. 143-155. (2009).
Florence, A. M., and Savithri, R. Talent knowledge acquisition using the C4.5 classification algorithm. International Journal of Emerging Technologies in Computational and Applied Sciences, pp. 406-410. (2013).
Dua, D., and Graff, C. UCI Machine Learning Repository, http://archive.ics.uci.edu/ml, Accessed: 2020-05-01 (2019).
Alghobiri, M. A Comparative Analysis of Classification Algorithms on Diverse Datasets, Engineering, Technology & Applied Science Research, 8(2), pp. 2790-2795. (2018).
Platt, J. Sequential minimal optimization: A fast algorithm for training support vector machines, Technical Report MSR-TR-98-14(1998).
Breiman, L. Random forests. Machine learning, 45(1), 5-32. (2001).
Sumner, M., Frank, E., & Hall, M. Speeding up logistic model tree induction. In European conference on principles of data mining and knowledge discovery (pp. 675-683). Springer, Berlin, Heidelberg. (2005).
Chang, F., Ge, L., Li, S., Wu, K., & Wang, Y. Self-adaptive spatial-temporal network based on heterogeneous data for air quality prediction. Connection Science, 1-20. (2020).
Zhang, S., Hu, Z., Zhu, G., Jin, M., & Li, K. C. Sentiment classification model for Chinese micro-blog comments based on key sentences extraction. Soft Computing. (2020).
Nanda, S., Panigrahi, C. R., Pati, B., & Mishra, A. A Novel Approach to Detect Emergency Using Machine Learning. In Progress in Advanced Computing and Intelligent Engineering (pp. 185-192). Springer, Singapore.
Verma, R. K., Singh, P., Panigrahi, C. R., & Pati, B. ISS: Intelligent Security System Using Facial Recognition. In Progress in Advanced Computing and Intelligent Engineering (pp. 96-101). Springer, Singapore. (2019).
Wang, Q., Zhu, G., Zhang, S., Li, K. C., Chen, X., & Xu, H. Extending emotional lexicon for improving the classification accuracy of Chinese film reviews. Connection Science, 1-20. (2020).
Tian, Q., Han, D., Li, K. C., Liu, X., Duan, L., & Castiglione, A. An intrusion detection approach based on improved deep belief network. Applied Intelligence. (2020).
Ramasamy, V., Gomathy, B., Obulesu, O., Sarkar, J. L., Panigrahi, C. R., Pati, B., & Majumder, A. Machine Learning Techniques and Tools: Merits and Demerits. New Age Analytics, 23. (2020).
Qin, P., Chen, J., Zhang, K., & Chai, R. Convolutional neural networks and hash learning for feature extraction and of fast retrieval of pulmonary nodules. Computer Science and Information Systems, 15(3), 517-531. (2018).
Zhang, C., Pan, X., Li, H., Gardiner, A., Sargent, I., Hare, J., & Atkinson, P. M. A hybrid MLP-CNN classifier for very fine resolution remotely sensed image classification. ISPRS Journal of Photogrammetry and Remote Sensing, 140, 133-144. (2018).
Cao, L., &Shen, H. Imbalanced data classification based on hybrid resampling and twin support vector machine. Computer Science and Information Systems, 14(3), 579-595. (2017).
Albawi, S., Mohammed, T. A., & Al-Zawi, S. Understanding of a convolutional neural network. In 2017 International Conference on Engineering and Technology (ICET) (pp. 1-6). IEEE. (2017).
Bhargava, N., Sharma, G., Bhargava, R., & Mathuria, M. Decision tree analysis on j48 algorithm for data mining. Proceedings of International Journal of Advanced Research in Computer Science and Software Engineering, 3(6). (2013).
Burns, D. M., and Cari M. W. Personalized Activity Recognition with Deep Triplet Embeddings. arXiv preprint arXiv: 2001.05517 (2020).
Ain, A., and Vivek K. Human activity classification in smartphones using accelerometer and gyroscope sensors. IEEE Sensors Journal 18, no. 3 (2017): 1169-1177.
Walse, K. H., Rajiv, V. D., and Vilas, M. T. A study of human activity recognition using Ada Boost classifiers on WISDM dataset. The Institute of Integrative Omics and Applied Biotechnology Journal 7, no. 2: 68-76. (2016)
Kutlay, M. A., and Sadina, G. Application of machine learning in healthcare: Analysis on mhealth dataset. Southeast Europe Journal of Soft Computing 4, no. 2. (2016)
Yang, C., Wenxiang J., and Zhongwen G. Time Series Data Classification Based on Dual Path CNN-RNN Cascade Network. IEEE Access 7: 155304-155312. (2019)
Wang, L., Wei, L., Wen, L., and Luc Van Gool. Appearance-and-relation networks for video classification. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 1430-1439. (2018)
Kwon, M. and Choi, S. Recognition of Daily Human Activity Using an Artificial Neural Network and Smartwatch. Wireless Communications and Mobile Computing, Hindawi, https://doi.org/10.1155/2018/2618045. (2018)
Biswal, A., Nanda, S., Panigrahi C. R., Cowlessur, S. K., Pati, B. Human Activity Recognition Using Machine Learning: A Review. Progress in Advanced Computing and Intelligent Engineering, Advances in Intelligent Systems and Computing 1299, pp. 323-333. (2021)
Mendonça, J., Andrade, E., Endo, P.T., Lima, R. Disaster recovery solutions for IT systems: A Systematic mapping study, Journal of Systems and Software, Volume 149, pp. 511-530. (2019)
Feng, F., Li, K-C, Shen, J., Zhou, Q., Yang, X. Using cost-sensitive learning and feature selection algorithms to improve the performance of imbalanced classification. IEEE Access, vol. 8, pp. 69979-69996. (2020)