Abstract
The natural gas (NG), usually methane gas, leaks into the air; it is a big problem for air pollution and the environment. In this paper, we propose to predict gas leakage using ML methods based on the open data provided by the server using IoT-based remote monitoring Picarro gas sensor specification. The performance of the OrdinalEncoder (OE) and MaxAbs normalization-based Naive Bayes techniques was compared with and without the dimensional reduction principal component analysis (PCA) for NG leak prediction. The first step is a preprocessing stage to convert the data based on OE, which results in selecting feature data. The second step is classified into gas CH4 data by the k-means algorithm. After k-means clustering, the experimental dataset has done an imbalanced data. Therefore, we focusing our proposed models can predict medium and high risk so best. In this case, we compared the receiver operating characteristic (ROC) curve for each classification model. As a result of our experiments, the evaluation measurements include ROC reached 85.3% with the OrdinalEncoder (OE)-NB without PCA for the high-level class; ROC values are 72.2, 73.3, and 75.3% for all classes on the OE_PCA_NB with PCA, respectively. These results showed that the proposed OE-NB and OE-PCA-NB outperformed other models for NG leaks prediction.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Khongorzul, D., Kim, M.-H., Lee, S.M.: OrdinalEncoder based DNN for natural gas leak prediction. J. Korea Converg. Soc. 10(10), 7–13 (2019)
Weller, Z.D., Yang, D.K., Fischer, J.C.: An open source algorithm to detect natural gas leaks from mobile methane survey data. PLoS ONE 14(2), e0212287 (2019)
Vapnik, V.N.: The Nature of Statistical Learning Theory. Springer, New York (1995)
Drori, I., et al.: Automatic machine learning by pipeline synthesis using model-based reinforcement learning and a grammar. In: 6th ICML Workshop on Automated Machine Learning (2019), arXiv:1905.10345v1, 24 May (2019)
Miranda, E., et al.: Detection of cardiovascular disease risk’s level for adults using Naive Bayes classifier. Health Inform Res. 22(3), 196–205 (2016)
https://github.com/JVF-CSU/MobileMethaneSurveys/tree/master/Scripts/SampleRawData
Jupri, M., Sarno, R.: Taxpayer compliance classification using C4.5, SVM, KNN, Naive Bayes and MLP. International Conference on Information and Communication Technology on Proceedings, pp. 297–303. Yogyakarta (2018)
Feng, P.M., Ding, H., Chen, W., Lin, H.: Naïve Bayes classifier with feature selection to identify phage virion proteins. Comput. Math. Methods Med. Article ID 530696, (2013)
Ting, S.L., Ip, W.H., Tsang, A.H.: Is Naïve Bayes a good classifier for document classification. Int. J. Softw. Eng. Its Appl. 5(3), (2011)
Soriaa, D., et al.: A ‘non-parametric’ version of the naive Bayes classifier. Knowl.-Based Syst. 24(6), 775–784 (2011)
Novakovic, J.: The impact of feature selection on the accuracy of Naïve Bayes classifier. In: 18th Telecommunications Forum TELFOR2010, Serbia, Belgrade, 23–25 Nov (2010)
Naseriparsa, M., Mansour, M., Kashani, R.: Combination of PCA with SMOTE resampling to boost the prediction rate in lung cancer dataset. Int. J. Comput. Appl. 77(3), 33–38 (2013)
Jingnian Chen, J., et al.: Feature selection for text classification with Naïve Bayes. Expert Syst. Appl. 36(3), 5432–5435 (2009)
Zhang, M.L., Pena, J.M., Robles, V.: Feature selection for multi-label naive Bayes classification. Inf. Sci. 179(19), 3218–3229 (2009)
Amarbayasgalan, T., Park, K.H., Lee, J.Y., Ryu, K.H.: Reconstruction error based deep neural networks for coronary heart disease risk prediction. PLoS ONE 14(12), e0225991 (2019)
Acknowledgements
This research was financially supported by the Ministry of Trade, Industry, and Energy (MOTIE), Korea, under the “Regional Specialized Industry Development Program (R&D, P0002072)” supervised by the Korea Institute for Advancement of Technology (KIAT).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Dashdondov, K., Lee, SM., Kim, MH. (2021). OrdinalEncoder and PCA based NB Classification for Leaked Natural Gas Prediction Using IoT based Remote Monitoring System. In: Pan, JS., Li, J., Ryu, K.H., Meng, Z., Klasnja-Milicevic, A. (eds) Advances in Intelligent Information Hiding and Multimedia Signal Processing. Smart Innovation, Systems and Technologies, vol 212. Springer, Singapore. https://doi.org/10.1007/978-981-33-6757-9_32
Download citation
DOI: https://doi.org/10.1007/978-981-33-6757-9_32
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-33-6756-2
Online ISBN: 978-981-33-6757-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)