Skip to main content

OrdinalEncoder and PCA based NB Classification for Leaked Natural Gas Prediction Using IoT based Remote Monitoring System

  • Conference paper
  • First Online:
Advances in Intelligent Information Hiding and Multimedia Signal Processing

Abstract

The natural gas (NG), usually methane gas, leaks into the air; it is a big problem for air pollution and the environment. In this paper, we propose to predict gas leakage using ML methods based on the open data provided by the server using IoT-based remote monitoring Picarro gas sensor specification. The performance of the OrdinalEncoder (OE) and MaxAbs normalization-based Naive Bayes techniques was compared with and without the dimensional reduction principal component analysis (PCA) for NG leak prediction. The first step is a preprocessing stage to convert the data based on OE, which results in selecting feature data. The second step is classified into gas CH4 data by the k-means algorithm. After k-means clustering, the experimental dataset has done an imbalanced data. Therefore, we focusing our proposed models can predict medium and high risk so best. In this case, we compared the receiver operating characteristic (ROC) curve for each classification model. As a result of our experiments, the evaluation measurements include ROC reached 85.3% with the OrdinalEncoder (OE)-NB without PCA for the high-level class; ROC values are 72.2, 73.3, and 75.3% for all classes on the OE_PCA_NB with PCA, respectively. These results showed that the proposed OE-NB and OE-PCA-NB outperformed other models for NG leaks prediction.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Khongorzul, D., Kim, M.-H., Lee, S.M.: OrdinalEncoder based DNN for natural gas leak prediction. J. Korea Converg. Soc. 10(10), 7–13 (2019)

    Google Scholar 

  2. Weller, Z.D., Yang, D.K., Fischer, J.C.: An open source algorithm to detect natural gas leaks from mobile methane survey data. PLoS ONE 14(2), e0212287 (2019)

    Article  Google Scholar 

  3. Vapnik, V.N.: The Nature of Statistical Learning Theory. Springer, New York (1995)

    Book  Google Scholar 

  4. Drori, I., et al.: Automatic machine learning by pipeline synthesis using model-based reinforcement learning and a grammar. In: 6th ICML Workshop on Automated Machine Learning (2019), arXiv:1905.10345v1, 24 May (2019)

  5. Miranda, E., et al.: Detection of cardiovascular disease risk’s level for adults using Naive Bayes classifier. Health Inform Res. 22(3), 196–205 (2016)

    Article  Google Scholar 

  6. https://github.com/JVF-CSU/MobileMethaneSurveys/tree/master/Scripts/SampleRawData

  7. Jupri, M., Sarno, R.: Taxpayer compliance classification using C4.5, SVM, KNN, Naive Bayes and MLP. International Conference on Information and Communication Technology on Proceedings, pp. 297–303. Yogyakarta (2018)

    Google Scholar 

  8. Feng, P.M., Ding, H., Chen, W., Lin, H.: Naïve Bayes classifier with feature selection to identify phage virion proteins. Comput. Math. Methods Med. Article ID 530696, (2013)

    Google Scholar 

  9. Ting, S.L., Ip, W.H., Tsang, A.H.: Is Naïve Bayes a good classifier for document classification. Int. J. Softw. Eng. Its Appl. 5(3), (2011)

    Google Scholar 

  10. Soriaa, D., et al.: A ‘non-parametric’ version of the naive Bayes classifier. Knowl.-Based Syst. 24(6), 775–784 (2011)

    Article  Google Scholar 

  11. Novakovic, J.: The impact of feature selection on the accuracy of Naïve Bayes classifier. In: 18th Telecommunications Forum TELFOR2010, Serbia, Belgrade, 23–25 Nov (2010)

    Google Scholar 

  12. Naseriparsa, M., Mansour, M., Kashani, R.: Combination of PCA with SMOTE resampling to boost the prediction rate in lung cancer dataset. Int. J. Comput. Appl. 77(3), 33–38 (2013)

    Google Scholar 

  13. Jingnian Chen, J., et al.: Feature selection for text classification with Naïve Bayes. Expert Syst. Appl. 36(3), 5432–5435 (2009)

    Article  Google Scholar 

  14. Zhang, M.L., Pena, J.M., Robles, V.: Feature selection for multi-label naive Bayes classification. Inf. Sci. 179(19), 3218–3229 (2009)

    Article  Google Scholar 

  15. Amarbayasgalan, T., Park, K.H., Lee, J.Y., Ryu, K.H.: Reconstruction error based deep neural networks for coronary heart disease risk prediction. PLoS ONE 14(12), e0225991 (2019)

    Article  Google Scholar 

Download references

Acknowledgements

This research was financially supported by the Ministry of Trade, Industry, and Energy (MOTIE), Korea, under the “Regional Specialized Industry Development Program (R&D, P0002072)” supervised by the Korea Institute for Advancement of Technology (KIAT).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mi-Hye Kim .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Dashdondov, K., Lee, SM., Kim, MH. (2021). OrdinalEncoder and PCA based NB Classification for Leaked Natural Gas Prediction Using IoT based Remote Monitoring System. In: Pan, JS., Li, J., Ryu, K.H., Meng, Z., Klasnja-Milicevic, A. (eds) Advances in Intelligent Information Hiding and Multimedia Signal Processing. Smart Innovation, Systems and Technologies, vol 212. Springer, Singapore. https://doi.org/10.1007/978-981-33-6757-9_32

Download citation

Publish with us

Policies and ethics