Abstract
India has the worst pollution status amongst all the polluted countries in the world, wherein several cities face severe environmental problems due to the rampant increase in pollution. For research investigation two cities, Gaya and Tirupati were selected to estimate the air pollution SO2 trends by three different modelling techniques: M5P and artificial neural network (ANN) and bagged artificial neural network (BANN). Both cities are cultural centres and important pilgrimage destinations in India. Data were collected with 340 observations for Gaya and 1477 observations for Tirupati between 2017 and 2020 from Central Pollution Control Board (CPCB) and used to generate the SO2 models. During the study, results of three models show that BANN is the best model for Gaya with coefficient correlation (CC) 0.8563, mean absolute error (MAE) 1.7575, root mean square error (RMSE) 2.9023, Nash–Sutcliffe efficiency (NSE) 0.7243, Willmott index (WI) 0.9154 and normalized root mean square error (SI) 0.2457 having a testing stage with input combinations, particulate matter (PM2.5), nitric oxide (NO), NO2 (nitrogen dioxide), NOx, carbon monoxide (CO), ozone (O3), relative humidity (RH), wind speed (WS), wind direction (WD), solar radiation (SR), vertical wind speed (VWS) and absolute temperature (AT), and for Tirupati, also BANN is the best model with CC 0.8634, MAE 1.9088, RMSE 2.5891, NSE 0.7421, WI 0.9233 and SI 0.3409 having a testing stage with input combinations, PM2.5, NO, NO2, NOx, CO, O3, RH, WS, WD, SR, VWS, AT and SO2. The most influential parameters are WD and NO2 for Gaya and NO2 and O3 for Tirupati. Graphical results also confirm that BANN-based model is best performing for both cities Gaya and Tirupati.
Similar content being viewed by others
References
Alberta Environment. Sulphur dioxide: environmental effects, fate and behaviour. Accessed 1 Mar 2003, http://aep.alberta.ca/air/legislation/ambient-air-quality-objectives/documents/SulphurDioxideEffectsFateBehaviour-2003.pdf
Amal L, Chabchoub H (2018) SGA: spatial GIS-based genetic algorithm for route optimization of municipal solid waste collection. Environ Sci Pollut Res 25(27):27569–27582. https://doi.org/10.1007/s11356-018-2826-0
Azimi S, Rocher V, Muller M, Moilleron R, Thévenot DR (2005) Sources, distribution and variability of hydrocarbons and metals in atmospheric deposition in an urban area (Paris, France). Sci Total Environ. 337:223–239. https://doi.org/10.1016/j.scitotenv.2004.06.020
Balouchi B, Nikoo MR, Adamowski J (2015) Development of expert systems for the prediction of scour depth under live-bed conditions at river confluences: application of different types of ANNs and the M5P model tree. Appl Soft Comput 34:51–59. https://doi.org/10.1016/j.asoc.2015.04.040
Begum BA, Kim E, Biswas SK, Hoopke PK (2004) Investigation of sources of atmospheric aerosol at urban and semi urban areas in Bangladesh. Atmos Environ 38(19):3025–3038. https://doi.org/10.1016/j.atmosenv.2004.02.042
Breiman L (1994) Heuristics of instability in model selection. Technique Report. Statistics Department. University of California at Berkeley
Breiman L (1996) Bagging predictors. Mach Learn 24(2):123–140. https://doi.org/10.1007/BF00058655
Breiman L (1999) Using adaptive bagging to debias regressions. Technical Report 547, Statistics Dept. UCB, p 16. https://www.stat.berkeley.edu/users/breiman/adaptbag99.pdf
Brunekreef B, Holgate ST (2002) Air pollution and health. Lancet 360(9341):1233–1242. https://doi.org/10.1016/S0140-6736(02)11274-8
Burak S, Doğan E, Gazioğlu C (2004) Impact of urbanization and tourism on coastal environment. Ocean Coast Manag 47(9):515–527. https://doi.org/10.1016/j.ocecoaman.2004.07.007
Capilla C (2014) Multilayer perceptron and regression modelling to forecast hourly nitrogen dioxide concentrations. WIT Trans Ecol Environ Spain 7(183):39–48
Chai T, Draxler RR (2014) Root mean square error (RMSE) or mean absolute error (MAE)?–Arguments against avoiding RMSE in the literature. Geosci Model Dev 7:1247–1250. https://doi.org/10.5194/gmd-7-1247-2014
Chehregani A, Majde A, Moin M, Gholami M, Shariatzadeh MA, Nassiri H (2004) Increasing allergy potency of Zinnia pollen grains in polluted areas. Ecotoxicol Environ Saf 58(2):267–272. https://doi.org/10.1016/j.ecoenv.2003.12.004
Chelani AB, Chalapati Rao CV, Phadke KM, Hasan MZ (2002) Prediction of sulphur dioxide concentration using artificial neural networks. Environ Model Softw 17:161–168. https://doi.org/10.1016/S1364-8152(01)00061-5
Dales R, Burnett RT, Smith-Doiron M, Stieb DM, Brook JR (2004) Air pollution and sudden infant death syndrome. PEDIATRICS 113:e628–e631. https://doi.org/10.1542/peds.113.6.e628
Dayev Z, Kairakbaev A, Yetilmezsoy K, Bahramian M, Sihag P, Kıyan E (2021) Approximation of the discharge coefficient of differential pressure flowmeters using different soft computing strategies. Flow Meas Instrum 1(79):101913. https://doi.org/10.1016/j.flowmeasinst.2021.101913
Demir KA (2018) Assessment of exhaust emissions from the local marine survey of the bosphorus, IU, Institute of Marine Sciences and Management Msc. Thesis
Ebtehaj I, Sammen SS, Sidek LM, Malik A, Sihag P, Al-Janabi AM, Chau KW, Bonakdari H (2021) Prediction of daily water level using new hybridized GS-GMDH and ANFIS-FCM models. Eng Appl Comput Fluid Mech 15(1):1343–1361. https://doi.org/10.1080/19942060.2021.1966837
Efron B (1979) Computers and the theory of statistics: thinking the unthinkable. SIAM Rev 21(4):460–480. https://doi.org/10.1137/1021092
Etemad-Shahidi A, Mahjoobi J (2009) Comparison between M5P model tree and neural networks for prediction of significant wave height in Lake Superior. Ocean Eng 36:1175–1181. https://doi.org/10.1016/j.oceaneng.2009.08.008
Gardner MW, Dorling SR (1998) Artificial neural networks (the multilayer perceptron)—a review of applications in the atmospheric sciences. Atmos Environ 32(14–15):2627–2636. https://doi.org/10.1016/S1352-2310(97)00447-0
Gardner MW, Dorling SR (1999) Neural network modelling and prediction of hourly NOx and NO2 concentrations in urban air in London. Atmos Environ 33(5):709–719. https://doi.org/10.1016/S1352-2310(98)00230-1
Ghorani-Azam A, Riahi-Zanjani B, Balali-Mood M (2016) Effects of air pollution on human health and practical measures for prevention in Iran. J Res Med Sci 21:65–72. https://doi.org/10.4103/1735-1995.189646
Guttikunda SK, Goel R, Pant P (2014) Nature of air pollution, emission sources, and management in the Indian cities. Atmos Environ 1(95):501–510. https://doi.org/10.1016/j.atmosenv.2014.07.006
Guttikunda SK, Nishadh KA, Jawahar P (2019) Air pollution knowledge assessments (APnA) for 20 Indian cities. Urban Clim 27:124–141. https://doi.org/10.1016/j.uclim.2018.11.005
Hedley AJ, Wong C-M, Thach TQ, Ma S, Lam T-H, Anderson HR (2002) Cardiorespiratory and all-cause mortality after restrictions on sulphur content of fuel in Hong Kong: an intervention study. Lancet 360:1646–165. https://doi.org/10.1016/s0140-6736(02)11612-6
Hornik K, Stinchcombe M, White H (1989) Multilayer feedforward networks are universal approximators. Neural Netw 2(5):359–366. https://doi.org/10.1016/0893-6080(89)90020-8
Juhos I, Makra L, Tóth B (2008) Forecasting of traffic origin NO and NO2 concentrations by support vector machines and neural networks using principal component analysis. Simul Model Pract Theory 16(9):1488–1502. https://doi.org/10.1016/j.simpat.2008.08.006
Kural G, ÇağlarBalkıs N, Aksu A (2018) Source identification of polycyclic aromatic hydrocarbons (PAHs) in the urban environment of Istanbul. Int J Environ Geoinform (IJEGEO) 5(1):1–16. https://doi.org/10.30897/ijegeo.412341
Lingireddy S, Brion GM (2005) “Artificial neural networks in water supply engineering,” in Artificial Neural Networks in Water Supply Engineering, G. Brion, S. Lingireddy, Virginia, USA: American society of Civil Engineers. pp. 1–9
Ling Z, Huang T, Zhao Y, Li J, Zhang X, Wang J, Lian L, Mao X, Gao H, Ma J (2017) OMI-measured increasing SO 2 emissions due to energy industry expansion and relocation in northwestern China. Atmos Chem Phys 17(14):9115–9131. https://doi.org/10.5194/acp-17-9115-2017
Masih A (2019) Machine learning algorithms in air quality modeling. Global J Environ Sci Manag 5(4):515–534. https://doi.org/10.22034/GJESM.2019.04.10
May RJ, Maier HR, Dandy GC (2010) Data splitting for artificial neural networks using SOM-based stratified sampling. Neural Netw 23(2):283–294. https://doi.org/10.1016/j.neunet.2009.11.009
Molina MJ, Molina LT (2004) Megacities and atmospheric pollution. J Air Waste Manag Assoc 54(6):644–680. https://doi.org/10.1080/10473289.2004.10470936
O'Brien T (2017) Climate change and sulfur dioxide (Doctoral dissertation, University of Pittsburgh)
Parsaie A, Azamathulla HM, Haghiabi AH (2017) Physical and numerical modeling of performance of detention dams. J Hydrol. https://doi.org/10.1016/j.jhydrol.2017.01.018
Parsaie A, Haghiabi AH (2016) Numerical routing of tracer concentrations in rivers with stagnant zones. Water Supply 17(3):825–834. https://doi.org/10.2166/ws.2016.175
Pino-Mejías R, Jiménez-Gamero MD, Cubiles-de-la-Vega MD, Pascual-Acosta A (2008) Reduced bootstrap aggregating of learning algorithms. Pattern Recogn Lett 29(3):265–271. https://doi.org/10.1016/j.patrec.2007.10.002
Quinlan JR (1992) Learning with continuous classes. In 5th Australian joint conference on artificial intelligence. 92:343–348. https://doi.org/10.1142/9789814536271
Russo A, Soares AO (2014) Hybrid model for urban air pollution forecasting: a stochastic spatio-temporal approach. Math Geosci 46(1):75–93. https://doi.org/10.1007/s11004-013-9483-0
Salam A, Bauer H, Kassin K, Ullah SM, Puxbaum H (2003) Aerosol chemical characteristics of a mega-city in southeast Asia (Dhaka, Bangladesh). Atmos Environ 37(18):2517–2528. https://doi.org/10.1016/S1352-2310(03)00135-3
Salmasi F, Nouri M, Sihag P, Abraham J (2021) Application of SVM, ANN, GRNN, RF, GP and RT models for predicting discharge coefficients of oblique sluice gates using experimental data. Water Supply 21(1):232–248. https://doi.org/10.2166/ws.2020.226
Sangeeta, Haji SeyedAsadollah SB, Sharafati A, Sihag P, Al-Ansari N, Chau KW (2021) Machine learning model development for predicting aeration efficiency through Parshall flume. Eng Appl Comput Fluid Mech 15(1):889–901. https://doi.org/10.1080/19942060.2021.1922314
Shaban KB, Kadri A, Rezk E (2016) Urban air pollution monitoring system with forecasting models. IEEE Sens J 16(8):2598–2606. https://doi.org/10.1109/JSEN.2016.2514378
Sharifi SS, Rezaverdinejad V, Nourani V (2016) Estimation of daily global solar radiation using wavelet regression, ANN, GEP and empirical models: a comparative study of selected temperature based approaches. J Atmos Sol TerrPhys. 149:131–145. https://doi.org/10.1016/j.jastp.2016.10.008
Shirzadi A, Shahabi H, Chapi K, Bui DT, Pham BT, Shahedi K, Ahmad BB (2017) A comparative study between popular statistical and machine learning methods for simulating volume of landslides. CATENA 157:213–226. https://doi.org/10.1016/j.catena.2017.05.016
Sihag P, Pandhiani SM, Sangwan V, Kumar M, Angelaki A (2021) Estimation of ground-level O3 using soft computing techniques: case study of Amritsar, Punjab State, India. Int J Environ Sci Technol 3:1–8. https://doi.org/10.1007/s13762-021-03514-9
Singh B, Sihag P, Singh K (2017) Modelling of impact of water quality on infiltration rate of soil by random forest regression. Model Earth Syst Environ 3(3):999–1004. https://doi.org/10.1007/s40808-017-0347-3
Singh KP, Gupta S, Rai P (2013) Identifying pollution sources and predicting urban air quality using ensemble learning methods. Atmos Environ 1(80):426–437. https://doi.org/10.1016/j.atmosenv.2013.08.023
Taylor KE (2001) Summarizing multiple aspects of model performance in a single diagram. J Geophys Res Atmos 106(D7):7183–7192. https://doi.org/10.1029/2000JD900719
Thakur MS, Pandhiani SM, Kashyap V, Upadhya A, Sihag P (2021) Predicting bond strength of FRP bars in concrete using soft computing techniques. Arab J Sci Eng 46(5):4951–4969. https://doi.org/10.1007/s13369-020-05314-8
Turner MC, Krewski D, Pope CA III, Chen Y, Gapstur SM, Thun MJ (2011) Long-term ambient fine particulate matter air pollution and lung cancer in a large cohort of never-smokers. Am J Respir Crit Care Med. 184(12):1374–1381. https://doi.org/10.1164/rccm.201106-1011OC
Upadhya A, Thakur MS, Sharma N, Sihag P (2021) Assessment of soft computing-based techniques for the prediction of Marshall stability of asphalt concrete reinforced with glass fiber. Int J Pavement Res Technol 20:1–20. https://doi.org/10.1007/s42947-021-00094-2
Vlachogianni A, Kassomenos P, Karppinen A, Karakitsios S, Kukkonen J (2011) Evaluation of a multiple regression model for the forecasting of the concentrations of NOx and PM10 in Athens and Helsinki. Sci Total Environ 409(8):1559–1571. https://doi.org/10.1016/j.scitotenv.2010.12.040
Wang K, Chen T, Lau R (2011) Bagging for robust non-linear multivariate calibration of spectroscopy. Chemometr Intell Lab Syst 105(1):1–6. https://doi.org/10.1016/j.chemolab.2010.10.004
Wang X, Bi X, Sheng G, Fu J (2006) Chemical composition and sources of PM10 and PM2.5 aerosols in Guangzhou, China. Environ. Monit Assess. 119 (1–3), 425–439vironment. 2011 Mar 15;409(8):1559–71. https://doi.org/10.1007/s10661-005-9034-3
Willmott CJ, Matsuura K (2005) Advantages of the mean absolute error (MAE) over the root mean square error (RMSE) in assessing average model performance. Clim Res 30:79–82. https://doi.org/10.3354/cr030079
Willmott CJ (1981) On the validation of models. Phys Geogr 2(2):184–194
Willmott CJ (1982) Some comments on the evaluation of model performance. Bull Am Meteorol Soc 63(11):1309–1313. https://doi.org/10.1175/1520-0477(1982)063<1309:SCOTEO>2.0.CO;2
Witten IH, Frank E (2002) Data mining: practical machine learning tools and techniques with Java implementations. ACM SIGMOD Rec 31(1):76–77. https://doi.org/10.1145/507338.507355
Xie Y, Zhao L, Xue J, Hu Q, Xu X, Wang H (2016) A cooperative reduction model for regional air pollution control in China that considers adverse health effects and pollutant reduction costs. Sci Total Environ 15(573):458–469. https://doi.org/10.1016/j.scitotenv.2016.08.077
Yuda M. Asian countries rush to fight toxic air pollution. 2019. Accessed from https://asia.nikkei.com/Economy/Asian-countries-rush-tofight-toxic-air-pollution. Accessed 24 Jan 2019
Zhang H, Wang Y, Hu J, Ying Q, Hu XM (2015) Relationships between meteorological parameters and criteria air pollutants in three megacities in China. Environ Res 140:242–254. https://doi.org/10.1016/j.envres.2015.04.004
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Ethics approval
This article does not contain any studies with human participants or animals performed by any of the authors.
Conflict of interest
The authors declare no competing interests.
Additional information
Responsible Editor: Zhihua Zhang
Rights and permissions
About this article
Cite this article
Devi, S., Esmaeilbeiki, F., Karimi, S.M. et al. Prediction of sulphur dioxide (SO2) in air by using bagging, ANN and M5P: a case study, Gaya and Tirupati, India. Arab J Geosci 15, 613 (2022). https://doi.org/10.1007/s12517-022-09725-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s12517-022-09725-9