Comparison of Four Machine Learning Methods for Predicting PM10 Concentrations in Helsinki, Finland

Zickus, M.; Greig, A. J.; Niranjan, M.

doi:10.1023/A:1021321820639

Comparison of Four Machine Learning Methods for Predicting PM₁₀ Concentrations in Helsinki, Finland

Published: September 2002

Volume 2, pages 717–729, (2002)
Cite this article

Water, Air and Soil Pollution: Focus

M. Zickus¹,
A. J. Greig &
M. Niranjan²

175 Accesses
20 Citations
Explore all metrics

Abstract

Machine learning methods can offer a practicalalternative to deterministic and statistical methods forpredicting air pollution concentrations. However, for agiven data set, it is often not clear beforehand whichmachine learning method will yield the best predictionperformance. This study compares the variable selection andprediction performance of four machine-learning methods ofdifferent complexity: logistic regression, decision tree,multivariate adaptive regression splines and neuralnetwork. The methods are applied to the task of predictingthe exceedance of the European PM₁₀ daily averageobjective of 50 μg m^-3 for a station in Helsinki,Finland. Our study shows that some predictors were selectedby all models but that the different models also pickeddifferent variables. The performance of three of the fourmethods investigated was very similar, however, performanceof the decision tree method was significantly inferior.Performance was sensitive to the learning sample size andtime period used.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Modeling of air pollutants using least square support vector regression, multivariate adaptive regression spline, and M5 model tree models

Article 13 April 2017

Predicting Ozone Layer Concentration Using Multivariate Adaptive Regression Splines, Random Forest and Classification and Regression Tree

Evaluating hourly air quality forecasting in Canada with nonlinear updatable machine learning methods

Article 14 July 2016

References

Berge, E., Walker, S-E., Sorteberg, A., Lenkopane, M. L., Eastwood, S., Jablonska, H. J. and Ødegaard, M.: 2001, ‘A Real Time Operational Forecast Model for Meteorology and Air Quality During Peak Air Pollution Episodes in Oslo, Norway’, Proceedings of 3th International Conference on Urban Air Quality, Loutraki, Greece, March 2001.
Berthold, M. and Hand, D. (eds): 1999, Intelligent Data Analysis, Springer.
Breiman, L., Friedman, J., Olshen, R. and Stone, C.: 1984, Classification and Regression Trees, Wadsworth International Group.
Brodley, C. E.: 1993, ‘Addressing the selective superiority problem: Automatic algorithms/model class selection’, in P. Utgoff (ed.), Proceedings of the Tenth International Conference on Machine Learning, pp. 17–24.
De Leeuw, F., Moussiopoulos, N., Bartonova, A. and Sahm, P.: 2000, ‘Air Quality in Larger Conurbations in the European Union’, European Topic Centre on Air Quality.
Friedman, J. H.: 1991, ‘Multivariate adaptive regression splines (with discussion)’, Ann. Statis. 19,1–141.
Google Scholar
Gardner, M. and Dorling, S., 1998: 'Artificial neural networks (the multi-layer perceptron) – a review of applications in the atmospheric sciences’, Atmos. Environ. 32, 2627–2636
Google Scholar
Gardner, M. and Dorling, S.: 1999, ‘Statistical surface ozone models: an improved methodology to account for non-linear behaviour, Atmos. Environ. 34, 21–34.
Google Scholar
Goldberg, D. E.: 1989, Genetic Algorithms, Reading, MA: Addison Wesley.
Google Scholar
Kennedy, R. L., Yuchun, L., van Roy, B., Reed, C. and Lippman, R.: 1997, ‘Solving Data Mining Problems with Pattern Recognition’, The Data Warehousing Institute Series.
Kooperberg, C., Smarajit, B. and Charles, J.: 1997, ‘Polychotomous regression’, J. Amer. Stat. Assoc. 92, 117–127.
Google Scholar
Pohjola, M., Kousa, A., P. Aarnio, P., Koskentalo, T., Kukkonen, Harkonen, J. and Karppinen, A.: 2000, ‘Meteorological interpretation of measured urban PM_2.5 and PM₁₀ concentrations in Helsinki Metropolitan Area’, Air Pollution VIII, 679–698.
Google Scholar
SPSS, User Manual, Version 9.0.
US EPA: 1999 'Guideline for Developing an Ozone Forecasting Program’, EPA-454/R–99–009.
Zickus, M.: 1999, ‘Influence of Meteorological Parameters on Urban Air Pollution and Its Forecast’, PhD. Thesis, Department of Physics, Vilnius University, 105 pp. Available on Internet: http://195.194.93.120/thesis/.

Download references

Author information

Authors and Affiliations

Anglia Polytechnic University, Cambridge, CBT 1PT, U.K
M. Zickus
University of Sheffield, U.K
M. Niranjan

Authors

M. Zickus
View author publications
You can also search for this author in PubMed Google Scholar
A. J. Greig
View author publications
You can also search for this author in PubMed Google Scholar
M. Niranjan
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zickus, M., Greig, A.J. & Niranjan, M. Comparison of Four Machine Learning Methods for Predicting PM₁₀ Concentrations in Helsinki, Finland. Water, Air, & Soil Pollution: Focus 2, 717–729 (2002). https://doi.org/10.1023/A:1021321820639

Download citation

Issue Date: September 2002
DOI: https://doi.org/10.1023/A:1021321820639

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Comparison of Four Machine Learning Methods for Predicting PM₁₀ Concentrations in Helsinki, Finland

Abstract

Access this article

Similar content being viewed by others

Modeling of air pollutants using least square support vector regression, multivariate adaptive regression spline, and M5 model tree models

Predicting Ozone Layer Concentration Using Multivariate Adaptive Regression Splines, Random Forest and Classification and Regression Tree

Evaluating hourly air quality forecasting in Canada with nonlinear updatable machine learning methods

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Navigation

Comparison of Four Machine Learning Methods for Predicting PM10 Concentrations in Helsinki, Finland

Abstract

Access this article

Similar content being viewed by others

Modeling of air pollutants using least square support vector regression, multivariate adaptive regression spline, and M5 model tree models

Predicting Ozone Layer Concentration Using Multivariate Adaptive Regression Splines, Random Forest and Classification and Regression Tree

Evaluating hourly air quality forecasting in Canada with nonlinear updatable machine learning methods

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation

Comparison of Four Machine Learning Methods for Predicting PM₁₀ Concentrations in Helsinki, Finland