Performance assessment of air quality monitoring networks using principal component analysis and cluster analysis

https://doi.org/10.1016/j.buildenv.2010.09.004Get rights and content

Abstract

This study aims to evaluate the performance of two statistical methods, principal component analysis and cluster analysis, for the management of air quality monitoring network of Hong Kong and the reduction of associated expenses. The specific objectives include: (i) to identify city areas with similar air pollution behavior; and (ii) to locate emission sources. The statistical methods were applied to the mass concentrations of sulphur dioxide (SO2), respirable suspended particulates (RSP) and nitrogen dioxide (NO2), collected in monitoring network of Hong Kong from January 2001 to December 2007.

The results demonstrate that, for each pollutant, the monitoring stations are grouped into different classes based on their air pollution behaviors. The monitoring stations located in nearby area are characterized by the same specific air pollution characteristics and suggested with an effective management of air quality monitoring system. The redundant equipments should be transferred to other monitoring stations for allowing further enlargement of the monitored area. Additionally, the existence of different air pollution behaviors in the monitoring network is explained by the variability of wind directions across the region. The results imply that the air quality problem in Hong Kong is not only a local problem mainly from street-level pollutions, but also a region problem from the Pearl River Delta region.

Introduction

Hong Kong is one of the most developed metropolitans and characterized by the highest population and traffic densities in the world. With the continuous economic development and the population expansion, several environmental problems especially air quality problem, have become severe and attracted much attention in recent years [2], [3], [5], [6], [7], [10], [20]. To control and improve air quality, Hong Kong Environmental Protection Department (HKEPD) has established fourteen monitoring stations to monitor and manage. These stations are distributed in different area of Hong Kong and equipped with various instruments for detecting corresponding pollutants. Some of them are located closely in geography or characterized by similar monitoring condition, which generally present similar behaviors. It probably results in an inefficient usage of the resource and extra expense cost. Hence, it is necessary and significant to optimize the air quality monitoring network using the practical alternative methods such as principal components analysis (PCA) and cluster analysis (CA).

PCA is a statistical technique that transforms the original set of inter-correlated variables into a new set of an equal number of independent uncorrelated variables or principal components that have linear combinations to the original variables. The multicollinearity which probably implied between original variables can be removed through application of it [16], [17]. On the other hand, CA is a classification method used to split a data set into a number of groups of observations which are distinct in terms of typical group values of the variables [11], [12], [13]. The aim of it is to maximize between-group variance and to minimize within-group variance.

In previous studies, these two approaches are combined to explore significant information from the origin data. Gramsch et al. [4] used the PCA and CA to determine the seasonal trends and spatial distribution of PM10 and O3 in Santiago de Chile, concluding that the city had four large sectors with dissimilar air pollution behaviors. Shah and Shaheen [14] employed them to identify the major source of airborne trace metals in area of Islamabad. Pires et al. [12], [13] applied them in the mass concentration of SO2, PM10, CO, NO2 and O3 for the efficient management of air quality in Oporto Metropolitan Area. They all verified that these two methods could complement each other and the combination of them could provide a practical alternative approach for analyzing and solving environmental problem.

In this study, PCA approach was employed as well as CA for optimizing and managing the air quality monitoring stations in Hong Kong. The aim of this paper was to assess the performance of PCA and CA for the management of SO2, RSP and NO2 mass concentrations monitoring with the following specific objectives: (i) to identify city areas with similar air pollution behavior; and (ii) to locate emission sources.

Section snippets

Sample location and available data

Air quality in Hong Kong is greatly affected by the vehicle density on roads, coupled with the hilly geography and cavernous streets. For a better management of air quality in Hong Kong, fourteen monitoring stations have been established by HKEPD since 1999. They are generally classified into two categories in term of the experimental location, roadside stations, i.e., Causeway Bay (CB), Central (CT), Mong Kok (MK), and general stations, i.e., Central/Western (CW), Sham Shui Po (SS), Eastern

Results and discussion

Table 2 shows the main results of the PCA application at all stations for pollutants of SO2, RSP and NO2. From the table, it can be seen that the first three PCs explain 69.1% of the original data variance for analyzing the SO2 concentrations, followed by PC4, PC5 and PC6 components with cumulative variance of 75.4%, 80.0% and 83.7% respectively. In the first PC (PC1), the stations of KC, TW, and ST seem to present similar characteristics and they are combined to take up much loading on

Comments and suggestions

Aiming at the identification of city areas with similar pollution behaviors in Hong Kong, two statistical methods, PCA and CA, were applied on three pollutants at fourteen stations. The grouped clusters for SO2 and RSP pollutants were evidently dependent on the geography while that for the NO2 pollutant was dependent on the monitoring condition. The identification of city areas with the same specific pollutant behaviors at many stations suggests an ineffective management of the AQMN. The

Acknowledgement

The work presented in this paper was partially supported by research grants from City University of Hong Kong, HKSAR, China (Projects No. CityU-SRG 7002370) and the National Natural Science Foundation of China (Grant Nos 10662002, 10672098 and 10532020).

References (21)

There are more references available in the full text version of this article.

Cited by (97)

  • Comparison of different multicriteria decision-making methodologies for sustainability decision making

    2023, Hydrogen Economy: Processes, Supply Chain, Life Cycle Analysis and Energy Transition for Sustainability
View all citing articles on Scopus
View full text