Cluster analysis of rural, urban, and curbside atmospheric particle size data

Environ Sci Technol. 2009 Jul 1;43(13):4694-700. doi: 10.1021/es803121t.

Abstract

Particle size is a key determinant of the hazard posed by airborne particles. Continuous multivariate particle size data have been collected using aerosol particle size spectrometers sited at four locations within the UK: Harwell (Oxfordshire); Regents Park (London); British Telecom Tower (London); and Marylebone Road (London). These data have been analyzed using k-means cluster analysis, deduced to be the preferred cluster analysis technique, selected from an option of four partitional cluster packages, namelythe following: Fuzzy; k-means; k-median; and Model-Based clustering. Using cluster validation indices k-means clustering was shown to produce clusters with the smallest size, furthest separation, and importantly the highest degree of similarity between the elements within each partition. Using k-means clustering, the complexity of the data set is reduced allowing characterization of the data according to the temporal and spatial trends of the clusters. At Harwell, the rural background measurement site, the cluster analysis showed that the spectra may be differentiated by their modal-diameters and average temporal trends showing either high counts during the day-time or night-time hours. Likewise for the urban sites, the cluster analysis differentiated the spectra into a small number of size distributions according their modal-diameter, the location of the measurement site, and time of day. The responsible aerosol emission, formation, and dynamic processes can be inferred according to the cluster characteristics and correlation to concurrently measured meteorological, gas phase, and particle phase measurements.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Aerosols
  • Air Pollutants / analysis*
  • Atmosphere
  • Cities
  • Cluster Analysis
  • Environmental Exposure
  • Environmental Monitoring / methods*
  • Fuzzy Logic
  • Humans
  • Multivariate Analysis
  • Particle Size*
  • Reproducibility of Results
  • Rural Health
  • United Kingdom
  • Urban Health
  • Vehicle Emissions*

Substances

  • Aerosols
  • Air Pollutants
  • Vehicle Emissions