Methods for estimating uncertainty in PMF solutions: examples with ambient air and water quality data and guidance on reporting PMF results

Sci Total Environ. 2015 Jun 15:518-519:626-35. doi: 10.1016/j.scitotenv.2015.01.022. Epub 2015 Mar 13.

Abstract

The new version of EPA's positive matrix factorization (EPA PMF) software, 5.0, includes three error estimation (EE) methods for analyzing factor analytic solutions: classical bootstrap (BS), displacement of factor elements (DISP), and bootstrap enhanced by displacement (BS-DISP). These methods capture the uncertainty of PMF analyses due to random errors and rotational ambiguity. To demonstrate the utility of the EE methods, results are presented for three data sets: (1) speciated PM2.5 data from a chemical speciation network (CSN) site in Sacramento, California (2003-2009); (2) trace metal, ammonia, and other species in water quality samples taken at an inline storage system (ISS) in Milwaukee, Wisconsin (2006); and (3) an organic aerosol data set from high-resolution aerosol mass spectrometer (HR-AMS) measurements in Las Vegas, Nevada (January 2008). We present an interpretation of EE diagnostics for these data sets, results from sensitivity tests of EE diagnostics using additional and fewer factors, and recommendations for reporting PMF results. BS-DISP and BS are found useful in understanding the uncertainty of factor profiles; they also suggest if the data are over-fitted by specifying too many factors. DISP diagnostics were consistently robust, indicating its use for understanding rotational uncertainty and as a first step in assessing a solution's viability. The uncertainty of each factor's identifying species is shown to be a useful gauge for evaluating multiple solutions, e.g., with a different number of factors.

Keywords: Air pollution; EPA PMF; Positive matrix factorization; Receptor modeling; Water pollution.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Air Pollutants / analysis*
  • Air Pollution / analysis
  • Air Pollution / statistics & numerical data*
  • Environmental Monitoring / methods*
  • Particulate Matter / analysis*
  • Uncertainty

Substances

  • Air Pollutants
  • Particulate Matter