Spatially weighted functional clustering of river network data

J R Stat Soc Ser C Appl Stat. 2015 Apr;64(3):491-506. doi: 10.1111/rssc.12082. Epub 2014 Oct 14.

Abstract

Incorporating spatial covariance into clustering has previously been considered for functional data to identify groups of functions which are similar across space. However, in the majority of situations that have been considered until now the most appropriate metric has been Euclidean distance. Directed networks present additional challenges in terms of estimating spatial covariance due to their complex structure. Although suitable river network covariance models have been proposed for use with stream distance, where distance is computed along the stream network, these models have not been extended for contexts where the data are functional, as is often the case with environmental data. The paper develops a method of calculating spatial covariance between functions from sites along a river network and applies the measure as a weight within functional hierarchical clustering. Levels of nitrate pollution on the River Tweed in Scotland are considered with the aim of identifying groups of monitoring stations which display similar spatiotemporal characteristics.

Keywords: Covariance; Functional data; Hierarchical clustering; River networks; Water quality.