A novel kernel Wasserstein distance on Gaussian measures: An application of identifying dental artifacts in head and neck computed tomography

Comput Biol Med. 2020 May;120:103731. doi: 10.1016/j.compbiomed.2020.103731. Epub 2020 Mar 26.

Abstract

The Wasserstein distance is a powerful metric grounded in the theory of optimal mass transport. It gives a natural measure of the distance between two distributions and has a wide range of applications. In contrast to common divergences on distributions such as the Kullback–Leibler or Jensen–Shannon divergence, it is (weakly) continuous and thus well suited to analyzing corrupted and noisy data. Until recently, however, no kernel methods for handling nonlinear data had been proposed via the Wasserstein distance. In this work, we develop a novel method to compute the L2-Wasserstein distance in reproducing kernel Hilbert spaces (RKHS), called the kernel L2-Wasserstein distance, which is implemented using the kernel trick, a general machine-learning technique for handling data in a nonlinear manner. We evaluate the proposed approach on the task of identifying computed tomography (CT) slices with dental artifacts in head and neck cancer, performing unsupervised hierarchical clustering on the Wasserstein distance matrix computed from imaging texture features extracted from each CT slice. We further compare the performance of the kernel Wasserstein distance with alternatives, including the kernel Kullback–Leibler divergence we previously developed. Our experiments show that the kernel approach outperforms classical non-kernel approaches in identifying CT slices with artifacts.

Keywords: Kernel Kullback–Leibler divergence; Kernel Wasserstein distance; Kernel trick; Reproducing kernel Hilbert space.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms*
  • Artifacts*
  • Machine Learning
  • Normal Distribution
  • Tomography, X-Ray Computed