An Efficient and Provable Approach for Mixture Proportion Estimation Using Linear Independence Assumption

Xiyu Yu; Tongliang Liu; Mingming Gong; Kayhan Batmanghelich; Dacheng Tao

doi:10.1109/CVPR.2018.00471

Conf Comput Vis Pattern Recognit Workshops. 2018 Jun;2018:4480-4489. doi: 10.1109/CVPR.2018.00471. Epub 2018 Dec 17.

Authors

Xiyu Yu¹, Tongliang Liu¹, Mingming Gong²,³, Kayhan Batmanghelich², Dacheng Tao¹

Affiliations

¹ UBTECH Sydney AI Centre, SIT, FEIT, The University of Sydney, Australia.
² Department of Biomedical Informatics, University of Pittsburgh.
³ Department of Philosophy, Carnegie Mellon University.

Abstract

In this paper, we study the mixture proportion estimation (MPE) problem in a new setting: given samples from the mixture and from the component distributions, we identify the proportions of the components in the mixture distribution. To address this problem, we make use of a linear independence assumption, i.e., the component distributions are linearly independent of each other, which is much weaker than the assumptions exploited by previous MPE methods. Based on this assumption, we propose a method (1) that uniquely identifies the mixture proportions, (2) whose output provably converges to the optimal solution, and (3) that is computationally efficient. We show the superiority of the proposed method over state-of-the-art methods in two applications, learning with label noise and semi-supervised learning, on both synthetic and real-world datasets.
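The setting described in the abstract can be illustrated with a toy sketch. This is not the authors' algorithm, which the abstract does not specify. It assumes discrete distributions over a small finite support and fits the mixture histogram as a least-squares combination of the component histograms. All names (`histogram`, `solve_linear`, `estimate_proportions`), the example probabilities, and the final clip-and-renormalise step are hypothetical choices made for illustration. The point it shows is that the Gram matrix of the component histograms is invertible exactly when they are linearly independent, which is what makes the proportions uniquely recoverable.

```python
import random


def histogram(samples, n_bins):
    """Empirical probability of each outcome 0..n_bins-1."""
    counts = [0] * n_bins
    for s in samples:
        counts[s] += 1
    return [c / len(samples) for c in counts]


def solve_linear(a, b):
    """Solve a x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [x - f * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def estimate_proportions(mixture_samples, component_samples, n_bins):
    """Least-squares fit of the mixture histogram by component histograms.

    Illustrative assumption: discrete data and a plain normal-equations
    solve, not the method proposed in the paper.
    """
    p = [histogram(s, n_bins) for s in component_samples]
    q = histogram(mixture_samples, n_bins)
    k = len(p)
    # Gram matrix of the components; invertible iff they are linearly independent.
    gram = [[sum(x * y for x, y in zip(p[i], p[j])) for j in range(k)]
            for i in range(k)]
    rhs = [sum(x * y for x, y in zip(p[i], q)) for i in range(k)]
    w = solve_linear(gram, rhs)
    # Heuristic post-processing: clip negatives and renormalise to sum to one.
    w = [max(x, 0.0) for x in w]
    total = sum(w)
    return [x / total for x in w]


# Synthetic check with made-up component distributions and proportions.
rng = random.Random(0)
comp_probs = [[0.7, 0.2, 0.1], [0.1, 0.3, 0.6]]
true_w = [0.3, 0.7]


def draw(probs, n):
    return rng.choices(range(len(probs)), weights=probs, k=n)


components = [draw(p, 20000) for p in comp_probs]
mix_probs = [sum(w * p[b] for w, p in zip(true_w, comp_probs)) for b in range(3)]
mixture = draw(mix_probs, 20000)
est = estimate_proportions(mixture, components, 3)
print(est)  # expected to land near true_w, up to sampling noise
```

If two components had identical (or linearly dependent) histograms, the Gram matrix would be singular and the solve would fail, reflecting that the proportions are then not identifiable from the data.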

Grants and funding

R01 HL141813/HL/NHLBI NIH HHS/United States