Model Description of Similarity-Based Recommendation Systems

Takafumi Kanamori; Naoya Osugi

doi:10.3390/e21070702

Model Description of Similarity-Based Recommendation Systems

Entropy (Basel). 2019 Jul 17;21(7):702. doi: 10.3390/e21070702.

Authors

Takafumi Kanamori^{1

2}, Naoya Osugi³

Affiliations

¹ Tokyo Institute of Technology, 2-12-1 Ookayama, Meguro-ku, Tokyo 152-8552, Japan.
² RIKEN AIP, Nihonbashi 1-chome Mitsui Building, 15th floor, 1-4-1 Nihonbashi, Chuo-ku, Tokyo 103-0027, Japan.
³ Recruit Technologies Co., Ltd., GranTokyo South Tower, 1-9-2 Marunouchi, Chiyoda-ku, Tokyo 100-6640, Japan.

Abstract

The quality of online services highly depends on the accuracy of the recommendations they can provide to users. Researchers have proposed various similarity measures based on the assumption that similar people like or dislike similar items or people, in order to improve the accuracy of their services. Additionally, statistical models, such as the stochastic block models, have been used to understand network structures. In this paper, we discuss the relationship between similarity-based methods and statistical models using the Bernoulli mixture models and the expectation-maximization (EM) algorithm. The Bernoulli mixture model naturally leads to a completely positive matrix as the similarity matrix. We prove that most of the commonly used similarity measures yield completely positive matrices as the similarity matrix. Based on this relationship, we propose an algorithm to transform the similarity matrix to the Bernoulli mixture model. Such a correspondence provides a statistical interpretation to similarity-based methods. Using this algorithm, we conduct numerical experiments using synthetic data and real-world data provided from an online dating site, and report the efficiency of the recommendation system based on the Bernoulli mixture models.

Keywords: bernoulli mixture models; completely positive matrix; recommendation; similarity measures.

Grants and funding

KAKENHI 15H01678, 15H03636, 16K00044, and 19H04071/Japan Society for the Promotion of Science