Inflammatory bowel disease (IBD) is a chronic disease whose incidence and prevalence increase every year; however, the pathogenesis of IBD is still unclear. Thus, identifying IBD-related proteins is important for understanding its complex disease mechanism. Here, we propose a new and simple network-based approach using a reverse k-nearest neighbor ( R k NN ) search to identify novel IBD-related proteins. Protein-protein interactions (PPI) and Genome-Wide Association Studies (GWAS) were used in this study. After constructing the PPI network, the R k NN search was applied to all of the proteins to identify sets of influenced proteins among their k-nearest neighbors ( R k NNs ). An observed protein whose influenced proteins were mostly known IBD-related proteins was statistically identified as a novel IBD-related protein. Our method outperformed a random aspect, k NN search, and centrality measures based on the network topology. A total of 39 proteins were identified as IBD-related proteins. Of these proteins, 71% were reported at least once in the literature as related to IBD. Additionally, these proteins were found over-represented in the IBD pathway and enriched in importantly functional pathways in IBD. In conclusion, the R k NN search with the statistical enrichment test is a great tool to identify IBD-related proteins to better understand its complex disease mechanism.
Keywords: Inflammatory bowel disease; disease-related protein; protein–protein interaction network; reverse k-nearest neighbor search.