Predicting mucin-type O-Glycosylation using enhancement value products from derived protein features

J Theor Comput Chem. 2020 May;19(3):2040003. doi: 10.1142/s0219633620400039. Epub 2020 Jun 15.

Abstract

Mucin-type O-glycosylation is one of the most common post-translational modifications of proteins. This glycosylation is initiated in the Golgi by the addition of the sugar N-acetylgalactosamine (GalNAc) onto protein Ser and Thr residues by a family of polypeptide GalNAc transferases. In humans there are 20 isoforms that are differentially expressed across tissues that serve multiple important biological roles. Using random peptide substrates, isoform specific amino acid preferences have been obtained in the form of enhancement values (EV). These EVs alone have previously been used to predict O-glycosylation sites via the web based ISOGlyP (Isoform Specific O-Glycosylation Prediction) tool. Here we explore additional protein features to determine whether these can complement the random peptide derived enhancement values and increase the predictive power of ISOGlyP. The inclusion of additional protein substrate features (such as secondary structure and surface accessibility) was found to increase sensitivity with minimal loss of specificity, when tested with three different published in vivo O-glycoproteomics data sets, thus increasing the overall accuracy of the ISOGlyP predictions.

Keywords: GalNAc transferases; ISOGlyP; O-glycosylation prediction; enhancement value; post-translational modification prediction.