Proteins with homo-repeats of more than 4 amino acid residues in length were examined to understand whether some splicing sites in pre-mRNA may be attributed to homo-repeats in human proteins. The human proteome was found to contain a total of 404 proteins with homo-repeats that account for at least one splicing site in pre-mRNA. Pre-mRNA splicing sites were more often found in the C-terminal part (67%) than in the middle orN-terminal part of a homo-repeat. Ten homo-repeats were identified to have two splicing sites per repeat. The repeats were lysine homo-repeats in all but one case.
Keywords: disease; disordered regions; homo-repeat; splicing.