Noncoding sequences conserved in a limited number of mammals in the SIM2 interval are frequently functional

Genome Res. 2004 Mar;14(3):367-72. doi: 10.1101/gr.1961204. Epub 2004 Feb 12.

Abstract

Cross-species DNA sequence comparison is a fundamental method for identifying biologically important elements, because functional sequences are evolutionarily conserved, wheres nonfunctional sequences drift. A recent genome-wide comparison of human and mouse DNA discovered over 200,000 conserved noncoding sequences with unknown function. Multispecies DNA comparison has been proposed as a method to prioritize these conserved noncoding sequences for functional analysis based on the hypothesis that elements present in many species are more likely to be functional than elements present in limited numbers of species. Here, we perform a comparative analysis of the single-minded 2 (SIM2) gene interval on human chromosome 21 with horse, cow, pig, dog, cat, and mouse DNA. We classify conserved sequences based on the number of mammals in which they are present, and experimentally test sequences in each class for function. As hypothesized, conserved sequences present in many mammals are frequently functional. Additionally, we demonstrate that sequences conserved in a limited number of mammals are also frequently functional. Examination of genomic deletions in chimpanzee and rhesus macaque DNA showed that several putatively functional conserved noncoding human sequences were absent in these primates. These findings suggest that functional conserved noncoding human sequences can be missing in other mammals, even closely related primate species.

Publication types

  • Comparative Study
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Animals
  • Basic Helix-Loop-Helix Transcription Factors
  • Cats
  • Cattle
  • Chromosome Deletion
  • Chromosomes, Artificial, Bacterial / genetics
  • Chromosomes, Human, Pair 21 / genetics
  • Cloning, Molecular
  • Computational Biology / methods
  • Conserved Sequence / genetics*
  • Conserved Sequence / physiology
  • DNA, Intergenic / classification
  • DNA, Intergenic / genetics*
  • DNA, Intergenic / physiology
  • Dogs
  • Evolution, Molecular
  • Horses / genetics
  • Humans
  • Macaca mulatta / genetics
  • Mice
  • Pan troglodytes / genetics
  • Regulatory Sequences, Nucleic Acid
  • Sequence Homology, Nucleic Acid
  • Swine / genetics
  • Transcription Factors / classification
  • Transcription Factors / genetics*
  • Transcription Factors / physiology

Substances

  • Basic Helix-Loop-Helix Transcription Factors
  • DNA, Intergenic
  • SIM2 protein, human
  • Sim2 protein, mouse
  • Transcription Factors