Reconstructing protein-coding sequences from ancient DNA

Methods Enzymol. 2020:642:21-33. doi: 10.1016/bs.mie.2020.05.008. Epub 2020 Jun 19.

Abstract

Obtaining information about functional details of proteins of extinct species is of critical importance for a better understanding of the real-life appearance, behavior and ecology of these lost entries in the book of life. In this chapter, we discuss the possibilities to retrieve the necessary DNA sequence information from paleogenomic data obtained from fossil specimens, which can then be used to express and subsequently analyze the protein of interest. We discuss the problems specific to ancient DNA, including miscoding lesions, short read length and incomplete paleogenome assemblies. Finally, we discuss an alternative, but currently rarely used approach, direct PCR amplification, which is especially useful for comparatively short proteins.

Keywords: Ancient DNA; Genome assembly; Mapping; Paleogenomics; Re-sequencing; Reference sequence; SRAssembler; Targeted assembly.

MeSH terms

  • Base Sequence
  • DNA, Ancient*
  • Fossils*
  • Polymerase Chain Reaction
  • Sequence Analysis, DNA

Substances

  • DNA, Ancient