Identifying protein domains with the Pfam database

Curr Protoc Bioinformatics. 2008 Sep:Chapter 2:2.5.1-2.5.17. doi: 10.1002/0471250953.bi0205s23.

Abstract

Pfam is a database of protein domain families, with each family represented by multiple sequence alignments and profile hidden Markov models (HMMs). In addition, each family has associated annotation, literature references, and links to other databases. The entries in Pfam are available via the World Wide Web and in flatfile format. This unit contains detailed information on how to access and utilize the information present in the Pfam database, namely the families, multiple alignments, and annotation. Details on running Pfam, both remotely and locally are presented.

MeSH terms

  • Computational Biology / methods*
  • Databases, Nucleic Acid
  • Databases, Protein*
  • Internet
  • Markov Chains
  • Protein Structure, Tertiary*
  • Proteins / analysis
  • Proteins / chemistry
  • Proteins / genetics
  • Sequence Alignment
  • Sequence Analysis, Protein
  • User-Computer Interface

Substances

  • Proteins