Genome-wide analysis of pseudogenes reveals HBBP1's human-specific essentiality in erythropoiesis and implication in β-thalassemia

Dev Cell. 2021 Feb 22;56(4):478-493.e11. doi: 10.1016/j.devcel.2020.12.019. Epub 2021 Jan 20.

Abstract

The human genome harbors 14,000 duplicated or retroposed pseudogenes. Given their functionality as regulatory RNAs and low conservation, we hypothesized that pseudogenes could shape human-specific phenotypes. To test this, we performed co-expression analyses and found that pseudogene exhibited tissue-specific expression, especially in the bone marrow. By incorporating genetic data, we identified a bone-marrow-specific duplicated pseudogene, HBBP1 (η-globin), which has been implicated in β-thalassemia. Extensive functional assays demonstrated that HBBP1 is essential for erythropoiesis by binding the RNA-binding protein (RBP), HNRNPA1, to upregulate TAL1, a key regulator of erythropoiesis. The HBBP1/TAL1 interaction contributes to a milder symptom in β-thalassemia patients. Comparative studies further indicated that the HBBP1/TAL1 interaction is human-specific. Genome-wide analyses showed that duplicated pseudogenes are often bound by RBPs and less commonly bound by microRNAs compared with retropseudogenes. Taken together, we not only demonstrate that pseudogenes can drive human evolution but also provide insights on their functional landscapes.

Keywords: HBBP1; HNRNPA1; TAL1; erythropoiesis; functional pseudogenes; gene duplication; hematopoiesis; human-specific essentiality; retroposition; β-thalassemia.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Binding, Competitive
  • Bone Marrow / metabolism
  • Cell Differentiation / genetics
  • Cell Line
  • Erythroid Cells / metabolism
  • Erythroid Cells / pathology
  • Erythropoiesis / genetics*
  • Globins / genetics*
  • Heterogeneous Nuclear Ribonucleoprotein A1 / metabolism
  • Humans
  • Organ Specificity / genetics
  • Protein Binding
  • Protein Stability
  • Pseudogenes*
  • RNA Stability
  • RNA, Long Noncoding / genetics
  • RNA, Long Noncoding / metabolism
  • RNA, Messenger / genetics
  • RNA, Messenger / metabolism
  • Species Specificity
  • T-Cell Acute Lymphocytic Leukemia Protein 1 / genetics
  • T-Cell Acute Lymphocytic Leukemia Protein 1 / metabolism
  • beta-Thalassemia / genetics*

Substances

  • Heterogeneous Nuclear Ribonucleoprotein A1
  • RNA, Long Noncoding
  • RNA, Messenger
  • T-Cell Acute Lymphocytic Leukemia Protein 1
  • hnRNPA1 protein, human
  • TAL1 protein, human
  • Globins