PUBLICATION

Whole genome duplications and expansion of the vertebrate GATA transcription factor gene family

Authors
Gillis, W.Q., St John, J., Bowerman, B., and Schneider, S.Q.
ID
ZDB-PUB-091130-3
Date
2009
Source
BMC Evolutionary Biology 9: 207 (Journal)
Keywords
none
MeSH Terms
  • Animals
  • Chordata, Nonvertebrate/genetics
  • Comparative Genomic Hybridization
  • Conserved Sequence
  • Evolution, Molecular*
  • GATA Transcription Factors/genetics*
  • Gene Duplication*
  • Genome
  • Invertebrates/genetics
  • Multigene Family*
  • Phylogeny
  • RNA Splice Sites
  • Sequence Analysis, DNA
  • Synteny
  • Vertebrates/genetics*
PubMed
19695090
Abstract
BACKGROUND: GATA transcription factors influence many developmental processes, including the specification of embryonic germ layers. The GATA gene family has significantly expanded in many animal lineages: whereas diverse cnidarians have only one GATA transcription factor, six GATA genes have been identified in many vertebrates, five in many insects, and eleven to thirteen in Caenorhabditis nematodes. All bilaterian animal genomes have at least one member each of two classes, GATA123 and GATA456.
RESULTS: We have identified one GATA123 gene and one GATA456 gene from the genomic sequence of two invertebrate deuterostomes, a cephalochordate (Branchiostoma floridae) and a hemichordate (Saccoglossus kowalevskii). We have also confirmed the presence of six GATA genes in all vertebrate genomes, as well as additional GATA genes in teleost fish. Analyses of conserved sequence motifs and of changes to the exon-intron structure, together with molecular phylogenetic analyses of these deuterostome GATA genes, support their origin from two ancestral deuterostome genes, one GATA123 and one GATA456. Comparison of the conserved genomic organization across vertebrates identified eighteen paralogous gene families linked to multiple vertebrate GATA genes (GATA paralogons), providing the strongest evidence yet for expansion of vertebrate GATA gene families via genome duplication events.
CONCLUSION: From our analysis, we infer the evolutionary birth order and relationships among vertebrate GATA transcription factors, and define their expansion via multiple rounds of whole genome duplication events. As the genomes of four independent invertebrate deuterostome lineages contain single copy GATA123 and GATA456 genes, we infer that the 0R (pre-genome duplication) invertebrate deuterostome ancestor also had two GATA genes, one of each class. Synteny analyses identify duplications of paralogous chromosomal regions (paralogons), from single ancestral vertebrate GATA123 and GATA456 chromosomes to four paralogons after the first round of vertebrate genome duplication, to seven paralogons after the second round of vertebrate genome duplication, and to fourteen paralogons after the fish-specific 3R genome duplication. The evolutionary analysis of GATA gene origins and relationships may inform understanding of vertebrate GATA factor redundancies and specializations.