ZFIN ID: ZDB-PUB-200403-38
Phylogeny of teleost connexins reveals highly inconsistent intra- and interspecies use of nomenclature and misassemblies in recent teleost chromosome assemblies
Mikalsen, S.O., Tausen, M., Í Kongsstovu, S.
Date: 2020
Source: BMC Genomics   21: 223 (Journal)
Registered Authors:
Keywords: Connexins, Genome duplication, Mammals, Nomenclature, Ohnologs, Orthologs, Paralogs, Phylogenetic trees, Teleosts
MeSH Terms:
  • Animals
  • Connexins/genetics*
  • Fish Proteins/genetics
  • Fishes/genetics*
  • Multigene Family
  • Phylogeny
  • Sequence Analysis, DNA/methods*
  • Terminology as Topic
PubMed: 32160866 Full text @ BMC Genomics
FIGURES
ABSTRACT
Based on an initial collecting of database sequences from the gap junction protein gene family (also called connexin genes) in a few teleosts, the naming of these sequences appeared variable. The reasons could be (i) that the structure in this family is variable across teleosts, or (ii) unfortunate naming. Rather clear rules for the naming of genes in fish and mammals have been outlined by nomenclature committees, including the naming of orthologous and ohnologous genes. We therefore analyzed the connexin gene family in teleosts in more detail. We covered the range of divergence times in teleosts (eel, Atlantic herring, zebrafish, Atlantic cod, three-spined stickleback, Japanese pufferfish and spotted pufferfish; listed from early divergence to late divergence).
The gene family pattern of connexin genes is similar across the analyzed teleosts. However, (i) several nomenclature systems are used, (ii) specific orthologous groups contain genes that are named differently in different species, (iii) several distinct genes have the same name in a species, and (iv) some genes have incorrect names. The latter includes a human connexin pseudogene, claimed as GJA4P, but which in reality is Cx39.2P (a delta subfamily gene often called GJD2like). We point out the ohnologous pairs of genes in teleosts, and we suggest a more consistent nomenclature following the outlined rules from the nomenclature committees. We further show that connexin sequences can indicate some errors in two high-quality chromosome assemblies that became available very recently.
Minimal consistency exists in the present practice of naming teleost connexin genes. A consistent and unified nomenclature would be an advantage for future automatic annotations and would make various types of subsequent genetic analyses easier. Additionally, roughly 5% of the connexin sequences point out misassemblies in the new high-quality chromosome assemblies from herring and cod.
ADDITIONAL INFORMATION