PUBLICATION

Principles of long noncoding RNA evolution derived from direct comparison of transcriptomes in 17 species

Authors
Hezroni, H., Koppstein, D., Schwartz, M.G., Avrutin, A., Bartel, D.P., Ulitsky, I.
ID
ZDB-PUB-170614-9
Date
2015
Source
Cell Reports   11(7): 1110-22 (Journal)
Registered Authors
Keywords
none
MeSH Terms
  • Base Sequence
  • Conserved Sequence/genetics
  • Evolution, Molecular*
  • Humans
  • RNA, Long Noncoding/genetics*
  • Sequence Analysis, RNA
  • Transcriptome/genetics*
PubMed
25959816 Full text @ Cell Rep.
Abstract

The inability to predict long noncoding RNAs from genomic sequence has impeded the use of comparative genomics for studying their biology. Here, we develop methods that use RNA sequencing (RNA-seq) data to annotate the transcriptomes of 16 vertebrates and the echinoid sea urchin, uncovering thousands of previously unannotated genes, most of which produce long intervening noncoding RNAs (lincRNAs). Although in each species, >70% of lincRNAs cannot be traced to homologs in species that diverged >50 million years ago, thousands of human lincRNAs have homologs with similar expression patterns in other species. These homologs share short, 5'-biased patches of sequence conservation nested in exonic architectures that have been extensively rewired, in part by transposable element exonization. Thus, over a thousand human lincRNAs are likely to have conserved functions in mammals, and hundreds beyond mammals, but those functions require only short patches of specific sequences and can tolerate major changes in gene architecture.

Genes / Markers
Figures
Expression
Phenotype
Mutations / Transgenics
Human Disease / Model
Sequence Targeting Reagents
Fish
Antibodies
Orthology
Engineered Foreign Genes
Mapping