PUBLICATION

A genome-wide survey demonstrates widespread non-linear mRNA in expressed sequences from multiple species

Authors
Dixon, R.J., Eperon, I.C., Hall, L., and Samani, N.J.
ID
ZDB-PUB-051101-5
Date
2005
Source
Nucleic acids research   33(18): 5904-5913 (Journal)
Registered Authors
Keywords
none
MeSH Terms
  • Alternative Splicing*
  • Animals
  • Computational Biology/methods
  • Exons*
  • Expressed Sequence Tags/chemistry
  • Genome*
  • Genomics/methods
  • Humans
  • Introns
  • Mice
  • Open Reading Frames
  • RNA, Messenger/chemistry*
  • Rats
PubMed
16237125 Full text @ Nucleic Acids Res.
Abstract
We describe here the results of the first genome-wide survey of candidate exon repetition events in expressed sequences from human, mouse, rat, chicken, zebrafish and fly. Exon repetition is a rare event, reported in <10 genes, in which one or more exons is tandemly duplicated in mRNA but not in the gene. To identify candidates, we analysed database sequences for mRNA transcripts in which the order of the spliced exons does not follow the linear genomic order of the individual gene [events we term rearrangements or repetition in exon order (RREO)]. Using a computational approach, we have identified 245 genes in mammals that produce RREO events. RREO in mRNA occurs predominantly in the coding regions of genes. However, exon 1 is never involved. Analysis of the open reading frames suggests that this process may increase protein diversity and regulate protein expression via nonsense-mediated RNA decay. The sizes of the exons and introns involved around these events suggest a gene model structure that may facilitate non-linear splicing. These findings imply that RREO affects a significant subset of genes within a genome and suggests that non-linear information encoded within the genomes of complex organisms could contribute to phenotypic variation.
Genes / Markers
Figures
Expression
Phenotype
Mutations / Transgenics
Human Disease / Model
Sequence Targeting Reagents
Fish
Antibodies
Orthology
Engineered Foreign Genes
Mapping