PUBLICATION

Comparative genomics of elastin: Sequence analysis of a highly repetitive protein

Authors
He, D., Chung, M., Chan, E., Alleyne, T., Ha, K.C., Miao, M., Stahl, R.J., Keeley, F.W., and Parkinson, J.
ID
ZDB-PUB-070726-11
Date
2007
Source
Matrix biology : journal of the International Society for Matrix Biology   26(7): 524-540 (Journal)
Registered Authors
Keywords
Elastin, Sequence analysis, Evolution, Sequence?structure?function relationships, Graph theory
MeSH Terms
  • Zebrafish
  • Xenopus
  • Evolution, Molecular
  • Elastin/chemistry
  • Elastin/genetics*
  • Ranidae
  • Xenopus Proteins/genetics
  • Phylogeny
  • Humans
  • Molecular Sequence Data
  • Chickens
  • Amino Acid Sequence
  • Animals
  • Zebrafish Proteins/genetics
PubMed
17628459 Full text @ Matrix Biol.
Abstract
Due to the low complexity associated with their sequences, uncovering the evolutionary and functional relationships in highly repetitive proteins such as elastin, spider silks, resilin and abductin represents a significant challenge. Using the polymeric extracellular protein elastin as a model system, we present a novel computational approach to the study of sequence, function and evolutionary relationships in repetitive proteins. To address the absence of accurate sequence annotation for repetitive proteins such as elastin, we have constructed a new database repository, ElastoDB (http://theileria.ccb.sickkids.ca/elastin), dedicated to the storage and retrieval of elastin sequence- and meta-data. To analyse their sequence relationships we have devised an innovative new method, based on the identification of overrepresented 'fuzzy' motifs. Applying this method to elastin sequences derived from mammals, chicken, Xenopus and zebrafish resulted in the identification of both highly conserved, and taxon and species specific motifs that likely represent important functional and/or structural elements. The relative spacing and organization of these elements suggest that exon duplication events have played an important role in the evolution of elastin. Clustering of similarity profiles generated for sets of exons and introns, revealed a pattern of putative duplication events involving exons 15-30 in mammalian and chicken elastins, exons 20-31 in both zebrafish elastins, exons 15-20 in fugu elastin and exons 35-50 in Xenopus elastin 1. The success of this approach for elastin offers a promising route to the elucidation of sequence, structure, function and evolutionary relationships for many other proteins with sequences of low complexity.
Genes / Markers
Figures
No images available
Expression
Phenotype
Mutations / Transgenics
Human Disease / Model
Sequence Targeting Reagents
Fish
Antibodies
Orthology
Engineered Foreign Genes
Mapping