PUBLICATION

A compact, in vivo screen of all 6-mers reveals drivers of tissue-specific expression and guides synthetic regulatory element design

Authors
Smith, R.P., Riesenfeld, S.J., Holloway, A.K., Li, Q., Murphy, K.K., Feliciano, N.M., Orecchia, L., Oksenberg, N., Pollard, K.S., and Ahituv, N.
ID
ZDB-PUB-130805-2
Date
2013
Source
Genome biology   14(7): R72 (Journal)
Registered Authors
Ahituv, Nadav, Murphy, Karl
Keywords
none
MeSH Terms
  • Animals
  • Base Sequence
  • Dissection
  • Embryo, Nonmammalian/metabolism
  • Enhancer Elements, Genetic
  • Gene Expression Regulation, Developmental*
  • Gene Ontology
  • Molecular Sequence Data
  • Nucleotide Motifs/genetics
  • Oligonucleotides/genetics*
  • Organ Specificity/genetics*
  • Regulatory Sequences, Nucleic Acid/genetics*
  • Synthetic Biology/methods*
  • Zebrafish/embryology
  • Zebrafish/genetics*
PubMed
23867016 Full text @ Genome Biol.
Abstract

Background

Large-scale annotation efforts have improved our ability to coarsely predict regulatory elements throughout vertebrate genomes. However, it is unclear how complex spatiotemporal patterns of gene expression driven by these elements emerge from the activity of short, transcription factor binding sequences.

Results

We describe a comprehensive promoter extension assay in which the regulatory potential of all 6 base-pair (bp) sequences was tested in the context of a minimal promoter. To enable this large-scale screen, we developed algorithms that use a reverse-complement aware decomposition of the de Bruijn graph to design a library of DNA oligomers incorporating every 6-bp sequence exactly once. Our library multiplexes all 4,096 unique 6-mers into 184 double-stranded 15-bp oligomers, which is sufficiently compact for in vivo testing. We injected each multiplexed construct into zebrafish embryos and scored GFP expression in 15 tissues at two developmental time points. Twenty-seven constructs produced consistent expression patterns, with the majority doing so in only one tissue. Functional sequences are enriched near biologically relevant genes, match motifs for developmental transcription factors, and are required for enhancer activity. By concatenating tissue-specific functional sequences, we generated completely synthetic enhancers for the notochord, epidermis, spinal cord, forebrain and otic lateral line, and show that short regulatory sequences do not always function modularly.

Conclusions

This work introduces a unique in vivo catalog of short, functional regulatory sequences and demonstrates several important principles of regulatory element organization. Furthermore, we provide resources for designing compact, reverse-complement aware k-mer libraries.

Genes / Markers
Figures
Show all Figures
Expression
Phenotype
Mutations / Transgenics
Human Disease / Model
Sequence Targeting Reagents
Fish
Antibodies
Orthology
Engineered Foreign Genes
Mapping