Two proteins for the price of one: The design of maximally compressed coding sequences

  • Duke University
  • Stony Brook University

Research output: Contribution to journal › Article › peer-review

Abstract

The emerging field of synthetic biology moves beyond conventional genetic manipulation to construct novel life forms that do not originate in nature. We explore the problem of designing the provably shortest genomic sequence that encodes a given set of genes by exploiting alternate reading frames. We present an algorithm for designing the shortest DNA sequence that simultaneously encodes two given amino acid sequences. We show that the coding sequences of naturally occurring pairs of overlapping genes approach maximum compression. We also investigate the impact of alternate coding matrices on overlapping sequence design. Finally, we discuss an interesting application of overlapping gene design: interleaving an antibiotic resistance gene into a target gene that is inserted into a virus or plasmid for amplification.
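
To make the idea of alternate reading frames concrete, here is a minimal Python sketch. It is not the algorithm from the paper: it only asks whether two equal-length amino acid sequences can be encoded by one DNA string, the first in frame 0 and the second in the +1 frame of the same strand, and it does not minimize length over partial overlaps or consider the reverse strand. The function names `translate` and `design_overlap` are hypothetical, and the standard genetic code is assumed.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Reverse lookup: amino acid -> list of synonymous codons.
CODONS_FOR = {}
for codon, aa in CODON_TABLE.items():
    CODONS_FOR.setdefault(aa, []).append(codon)


def translate(dna, frame=0):
    """Translate every complete codon of `dna` starting at offset `frame`."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(frame, len(dna) - 2, 3))


def design_overlap(prot_a, prot_b):
    """Return a DNA string of length 3n+1 that encodes prot_a in frame 0 and
    prot_b in frame +1, or None if no such string exists (assumption: both
    proteins have the same length n and overlap completely)."""
    if len(prot_a) != len(prot_b):
        raise ValueError("this sketch only handles equal-length proteins")
    if not prot_a:
        return ""
    # frontier: last two bases of the newest frame-0 codon -> one DNA prefix
    # ending in them. Later constraints depend only on those two bases, so
    # one representative prefix per state is enough (at most 16 states).
    frontier = {c[1:]: c for c in CODONS_FOR[prot_a[0]]}
    for i in range(1, len(prot_a)):
        nxt = {}
        for tail, prefix in frontier.items():
            for c in CODONS_FOR[prot_a[i]]:
                # The frame +1 codon straddles two consecutive frame-0 codons.
                if CODON_TABLE[tail + c[0]] == prot_b[i - 1]:
                    nxt.setdefault(c[1:], prefix + c)
        frontier = nxt
    # One extra base completes the final frame +1 codon.
    for tail, prefix in frontier.items():
        for base in BASES:
            if CODON_TABLE[tail + base] == prot_b[-1]:
                return prefix + base
    return None


dna = design_overlap("MAK", "WLN")
print(dna, translate(dna, 0), translate(dna, 1))
print(design_overlap("MM", "MM"))  # None: the +1 frame would read the stop codon TGA
```

The dynamic program works because synonymous codons leave some freedom in each frame-0 codon, and the only information the next step needs is the two trailing bases of the current codon. A full design tool would also have to search over overlap offsets and strands, which this sketch leaves out.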

Original language: English
Pages (from-to): 359-370
Number of pages: 12
Journal: Natural Computing
Volume: 6
Issue number: 4
State: Published - Dec 2007

Keywords

  • Dynamic programming
  • Gene compression
  • Gene design
  • Overlapping genes
  • Protein design
  • Sequence design algorithms
  • Synthetic biology
