CLSWeb Main
Caltech Library System
Electronic Theses
                  About | Browse | Search | Caltech Student Instructions

Endelman, Jeffrey B (2005-05-12) Design and analysis of combinatorial protein libraries created by site-directed recombination. http://resolver.caltech.edu/CaltechETD:etd-06022005-192548


Type of Document Dissertation
Author Endelman, Jeffrey B
URN etd-06022005-192548
Persistent URL http://resolver.caltech.edu/CaltechETD:etd-06022005-192548
Title Design and analysis of combinatorial protein libraries created by site-directed recombination
Degree PhD
Option Bioengineering
Advisory Committee
Advisor Name Title
Frances Hamilton Arnold Committee Chair
Niles A. Pierce Committee Member
Stephen L. Mayo Committee Member
Zhen-Gang Wang Committee Member
Keywords
  • protein design
  • optimization
  • logistic regression
  • directed evolution
Date of Defense 2005-05-12
Availability unrestricted
Abstract
For many protein design problems, limited understanding of the relationship between sequence and function necessitates searching through a library of proteins to find the properties of interest. To accelerate this process, molecular models and optimization algorithms can be combined to design diverse libraries enriched in folded proteins. I apply this strategy to site-directed recombination, in which an alignment of p homologs is partitioned into f blocks, and the resulting gene fragments are combinatorially assembled to create a library with p^f chimeric sequences. To design the fragments, I present a dynamic programming algorithm that minimizes the average energy of the library, subject to constraints on fragment length. This algorithm works for any pairwise residue potential, several of which are compared for their ability to predict which chimeras retain the parental function and/or fold. The alignments of folded and unfolded chimeras are used to generate sequence-function relationships via logistic regression, a technique for fitting models to binary data. Compared to methods developed for alignments of naturally occurring proteins, logistic regression more readily distinguishes true interactions from correlations between strongly stabilizing but non-interacting residues.
Files
  Filename       Size       Approximate Download Time (Hours:Minutes:Seconds) 
 
 28.8 Modem   56K Modem   ISDN (64 Kb)   ISDN (128 Kb)   Higher-speed Access 
  thesis_final.pdf 774.90 Kb 00:03:35 00:01:50 00:01:36 00:00:48 00:00:04

Browse All Available ETDs by ( Author | Option )

If you have more questions or technical problems, please Contact the Caltech Library System.