» Articles » PMID: 19376134

Calculating Complexity of Large Randomized Libraries

Overview
Journal J Theor Biol
Publisher Elsevier
Specialty Biology
Date 2009 Apr 21
PMID 19376134
Citations 4
Authors
Affiliations
Soon will be listed here.
Abstract

Randomized libraries are increasingly popular in protein engineering and other biomedical research fields. Statistics of the libraries are useful to guide and evaluate randomized library construction. Previous works only give the mean of the number of unique sequences in the library, and they can only handle equal molar ratio of the four nucleotides at a small number of mutation sites. We derive formulas to calculate the mean and variance of the number of unique sequences in libraries generated by cassette mutagenesis with mixtures of arbitrary nucleotide ratios. Computer program was developed which utilizes arbitrary numerical precision software package to calculate the statistics of large libraries. The statistics of library with mutations in more than 20 amino acids can be calculated easily. Results show that the nucleotide ratios have significant effects on these statistics. The more skewed the ratio, the larger the library size is needed to obtain the same expected number of unique sequences. The program is freely available at http://graphics.med.yale.edu/cgi-bin/lib_comp.pl.

Citing Articles

Sampling Strategies for Experimentally Mapping Molecular Fitness Landscapes Using High-Throughput Methods.

Chen S, Liu J, Van Nynatten A, Tudor-Price B, Chang B J Mol Evol. 2024; 92(4):402-414.

PMID: 38886207 DOI: 10.1007/s00239-024-10179-8.


Biomathematical description of synthetic peptide libraries.

Sieber T, Hare E, Hofmann H, Trepel M PLoS One. 2015; 10(6):e0129200.

PMID: 26042419 PMC: 4456392. DOI: 10.1371/journal.pone.0129200.


A highly scalable peptide-based assay system for proteomics.

Kozlov I, Thomsen E, Munchel S, Villegas P, capek P, Gower A PLoS One. 2012; 7(6):e37441.

PMID: 22701568 PMC: 3373263. DOI: 10.1371/journal.pone.0037441.


When second best is good enough: another probabilistic look at saturation mutagenesis.

Nov Y Appl Environ Microbiol. 2011; 78(1):258-62.

PMID: 22038607 PMC: 3255629. DOI: 10.1128/AEM.06265-11.