DEFOG: a Practical Scheme for Deciphering Families of Genes
Authors
Affiliations
We developed a novel efficient scheme, DEFOG (for "deciphering families of genes"), for determining sequences of numerous genes from a family of interest. The scheme provides a powerful means to obtain a gene family composition in species for which high-throughput genomic sequencing data are not available. DEFOG uses two key procedures. The first is a novel algorithm for designing highly degenerate primers based on a set of known genes from the family of interest. These primers are used in PCR reactions to amplify the members of the gene family. The second combines oligofingerprinting of the cloned PCR products with clustering of the clones based on their fingerprints. By selecting members from each cluster, a low-redundancy clone subset is chosen for sequencing. We applied the scheme to the human olfactory receptor (OR) genes. OR genes constitute the largest gene superfamily in the human genome, as well as in the genomes of other vertebrate species. DEFOG almost tripled the size of the initial repertoire of human ORs in a single experiment, and only 7% of the PCR clones had to be sequenced. Extremely high degeneracies, reaching over a billion combinations of distinct PCR primer pairs, proved to be very effective and yielded only 0.4% nonspecific products.
Expression signatures of long non-coding RNA and mRNA in human traumatic brain injury.
Yang L, Yang L, Zhu J, Chen J, Wang Y, Xiong K Neural Regen Res. 2019; 14(4):632-641.
PMID: 30632503 PMC: 6352599. DOI: 10.4103/1673-5374.247467.
Gulvik C, Effler T, Wilhelm S, Buchan A PLoS One. 2012; 7(11):e50362.
PMID: 23189198 PMC: 3506598. DOI: 10.1371/journal.pone.0050362.
Validation of an NSP-based (negative selection pattern) gene family identification strategy.
L Frank R, Kandoth C, Ercal F BMC Bioinformatics. 2008; 9 Suppl 9:S2.
PMID: 18793465 PMC: 2537573. DOI: 10.1186/1471-2105-9-S9-S2.
Designing multiple degenerate primers via consecutive pairwise alignments.
Najafabadi H, Torabi N, Chamankhah M BMC Bioinformatics. 2008; 9:55.
PMID: 18221562 PMC: 2253518. DOI: 10.1186/1471-2105-9-55.
Gilad Y, Wiebe V, Przeworski M, Lancet D, Paabo S PLoS Biol. 2004; 2(1):E5.
PMID: 14737185 PMC: 314465. DOI: 10.1371/journal.pbio.0020005.