The Pfam Protein Families Database

Overview
Specialty Biochemistry
Date 1999 Dec 11
PMID 10592242
Citations 538
Abstract

Pfam is a large collection of protein multiple sequence alignments and profile hidden Markov models. Pfam is available on the WWW in the UK at http://www.sanger.ac.uk/Software/Pfam/, in Sweden at http://www.cgr.ki.se/Pfam/ and in the US at http://pfam.wustl.edu/. The latest version (4.3) of Pfam contains 1815 families. These Pfam families match 63% of proteins in SWISS-PROT 37 and TrEMBL 9. For complete genomes Pfam currently matches up to half of the proteins. Genomic DNA can be directly searched against the Pfam library using the Wise2 package.

Citing Articles

Metagenomic selections reveal diverse antiphage defenses in human and environmental microbiomes.

Rodriguez-Rodriguez L, Pfister J, Schuck L, Martin A, Mercado-Santiago L, Tagliabracci V bioRxiv. 2025.

PMID: 40060627 PMC: 11888456. DOI: 10.1101/2025.02.28.640651.


Potential of 8ER183 for poly(lactic acid)-degrading enzyme production, biodegradative capability, and its whole-genome sequence characterization.

Sujarit K, Pannim B, Kuakkhunthod N, Uywannang U, Sakdapetsiri C, Panyachanakul T 3 Biotech. 2025; 15(3):55.

PMID: 39926107 PMC: 11802947. DOI: 10.1007/s13205-025-04219-3.


ZF-HD gene family in rapeseed (Brassica napus L.): genome-wide identification, phylogeny, evolutionary expansion and expression analyses.

Xu X, Zhou H, Yang Q, Yang Y, Pu X BMC Genomics. 2024; 25(1):1181.

PMID: 39639240 PMC: 11619180. DOI: 10.1186/s12864-024-11102-7.


Metabolic capabilities are highly conserved among human nasal-associated species in pangenomic analyses.

Tran T, F Escapa I, Roberts A, Gao W, Obawemimo A, Segre J mSystems. 2024; 9(12):e0113224.

PMID: 39508593 PMC: 11651106. DOI: 10.1128/msystems.01132-24.


Cyclic Diguanylate in the Wild: Roles During Plant and Animal Colonization.

Isenberg R, Mandel M Annu Rev Microbiol. 2024; 78(1):533-551.

PMID: 39270684 PMC: 11578789. DOI: 10.1146/annurev-micro-041522-101729.

