Pfam: The Protein Families Database in 2021

Overview
Specialty Biochemistry
Date 2020 Oct 30
PMID 33125078
Citations 2336
Abstract

The Pfam database is a widely used resource for classifying protein sequences into families and domains. Since Pfam was last described in this journal, over 350 new families have been added in Pfam 33.1 and numerous improvements have been made to existing entries. To facilitate research on COVID-19, we have revised the Pfam entries that cover the SARS-CoV-2 proteome, and built new entries for regions that were not covered by Pfam. We have reintroduced Pfam-B which provides an automatically generated supplement to Pfam and contains 136 730 novel clusters of sequences that are not yet matched by a Pfam family. The new Pfam-B is based on a clustering by the MMseqs2 software. We have compared all of the regions in the RepeatsDB to those in Pfam and have started to use the results to build and refine Pfam repeat families. Pfam is freely available for browsing and download at http://pfam.xfam.org/.

Citing Articles

Sporophyte-directed gametogenesis in Arabidopsis.

Sivakumar P, Pandey S, Ramesha A, Davda J, Singh A, Kumar C Nat Plants. 2025.

PMID: 40087543 DOI: 10.1038/s41477-025-01932-y.


Genome mining the black-yeast Aureobasidium pullulans NRRL 62031 for biotechnological traits.

Xiao D, Driller M, Stein K, Blank L, Tiso T BMC Genomics. 2025; 26(1):244.

PMID: 40082747 PMC: 11905612. DOI: 10.1186/s12864-025-11395-2.


Genome-Wide Identification and Expression Analysis of the Gene Family in Banana () Under Various Nitrogen Conditions.

Zhang B, Wang W, Wang C, Cai B, Feng J, Zhou D Int J Mol Sci. 2025; 26(5).

PMID: 40076789 PMC: 11900138. DOI: 10.3390/ijms26052168.


Genome-Wide Exploration and Characterization of the Gene Family's Expression Patterns in Response to Abiotic Stresses in Siberian Wildrye ( L.).

Liu T, Peng J, Dong Z, Liu Y, Wu J, Xiong Y Int J Mol Sci. 2025; 26(5).

PMID: 40076552 PMC: 11900556. DOI: 10.3390/ijms26051925.


An acyl-homoserine lactone acylase found in Stenotrophomonas maltophilia exhibits both quorum quenching activity and the ability to degrade penicillin antibiotics.

Bravo M, Conchillo-Sole O, Coves X, Garcia-Navarro A, Gomez A, Marquez-Martinez M Sci Rep. 2025; 15(1):8557.

PMID: 40074792 PMC: 11903891. DOI: 10.1038/s41598-025-92749-4.

