A Hypergraph-based Method for Unification of Existing Protein Structure- and Sequence-families
Overview
Affiliations
Classification of proteins is a major challenge in bioinformatics. Here an approach is presented, that unifies different existing classifications of protein structures and sequences. Protein structural domains are represented as nodes in a hypergraph. Shared memberships in sequence families result in hyperedges in the graph. The presented method partitions the hypergraph into clusters of structural domains. Each computed cluster is based on a set of shared sequence family memberships. Thus, the clusters put existing protein sequence families into the context of structural family hierarchies. Conversely, structural domains are related to their sequence family memberships, which can be used to gain further knowledge about the respective structural families.
Insight into the structures of Interleukin-18 systems.
Roy U Comput Biol Chem. 2020; 88:107353.
PMID: 32769049 PMC: 7392904. DOI: 10.1016/j.compbiolchem.2020.107353.