
IgMAT: Immunoglobulin Sequence Multi-species Annotation Tool for Any Species Including Those with Incomplete Antibody Annotation or Unusual Characteristics

Overview
Publisher Biomed Central
Specialty Biology
Date 2023 Dec 22
PMID 38129777
Abstract

Background: The advent and continual improvement of high-throughput sequencing technologies have made immunoglobulin repertoire sequencing accessible and informative regardless of study species. However, to fully map dynamic changes in polyclonal responses, precise annotation of the framework and complementarity-determining regions of rearranging genes is pivotal. Most sequence annotation tools are designed primarily for human and mouse antibody sequences; they rely on databases with fixed species lists and apply very specific assumptions that select against unique structural characteristics. For this reason, data-agnostic tools that can learn from the presented data can be very useful with new species or novel datasets.

Results: We have developed IgMAT, which uses a reduced amino acid alphabet and incorporates multiple HMM alignments into a single consensus to automatically annotate immunoglobulin sequences from most organisms. Additionally, the software allows the incorporation of user-defined databases to better represent the species and/or antibody class of interest. To demonstrate the accuracy and utility of IgMAT, we present an analysis of sequences extracted from structural data and of immunoglobulin sequence datasets from several different species.
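
As an illustration of the two ideas named in the Results, the following Python sketch shows what a reduced amino acid alphabet and a per-position consensus over several alignments could look like. It is not taken from IgMAT: the residue grouping, the function names `reduce_sequence` and `consensus_labels`, and the single-letter region labels (F for framework, C for CDR) are all assumptions made for this example, and simple majority voting stands in for whatever consensus rule the tool actually applies.

```python
from collections import Counter

# Hypothetical grouping by broad physicochemical class.
# This is an illustrative assumption, NOT the alphabet used by IgMAT.
REDUCED_GROUPS = {
    "A": "AVLIMC",  # aliphatic / hydrophobic
    "F": "FWYH",    # aromatic
    "S": "STNQ",    # polar, uncharged
    "K": "KR",      # positively charged
    "D": "DE",      # negatively charged
    "G": "G",       # glycine kept on its own
    "P": "P",       # proline kept on its own
}

# Invert the grouping into a residue -> representative lookup table.
TO_REDUCED = {aa: rep for rep, members in REDUCED_GROUPS.items() for aa in members}


def reduce_sequence(seq):
    """Map each residue to its group representative; unknown symbols become 'X'."""
    return "".join(TO_REDUCED.get(aa, "X") for aa in seq.upper())


def consensus_labels(label_rows):
    """Combine equally long per-position label strings by majority vote.

    Each row is assumed to come from one alignment and to assign one
    label per residue (here 'F' for framework, 'C' for CDR).
    """
    if len({len(row) for row in label_rows}) != 1:
        raise ValueError("all label rows must be non-empty in number and equal in length")
    return "".join(Counter(column).most_common(1)[0][0] for column in zip(*label_rows))


if __name__ == "__main__":
    # Conservative substitutions collapse to the same reduced string.
    print(reduce_sequence("KVEL"), reduce_sequence("RIDM"))
    # Three hypothetical alignments that disagree at two positions.
    print(consensus_labels(["FFCCF", "FFCFF", "FCCCF"]))
```

Under this grouping, sequences that differ only by conservative substitutions map to the same reduced string, which is one plausible reason a reduced alphabet helps when the species of interest is poorly represented in reference databases.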

Conclusions: IgMAT is fully open source and freely available on GitHub (https://github.com/TPI-Immunogenetics/igmat) for download under the GPLv3 license. It can be used as a CLI application or as a Python module integrated into custom scripts.

Citing Articles

A Customizable Suite of Methods to Sequence and Annotate Cattle Antibodies.

Ramirez Valdez K, Nzau B, Dorey-Robinson D, Jarman M, Nyagwange J, Schwartz J. Vaccines (Basel). 2023; 11(6).

PMID: 37376488 PMC: 10302312. DOI: 10.3390/vaccines11061099.
