» Articles » PMID: 34078279

Identifying Genomic Islands with Deep Neural Networks

Overview
Journal BMC Genomics
Publisher Biomed Central
Specialty Genetics
Date 2021 Jun 3
PMID 34078279
Citations 3
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Horizontal gene transfer is the main source of adaptability for bacteria, through which genes are obtained from different sources including bacteria, archaea, viruses, and eukaryotes. This process promotes the rapid spread of genetic information across lineages, typically in the form of clusters of genes referred to as genomic islands (GIs). Different types of GIs exist, and are often classified by the content of their cargo genes or their means of integration and mobility. While various computational methods have been devised to detect different types of GIs, no single method is capable of detecting all types.

Results: We propose a method, which we call Shutter Island, that uses a deep learning model (Inception V3, widely used in computer vision) to detect genomic islands. The intrinsic value of deep learning methods lies in their ability to generalize. Via a technique called transfer learning, the model is pre-trained on a large generic dataset and then re-trained on images that we generate to represent genomic fragments. We demonstrate that this image-based approach generalizes better than the existing tools.

Conclusions: We used a deep neural network and an image-based approach to detect the most out of the correct GI predictions made by other tools, in addition to making novel GI predictions. The fact that the deep neural network was re-trained on only a limited number of GI datasets and then successfully generalized indicates that this approach could be applied to other problems in the field where data is still lacking or hard to curate.

Citing Articles

Current state and future prospects of Horizontal Gene Transfer detection.

Wijaya A, Anzel A, Richard H, Hattab G NAR Genom Bioinform. 2025; 7(1):lqaf005.

PMID: 39935761 PMC: 11811736. DOI: 10.1093/nargab/lqaf005.


Comparative genomics and virulence potential of Campylobacter coli strains isolated from different sources over 25 years in Brazil.

Gomes C, Felice A, Pereira G, Ceballos V, de Castro Soares S, Tonani L BMC Microbiol. 2024; 24(1):512.

PMID: 39614143 PMC: 11607955. DOI: 10.1186/s12866-024-03642-5.


Synteruptor: mining genomic islands for non-classical specialized metabolite gene clusters.

Haas D, Barba M, Vicente C, Nezbedova S, Garenaux A, Bury-Mone S NAR Genom Bioinform. 2024; 6(2):lqae069.

PMID: 38915823 PMC: 11195616. DOI: 10.1093/nargab/lqae069.


Detecting operons in bacterial genomes via visual representation learning.

Assaf R, Xia F, Stevens R Sci Rep. 2021; 11(1):2124.

PMID: 33483546 PMC: 7822928. DOI: 10.1038/s41598-021-81169-9.

References
1.
Langille M, Hsiao W, Brinkman F . Detecting genomic islands using bioinformatics approaches. Nat Rev Microbiol. 2010; 8(5):373-82. DOI: 10.1038/nrmicro2350. View

2.
Hacker J, Bender L, Ott M, Wingender J, Lund B, Marre R . Deletions of chromosomal regions coding for fimbriae and hemolysins occur in vitro and in vivo in various extraintestinal Escherichia coli isolates. Microb Pathog. 1990; 8(3):213-25. DOI: 10.1016/0882-4010(90)90048-u. View

3.
Hudson C, Lau B, Williams K . Islander: a database of precisely mapped genomic islands in tRNA and tmRNA genes. Nucleic Acids Res. 2014; 43(Database issue):D48-53. PMC: 4383910. DOI: 10.1093/nar/gku1072. View

4.
Barondess J, Beckwith J . A bacterial virulence determinant encoded by lysogenic coliphage lambda. Nature. 1990; 346(6287):871-4. DOI: 10.1038/346871a0. View

5.
Dobrindt U, Hochhut B, Hentschel U, Hacker J . Genomic islands in pathogenic and environmental microorganisms. Nat Rev Microbiol. 2004; 2(5):414-24. DOI: 10.1038/nrmicro884. View