» Articles » PMID: 26125026

Trends in IT Innovation to Build a Next Generation Bioinformatics Solution to Manage and Analyse Biological Big Data Produced by NGS Technologies

Overview
Journal Biomed Res Int
Publisher Wiley
Date 2015 Jul 1
PMID 26125026
Citations 12
Authors
Affiliations
Soon will be listed here.
Abstract

Sequencing the human genome began in 1994, and 10 years of work were necessary in order to provide a nearly complete sequence. Nowadays, NGS technologies allow sequencing of a whole human genome in a few days. This deluge of data challenges scientists in many ways, as they are faced with data management issues and analysis and visualization drawbacks due to the limitations of current bioinformatics tools. In this paper, we describe how the NGS Big Data revolution changes the way of managing and analysing data. We present how biologists are confronted with abundance of methods, tools, and data formats. To overcome these problems, focus on Big Data Information Technology innovations from web and business intelligence. We underline the interest of NoSQL databases, which are much more efficient than relational databases. Since Big Data leads to the loss of interactivity with data during analysis due to high processing time, we describe solutions from the Business Intelligence that allow one to regain interactivity whatever the volume of data is. We illustrate this point with a focus on the Amadea platform. Finally, we discuss visualization challenges posed by Big Data and present the latest innovations with JavaScript graphic libraries.

Citing Articles

JSONWP: a static website generator for protein bioinformatics research.

Kilinc M, Jia K, Jernigan R Bioinform Adv. 2023; 3(1):vbad154.

PMID: 37904893 PMC: 10613403. DOI: 10.1093/bioadv/vbad154.


Linear epitope mapping of the humoral response against SARS-CoV-2 in two independent African cohorts.

Vigan-Womas I, Spadoni J, Poiret T, Taieb F, Randrianarisaona F, Faye R Sci Rep. 2023; 13(1):782.

PMID: 36646780 PMC: 9842613. DOI: 10.1038/s41598-023-27810-1.


Machine Learning Assisted Cervical Cancer Detection.

Mehmood M, Rizwan M, Gregus Ml M, Abbas S Front Public Health. 2022; 9:788376.

PMID: 35004588 PMC: 8733205. DOI: 10.3389/fpubh.2021.788376.


Social innovation for life expectancy extension utilizing a platform-centered system used in the Iwaki health promotion project: A protocol paper.

Nakaji S, Ihara K, Sawada K, Parodi S, Umeda T, Takahashi I SAGE Open Med. 2021; 9:20503121211002606.

PMID: 33796303 PMC: 7985939. DOI: 10.1177/20503121211002606.


Bioinformatics Workflows With NoSQL Database in Cloud Computing.

Wercelens P, da Silva W, Hondo F, Castro K, Walter M, Araujo A Evol Bioinform Online. 2019; 15:1176934319889974.

PMID: 31839702 PMC: 6896126. DOI: 10.1177/1176934319889974.


References
1.
Oinn T, Addis M, Ferris J, Marvin D, Senger M, Greenwood M . Taverna: a tool for the composition and enactment of bioinformatics workflows. Bioinformatics. 2004; 20(17):3045-54. DOI: 10.1093/bioinformatics/bth361. View

2.
Bult C, White O, Olsen G, Zhou L, Fleischmann R, Sutton G . Complete genome sequence of the methanogenic archaeon, Methanococcus jannaschii. Science. 1996; 273(5278):1058-73. DOI: 10.1126/science.273.5278.1058. View

3.
Wang S, Pandis I, Wu C, He S, Johnson D, Emam I . High dimensional biological data retrieval optimization with NoSQL technology. BMC Genomics. 2014; 15 Suppl 8:S3. PMC: 4248814. DOI: 10.1186/1471-2164-15-S8-S3. View

4.
Luscombe N, Greenbaum D, Gerstein M . What is bioinformatics? An introduction and overview. Yearb Med Inform. 2016; (1):83-99. View

5.
SINSHEIMER R . The Santa Cruz Workshop--May 1985. Genomics. 1989; 5(4):954-6. DOI: 10.1016/0888-7543(89)90142-0. View