
A Survey of Biological Data in a Big Data Perspective

Overview
Journal Big Data
Date 2022 Apr 8
PMID 35394342
Abstract

The amount of available data is growing continuously, a phenomenon that has given rise to the concept of big data. The key technologies associated with big data are cloud computing (infrastructure) and Not Only SQL (NoSQL; data storage). For data analysis, machine learning algorithms such as decision trees, support vector machines, artificial neural networks, and clustering techniques have shown promising results. In a biological context, big data has many applications because of the large number of biological databases available. Some limitations of biological big data stem from inherent features of the data, such as high complexity and heterogeneity, since biological systems yield information at scales ranging from the atomic level to interactions among organisms and with their environment. These characteristics make most bioinformatics applications difficult to build, configure, and maintain. Although the rise of big data is relatively recent, it has contributed to a better understanding of the underlying mechanisms of life. The main goal of this article is to provide a concise and reliable survey of the application of big data-related technologies in biology. To that end, fundamental concepts of information technology, including storage resources, analysis, and data sharing, are described along with their relation to biological data.

Citing Articles

Development and evaluation of a training curriculum to engage researchers on accessing and analyzing the All of Us data.

Coleman J, Baker J, Ketkar S, Butler A, Williams L, Hammonds-Odie L. J Am Med Inform Assoc. 2024; 31(12):2857-2868.

PMID: 39269931 PMC: 11631121. DOI: 10.1093/jamia/ocae240.


Identification of biomarkers in multiple myeloma: A comprehensive study combining microarray analysis and Mendelian randomization.

Zhu Y, Liu J, Wang B. J Cell Mol Med. 2024; 28(12):e18504.

PMID: 38923838 PMC: 11200096. DOI: 10.1111/jcmm.18504.


CREDO: a friendly Customizable, REproducible, DOcker file generator for bioinformatics applications.

Alessandri S, Ratto M, Rabellino S, Piacenti G, Contaldo S, Pernice S. BMC Bioinformatics. 2024; 25(1):110.

PMID: 38475691 PMC: 10935966. DOI: 10.1186/s12859-024-05695-9.


Conceptual breakthroughs of the long noncoding RNA functional system and its endogenous regulatory role in the cancerous regime.

Wang A. Explor Target Antitumor Ther. 2024; 5(1):170-186.

PMID: 38464381 PMC: 10918237. DOI: 10.37349/etat.2024.00211.


Y chromosome sequence and epigenomic reconstruction across human populations.

Esteller-Cucala P, Palmada-Flores M, Kuderna L, Fontsere C, Serres-Armero A, Dabad M. Commun Biol. 2023; 6(1):623.

PMID: 37296226 PMC: 10256797. DOI: 10.1038/s42003-023-05004-9.