» Articles » PMID: 36222162

Leveraging Deep Learning Algorithms for Synthetic Data Generation to Design and Analyze Biological Networks

Overview
Journal J Biosci
Specialties Biochemistry
Biology
Date 2022 Oct 12
PMID 36222162
Authors
Affiliations
Soon will be listed here.
Abstract

The use of synthetic data is gaining an increasingly prominent role in data and machine learning workflows to build better models and conduct analyses with greater statistical inference. In the domains of healthcare and biomedical research, synthetic data may be seen in structured and unstructured formats. Concomitant with the adoption of synthetic data, a sub-discipline of machine learning known as deep learning has taken the world by storm. At a larger scale, deep learning methods tend to outperform traditional methods in regression and classification tasks. These techniques are also used in generative modeling and are thus prime candidates for generating synthetic data in both structured and unstructured formats. Here, we emphasize the generation of synthetic data in healthcare and biomedical research using deep learning methods for unstructured data formats such as text and images. Deep learning methods leverage the neural network algorithm, and in the context of generative modeling, several neural network architectures can create new synthetic data for a problem at hand including, but not limited to, recurrent neural networks (RNNs), variational autoencoders (VAEs), and generative adversarial networks (GANs). To better understand these methods, we will look at specific case studies such as generating realistic clinical notes of a patient, the generation of synthetic DNA sequences, as well as to enrich experimental data collected during the study of heterotypic cultures of cancer cells.

Citing Articles

Deep learning and generative artificial intelligence in aging research and healthy longevity medicine.

Wilczok D Aging (Albany NY). 2025; 17(1):251-275.

PMID: 39836094 PMC: 11810058. DOI: 10.18632/aging.206190.


Prediction of viral oncoproteins through the combination of generative adversarial networks and machine learning techniques.

Beltran J, Herrera-Belen L, Yanez A, Jimenez L Sci Rep. 2024; 14(1):27108.

PMID: 39511292 PMC: 11543823. DOI: 10.1038/s41598-024-77028-y.


Getting real about synthetic data ethics : Are AI ethics principles a good starting point for synthetic data ethics?.

Shanley D, Hogenboom J, Lysen F, Wee L, Lobo Gomes A, Dekker A EMBO Rep. 2024; 25(5):2152-2155.

PMID: 38388694 PMC: 11094102. DOI: 10.1038/s44319-024-00101-0.


Leveraging the Academic Artificial Intelligence Silecosystem to Advance the Community Oncology Enterprise.

McDonnell K J Clin Med. 2023; 12(14).

PMID: 37510945 PMC: 10381436. DOI: 10.3390/jcm12144830.


Schooling of light reflecting fish.

Pertzelan A, Ariel G, Kiflawi M PLoS One. 2023; 18(7):e0289026.

PMID: 37478091 PMC: 10361475. DOI: 10.1371/journal.pone.0289026.