
Machine Learning in a Data-limited Regime: Augmenting Experiments with Synthetic Data Uncovers Order in Crumpled Sheets

Overview
Journal: Sci Adv
Specialties: Biology, Science
Date: 2019 Apr 30
PMID: 31032399
Citations: 10
Abstract

Machine learning has gained widespread attention as a powerful tool to identify structure in complex, high-dimensional data. However, these techniques are ostensibly inapplicable for experimental systems where data are scarce or expensive to obtain. Here, we introduce a strategy to resolve this impasse by augmenting the experimental dataset with synthetically generated data of a much simpler sister system. Specifically, we study spontaneously emerging local order in crease networks of crumpled thin sheets, a paradigmatic example of spatial complexity, and show that machine learning techniques can be effective even in a data-limited regime. This is achieved by augmenting the scarce experimental dataset with inexhaustible amounts of simulated data of rigid flat-folded sheets, which are simple to simulate and share common statistical properties. This considerably improves the predictive power in a test problem of pattern completion and demonstrates the usefulness of machine learning in bench-top experiments where data are good but scarce.

Citing Articles

WDRIV-Net: a weighted ensemble transfer learning to improve automatic type stratification of lumbar intervertebral disc bulge, prolapse, and herniation.

Nakamoto I, Chen H, Wang R, Guo Y, Chen W, Feng J. Biomed Eng Online. 2025; 24(1):11.

PMID: 39915867 PMC: 11800529. DOI: 10.1186/s12938-025-01341-4.


The Constrained Disorder Principle Overcomes the Challenges of Methods for Assessing Uncertainty in Biological Systems.

Ilan Y. J Pers Med. 2025; 15(1).

PMID: 39852203 PMC: 11767140. DOI: 10.3390/jpm15010010.


Accurately predicting hit songs using neurophysiology and machine learning.

Merritt S, Gaffuri K, Zak P. Front Artif Intell. 2023; 6:1154663.

PMID: 37408542 PMC: 10318137. DOI: 10.3389/frai.2023.1154663.


Automated data preparation for tumor characterization with machine learning.

Krajnc D, Spielvogel C, Grahovac M, Ecsedi B, Rasul S, Poetsch N. Front Oncol. 2022; 12:1017911.

PMID: 36303841 PMC: 9595446. DOI: 10.3389/fonc.2022.1017911.


Machine learning for a sustainable energy future.

Yao Z, Lum Y, Johnston A, Mejia-Mendoza L, Zhou X, Wen Y. Nat Rev Mater. 2022; 8(3):202-215.

PMID: 36277083 PMC: 9579620. DOI: 10.1038/s41578-022-00490-5.

