Machine Learning in a Data-limited Regime: Augmenting Experiments with Synthetic Data Uncovers Order in Crumpled Sheets
Machine learning has gained widespread attention as a powerful tool to identify structure in complex, high-dimensional data. However, these techniques are ostensibly inapplicable for experimental systems where data are scarce or expensive to obtain. Here, we introduce a strategy to resolve this impasse by augmenting the experimental dataset with synthetically generated data of a much simpler sister system. Specifically, we study spontaneously emerging local order in crease networks of crumpled thin sheets, a paradigmatic example of spatial complexity, and show that machine learning techniques can be effective even in a data-limited regime. This is achieved by augmenting the scarce experimental dataset with inexhaustible amounts of simulated data of rigid flat-folded sheets, which are simple to simulate and share common statistical properties. This considerably improves the predictive power in a test problem of pattern completion and demonstrates the usefulness of machine learning in bench-top experiments where data are good but scarce.
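The augmentation strategy described above can be illustrated with a minimal sketch. It is not the authors' pipeline: the function name `build_training_set`, the `synthetic_per_real` ratio, and the string stand-ins for crease-network samples are all hypothetical, chosen only to show how a small experimental set might be combined with a much larger pool of simulated data.

```python
import random


def build_training_set(experimental, synthetic, synthetic_per_real=10, seed=0):
    """Combine every experimental sample with a random draw of synthetic ones.

    Each returned item is a (sample, source) pair, so a downstream model or
    loss function can treat the two sources differently if desired.
    The ratio synthetic_per_real is an illustrative assumption.
    """
    rng = random.Random(seed)
    # Cap the draw at the size of the synthetic pool.
    n_synth = min(len(synthetic), synthetic_per_real * len(experimental))
    drawn = rng.sample(synthetic, n_synth)

    combined = [(x, "experimental") for x in experimental]
    combined += [(x, "synthetic") for x in drawn]
    rng.shuffle(combined)
    return combined


# Toy stand-ins: a handful of "crumpled" samples and many "flat-folded" ones.
experimental = [f"crumpled_{i}" for i in range(5)]
synthetic = [f"flat_folded_{i}" for i in range(1000)]

train = build_training_set(experimental, synthetic, synthetic_per_real=10)
n_real = sum(1 for _, source in train if source == "experimental")
print(len(train), n_real)
```

Tagging each sample with its source keeps the scarce experimental data identifiable after mixing, which is useful if evaluation is meant to be carried out on experimental samples only.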