» Articles » PMID: 39747064

Predicting Thermodynamic Stability of Inorganic Compounds Using Ensemble Machine Learning Based on Electron Configuration

Overview
Journal Nat Commun
Specialty Biology
Date 2025 Jan 2
PMID 39747064
Authors
Affiliations
Soon will be listed here.
Abstract

Machine learning offers a promising avenue for expediting the discovery of new compounds by accurately predicting their thermodynamic stability. This approach provides significant advantages in terms of time and resource efficiency compared to traditional experimental and modeling methods. However, most existing models are constructed based on specific domain knowledge, potentially introducing biases that impact their performance. Here, we propose a machine learning framework rooted in electron configuration, further enhanced through stack generalization with two additional models grounded in diverse domain knowledge. Experimental results validate the efficacy of our model in accurately predicting the stability of compounds, achieving an Area Under the Curve score of 0.988. Notably, our model demonstrates exceptional efficiency in sample utilization, requiring only one-seventh of the data used by existing models to achieve the same performance. To underscore the versatility of our approach, we present three illustrative examples showcasing its effectiveness in navigating unexplored composition space. We present two case studies to demonstrate that our method can facilitate the exploration of new two-dimensional wide bandgap semiconductors and double perovskite oxides. Validation results from first-principles calculations indicate that our method demonstrates remarkable accuracy in correctly identifying stable compounds.

Citing Articles

Predicting thermodynamic stability of inorganic compounds using ensemble machine learning based on electron configuration.

Zou H, Zhao H, Lu M, Wang J, Deng Z, Wang J Nat Commun. 2025; 16(1):203.

PMID: 39747064 PMC: 11696921. DOI: 10.1038/s41467-024-55525-y.

References
1.
Tshitoyan V, Dagdelen J, Weston L, Dunn A, Rong Z, Kononova O . Unsupervised word embeddings capture latent knowledge from materials science literature. Nature. 2019; 571(7763):95-98. DOI: 10.1038/s41586-019-1335-8. View

2.
Schmidt J, Pettersson L, Verdozzi C, Botti S, Marques M . Crystal graph attention networks for the prediction of stable materials. Sci Adv. 2021; 7(49):eabi7948. PMC: 8641929. DOI: 10.1126/sciadv.abi7948. View

3.
Goodall R, Lee A . Predicting materials properties without crystal structure: deep representation learning from stoichiometry. Nat Commun. 2020; 11(1):6280. PMC: 7722901. DOI: 10.1038/s41467-020-19964-7. View

4.
Zagorac D, Muller H, Ruehl S, Zagorac J, Rehme S . Recent developments in the Inorganic Crystal Structure Database: theoretical crystal structure data and related features. J Appl Crystallogr. 2019; 52(Pt 5):918-925. PMC: 6782081. DOI: 10.1107/S160057671900997X. View

5.
Emery A, Wolverton C . High-throughput DFT calculations of formation energy, stability and oxygen vacancy formation energy of ABO perovskites. Sci Data. 2017; 4:170153. PMC: 5644373. DOI: 10.1038/sdata.2017.153. View