Machine Learning, Transcriptome, and Genotyping Chip Analyses Provide Insights into SNP Markers Identifying Flower Color in Platycodon grandiflorus

Overview
Journal Sci Rep
Specialty Science
Date 2021 Apr 14
PMID 33850210
Citations 3
Abstract

Bellflower is an edible ornamental garden plant in Asia. To predict flower color in bellflower plants, a transcriptome-wide approach combining machine learning, transcriptome, and genotyping chip analyses was used to identify SNP markers. Six machine learning methods were deployed to explore the classification potential of the selected SNPs as features in two datasets: a training set (60 RNA-Seq samples) and a validation set (480 Fluidigm chip samples). SNP selection was performed sequentially. First, 96 SNPs were selected from the transcriptome-wide SNPs using principal component analysis (PCA). Then, 9 of the 96 SNPs were identified using random forest-based feature selection on the Fluidigm chip dataset. Among the six models, the random forest (RF) model produced higher classification performance than the others. The 9 SNP marker candidates selected for flower color classification were verified using genomic DNA PCR with Sanger sequencing. Our results suggest that this methodology could be used for future selection of breeding traits even though the plant accessions are highly heterogeneous.
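
The abstract does not say how PCA was used to pick SNPs. One common approach, assumed here purely for illustration, is to rank features by the absolute size of their loading on the first principal component. The minimal sketch below does this in plain Python with power iteration. The 0/1/2 genotype encoding, the toy matrix, and the function names are all hypothetical and are not taken from the paper.

```python
import random

def center_columns(matrix):
    """Subtract each column mean so PCA works on centered data."""
    n = len(matrix)
    means = [sum(col) / n for col in zip(*matrix)]
    return [[x - m for x, m in zip(row, means)] for row in matrix]

def first_pc_loadings(matrix, iterations=200, seed=0):
    """Approximate the first principal axis by power iteration on X^T X."""
    X = center_columns(matrix)
    p = len(X[0])
    rng = random.Random(seed)
    v = [rng.random() + 0.1 for _ in range(p)]  # positive start vector
    for _ in range(iterations):
        # Project every sample onto the current axis: scores = X v
        scores = [sum(x * w for x, w in zip(row, v)) for row in X]
        # Map back to feature space: new_v = X^T scores
        new_v = [sum(row[j] * s for row, s in zip(X, scores)) for j in range(p)]
        norm = sum(w * w for w in new_v) ** 0.5
        if norm == 0:  # all features constant, nothing to rank
            break
        v = [w / norm for w in new_v]
    return v

def top_snps_by_loading(matrix, k):
    """Return the indices of the k SNPs with the largest |PC1 loading|."""
    loadings = first_pc_loadings(matrix)
    ranked = sorted(range(len(loadings)),
                    key=lambda j: abs(loadings[j]), reverse=True)
    return sorted(ranked[:k])

# Hypothetical toy data: rows are samples, columns are SNPs (0/1/2 allele counts).
# SNPs 0 and 2 separate the two sample groups; SNP 1 is constant; SNP 3 is noise.
genotypes = [
    [0, 1, 0, 1],
    [0, 1, 0, 1],
    [0, 1, 0, 0],
    [2, 1, 2, 1],
    [2, 1, 2, 1],
    [2, 1, 2, 0],
]

selected = top_snps_by_loading(genotypes, 2)
print(selected)  # [0, 2]
```

The second stage described in the abstract, random forest-based feature selection on the chip data, is not shown; in practice it would typically rely on an established library rather than a hand-written implementation.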

Citing Articles

Genome-wide association study (GWAS) with high-throughput SNP chip DNA markers identified novel genetic factors for mesocotyl elongation and seedling emergence in rice (Oryza sativa L.) using multiple GAPIT models.

Kabange N, Alibu S, Kwon Y, Lee S, Oh K, Lee J Front Genet. 2023; 14:1282620.

PMID: 38054028 PMC: 10694456. DOI: 10.3389/fgene.2023.1282620.


Genome-Wide Comparative Profiles of Triterpenoid Biosynthesis Genes in Ginseng and Pseudo Ginseng Medicinal Plants.

Lu J Life (Basel). 2023; 13(11).

PMID: 38004367 PMC: 10672587. DOI: 10.3390/life13112227.


PlgMYBR1, an R2R3-MYB transcription factor, plays as a negative regulator of anthocyanin biosynthesis in Platycodon grandiflorus.

Kim E, Hyun T 3 Biotech. 2023; 13(3):75.

PMID: 36748016 PMC: 9898487. DOI: 10.1007/s13205-023-03490-6.
