» Articles » PMID: 39184199

NQuack: An R Package for Predicting Ploidal Level from Sequence Data Using Site-based Heterozygosity

Overview
Journal Appl Plant Sci
Date 2024 Aug 26
PMID 39184199
Authors
Affiliations
Soon will be listed here.
Abstract

Premise: Traditional methods of ploidal-level estimation are tedious; using DNA sequence data for cytotype estimation is an ideal alternative. Multiple statistical approaches to leverage sequence data for ploidy inference based on site-based heterozygosity have been developed. However, these approaches may require high-coverage sequence data, use inappropriate probability distributions, or have additional statistical shortcomings that limit inference abilities. We introduce nQuack, an open-source R package that addresses the main shortcomings of current methods.

Methods And Results: nQuack performs model selection for improved ploidy predictions. Here, we implement expectation maximization algorithms with normal, beta, and beta-binomial distributions. Using extensive computer simulations that account for variability in sequencing depth, as well as real data sets, we demonstrate the utility and limitations of nQuack.

Conclusions: Inferring ploidy based on site-based heterozygosity alone is difficult. Even though nQuack is more accurate than similar methods, we suggest caution when relying on any site-based heterozygosity method to infer ploidy.

References
1.
Zhuang Y, Wang X, Li X, Hu J, Fan L, Landis J . Phylogenomics of the genus Glycine sheds light on polyploid evolution and life-strategy transition. Nat Plants. 2022; 8(3):233-244. DOI: 10.1038/s41477-022-01102-4. View

2.
Galbraith D, Harkins K, Maddox J, Ayres N, Sharma D, Firoozabady E . Rapid flow cytometric analysis of the cell cycle in intact plant tissues. Science. 1983; 220(4601):1049-51. DOI: 10.1126/science.220.4601.1049. View

3.
Jerde C, Kraskura K, Eliason E, Csik S, Stier A, Taper M . Strong Evidence for an Intraspecific Metabolic Scaling Coefficient Near 0.89 in Fish. Front Physiol. 2019; 10:1166. PMC: 6763608. DOI: 10.3389/fphys.2019.01166. View

4.
Suda J, Travnicek P . Reliable DNA ploidy determination in dehydrated tissues of vascular plants by DAPI flow cytometry--new prospects for plant research. Cytometry A. 2006; 69(4):273-80. DOI: 10.1002/cyto.a.20253. View

5.
Elshire R, Glaubitz J, Sun Q, Poland J, Kawamoto K, Buckler E . A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS One. 2011; 6(5):e19379. PMC: 3087801. DOI: 10.1371/journal.pone.0019379. View