» Articles » PMID: 32143574

PyBSASeq: a Simple and Effective Algorithm for Bulked Segregant Analysis with Whole-genome Sequencing Data

Overview
Publisher Biomed Central
Specialty Biology
Date 2020 Mar 8
PMID 32143574
Citations 9
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Bulked segregant analysis (BSA), coupled with next-generation sequencing, allows the rapid identification of both qualitative and quantitative trait loci (QTL), and this technique is referred to as BSA-Seq here. The current SNP index method and G-statistic method for BSA-Seq data analysis require relatively high sequencing coverage to detect significant single nucleotide polymorphism (SNP)-trait associations, which leads to high sequencing cost.

Results: We developed a simple and effective algorithm for BSA-Seq data analysis and implemented it in Python; the program was named PyBSASeq. Using PyBSASeq, the significant SNPs (sSNPs), SNPs likely associated with the trait, were identified via Fisher's exact test, and then the ratio of the sSNPs to total SNPs in a chromosomal interval was used to detect the genomic regions that condition the trait of interest. The results obtained this way are similar to those generated via the current methods, but with more than five times higher sensitivity. This approach was termed the significant SNP method here.

Conclusions: The significant SNP method allows the detection of SNP-trait associations at much lower sequencing coverage than the current methods, leading to ~ 80% lower sequencing cost and making BSA-Seq more accessible to the research community and more applicable to the species with a large genome.

Citing Articles

Genetic Regulation of Chlorophyll Biosynthesis in Pepper Fruit: Roles of and .

Sun H, Zhang Y, Zhang L, Wang X, Zhang K, Cheng F Genes (Basel). 2025; 16(2).

PMID: 40004548 PMC: 11855580. DOI: 10.3390/genes16020219.


Comparison of Recombination Rate, Reference Bias, and Unique Pangenomic Haplotypes in Using Seven De Novo Genome Assemblies.

Stack G, Quade M, Wilkerson D, Monserrate L, Bentz P, Carey S Int J Mol Sci. 2025; 26(3).

PMID: 39940933 PMC: 11818205. DOI: 10.3390/ijms26031165.


Physiological Analysis and Genetic Mapping of Short Hypocotyl Trait in L.

Liu M, Hu F, Liu L, Lu X, Li R, Wang J Int J Mol Sci. 2023; 24(20).

PMID: 37895090 PMC: 10607371. DOI: 10.3390/ijms242015409.


Maize LOST SUBSIDIARY CELL encoding a large subunit of ribonucleotide reductase is required for subsidiary cell development and plant growth.

Cui Y, He M, Liu J, Wang S, Zhang J, Xie S J Exp Bot. 2023; 74(15):4449-4460.

PMID: 37103989 PMC: 10433938. DOI: 10.1093/jxb/erad153.


Identification and mapping of major-effect flowering time loci and in L.

Toth J, Stack G, Carlson C, Smart L Front Plant Sci. 2022; 13:991680.

PMID: 36212374 PMC: 9533707. DOI: 10.3389/fpls.2022.991680.


References
1.
Xiao N, Gao Y, Qian H, Gao Q, Wu Y, Zhang D . Identification of Genes Related to Cold Tolerance and a Functional Allele That Confers Cold Tolerance. Plant Physiol. 2018; 177(3):1108-1123. PMC: 6052991. DOI: 10.1104/pp.18.00209. View

2.
Lu H, Lin T, Klein J, Wang S, Qi J, Zhou Q . QTL-seq identifies an early flowering QTL located near Flowering Locus T in cucumber. Theor Appl Genet. 2014; 127(7):1491-9. DOI: 10.1007/s00122-014-2313-z. View

3.
Luo H, Pandey M, Khan A, Guo J, Wu B, Cai Y . Discovery of genomic regions and candidate genes controlling shelling percentage using QTL-seq approach in cultivated peanut (Arachis hypogaea L.). Plant Biotechnol J. 2018; 17(7):1248-1260. PMC: 6576108. DOI: 10.1111/pbi.13050. View

4.
Chen S, Zhou Y, Chen Y, Gu J . fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. 2018; 34(17):i884-i890. PMC: 6129281. DOI: 10.1093/bioinformatics/bty560. View

5.
Shen F, Huang Z, Zhang B, Wang Y, Zhang X, Wu T . Mapping Gene Markers for Apple Fruit Ring Rot Disease Resistance Using a Multi-omics Approach. G3 (Bethesda). 2019; 9(5):1663-1678. PMC: 6505150. DOI: 10.1534/g3.119.400167. View