Powerful Methods for Detecting Introgressed Regions from Population Genomic Data
Overview
Molecular Biology
Affiliations
Understanding the types and functions of genes that are able to cross species boundaries-and those that are not-is an important step in understanding the forces maintaining species as largely independent lineages across the remainder of the genome. With large next-generation sequencing data sets we are now able to ask whether introgression has occurred across the genome, and multiple methods have been proposed to detect the signature of such events. Here, we introduce a new summary statistic that can be used to test for introgression, RNDmin , that makes use of the minimum pairwise sequence distance between two population samples relative to divergence to an outgroup. We find that our method offers a modest increase in power over other, related tests, but that all such tests have high power to detect introgressed loci when migration is recent and strong. RNDmin is robust to variation in the mutation rate, and remains reliable even when estimates of the divergence time between sister species are inaccurate. We apply RNDmin to population genomic data from the African mosquitoes Anopheles quadriannulatus and A. arabiensis, identifying three novel candidate regions for introgression. Interestingly, one of the introgressed loci is on the X chromosome, but outside of an inversion separating these two species. Our results suggest that significant, but rare, sharing of alleles is occurring between species that diverged more than 1 million years ago, and that application of these methods to additional systems are likely to reveal similar results.
Long-distance gene flow and recombination shape the evolutionary history of a maize pathogen.
Rogerio F, Van Oosterhout C, De Mita S, Cuevas-Fernandez F, Garcia-Rodriguez P, Becerra S IMA Fungus. 2025; 16:e138888.
PMID: 40052074 PMC: 11882024. DOI: 10.3897/imafungus.16.138888.
Tree Sequences as a General-Purpose Tool for Population Genetic Inference.
Whitehouse L, Ray D, Schrider D Mol Biol Evol. 2024; 41(11).
PMID: 39460991 PMC: 11600592. DOI: 10.1093/molbev/msae223.
Genealogical asymmetry under the IM model and a two-taxon test for gene flow.
Mackintosh A, Setter D Genetics. 2024; .
PMID: 39344660 PMC: 11631468. DOI: 10.1093/genetics/iyae157.
Tree sequences as a general-purpose tool for population genetic inference.
Whitehouse L, Ray D, Schrider D bioRxiv. 2024; .
PMID: 39185244 PMC: 11343121. DOI: 10.1101/2024.02.20.581288.
IntroUNET: Identifying introgressed alleles via semantic segmentation.
Ray D, Flagel L, Schrider D PLoS Genet. 2024; 20(2):e1010657.
PMID: 38377104 PMC: 10906877. DOI: 10.1371/journal.pgen.1010657.