TY - JOUR
T1 - A new inference method for detecting an ongoing selective sweep
AU - Fujito, Naoko T.
AU - Satta, Yoko
AU - Hayakawa, Toshiyuki
AU - Takahata, Naoyuki
N1 - Funding Information:
We thank Dr. Andy D. Kern for his kind and prompt response to our inquiry and request to utilize discoal. We also thank the editor and reviewers for their constructive criticisms, and Editage (www.editage.jp) for English language editing on early versions of this manuscript. This work was supported in part by the Japan Society for Promotion of Science (JSPS) grant 16H04821 to Y. S., the JSPS grants 23570271, 25101705 and 16K07535 to T. H., and the Scientific Research on Innovative Areas, a MEXT Grant-in-Aid Project FY2016-2020 to N. T.
Publisher Copyright:
© 2018, Genetics Society of Japan. All rights reserved.
PY - 2018
Y1 - 2018
N2 - A simple method was developed to detect signatures of ongoing selective sweeps in single nucleotide polymorphism (SNP) data. Based largely on the traditional site frequency spectrum (SFS), the method additionally incorporates linkage disequilibrium (LD) between pairs of SNP sites and uniquely represents both SFS and LD information as hierarchical “barcodes.” This barcode representation allows the identification of a hitchhiking genomic region surrounding a putative target site of positive selection, or a core site. Sweep signals at linked neutral sites are then measured by the proportion (F c ) of derived alleles within the hitchhiking region that are linked in the derived allele group defined at the core site. In measuring F c or intra-allelic variability in an informative way, certain conditions for derived allele frequencies are required, as illustrated with the human ST8SIA2 locus. Coalescent simulators with and without positive selection are used to assess the false-positive and false-negative rates of the F c statistic. To demonstrate its power, the method was further applied to the LCT, OCA2, EDAR, SLC24A5 and ASPM loci, which are known to have undergone positive selection in human populations. Overall, the method is powerful and can be used to identify core sites responsible for ongoing selective sweeps.
AB - A simple method was developed to detect signatures of ongoing selective sweeps in single nucleotide polymorphism (SNP) data. Based largely on the traditional site frequency spectrum (SFS), the method additionally incorporates linkage disequilibrium (LD) between pairs of SNP sites and uniquely represents both SFS and LD information as hierarchical “barcodes.” This barcode representation allows the identification of a hitchhiking genomic region surrounding a putative target site of positive selection, or a core site. Sweep signals at linked neutral sites are then measured by the proportion (F c ) of derived alleles within the hitchhiking region that are linked in the derived allele group defined at the core site. In measuring F c or intra-allelic variability in an informative way, certain conditions for derived allele frequencies are required, as illustrated with the human ST8SIA2 locus. Coalescent simulators with and without positive selection are used to assess the false-positive and false-negative rates of the F c statistic. To demonstrate its power, the method was further applied to the LCT, OCA2, EDAR, SLC24A5 and ASPM loci, which are known to have undergone positive selection in human populations. Overall, the method is powerful and can be used to identify core sites responsible for ongoing selective sweeps.
UR - http://www.scopus.com/inward/record.url?scp=85050775329&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85050775329&partnerID=8YFLogxK
U2 - 10.1266/ggs.18-00008
DO - 10.1266/ggs.18-00008
M3 - Article
C2 - 30270233
AN - SCOPUS:85050775329
SN - 1341-7568
VL - 93
SP - 149
EP - 161
JO - Genes and Genetic Systems
JF - Genes and Genetic Systems
IS - 4
ER -