DataSpace: A Quantitative Summary Statistic for Genetic A…

老牧童 • 2023-02-20 09:00 • Uncategorized


Abstract: The admixture model is a widely used approach for evaluating the genetic ancestry of humans and other organisms. The model has been used successfully to improve the accuracy of genetic association studies, to further the understanding of human migratory history, and to help identify signatures of natural selection. Admixture occurs when individuals of two genetically divergent populations interbreed. The admixture model, assuming that each observed individual is derived from $d$ ancestral populations, estimates (a) the allele frequencies that define the ancestral populations, and (b) the proportion of each individual’s genetic information that comes from each ancestral population.
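To make the two estimated quantities concrete, here is a minimal simulation sketch of the model in its common formulation. The abstract does not spell out the likelihood, so the details are assumptions: `q[i][k]` stands for individual i's proportion of ancestry from population k, `f[k][j]` for the reference-allele frequency at marker j in population k, and each diploid genotype is taken to be two independent allele draws with probability $p_{ij} = \sum_k q_{ik} f_{kj}$. The function names are hypothetical.

```python
import random


def mixture_frequency(q_i, f, j):
    """Individual-specific allele frequency at marker j.

    q_i: ancestry proportions of one individual (non-negative, summing to 1).
    f:   f[k][j] is the allele frequency at marker j in ancestral population k.
    """
    return sum(q_ik * f_k[j] for q_ik, f_k in zip(q_i, f))


def simulate_genotypes(q, f, rng):
    """Simulate diploid genotypes (0, 1, or 2 allele copies) for every individual.

    Assumption: the two allele copies at a marker are independent
    Bernoulli draws with success probability mixture_frequency(q_i, f, j).
    """
    n_markers = len(f[0])
    genotypes = []
    for q_i in q:
        row = []
        for j in range(n_markers):
            p_ij = mixture_frequency(q_i, f, j)
            row.append(sum(rng.random() < p_ij for _ in range(2)))
        genotypes.append(row)
    return genotypes


if __name__ == "__main__":
    rng = random.Random(42)
    # Two ancestral populations (d = 2), three markers; values are made up.
    f = [[0.1, 0.5, 0.9],
         [0.8, 0.5, 0.2]]
    # One unadmixed individual per population, plus one 50/50 individual.
    q = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
    for row in simulate_genotypes(q, f, rng):
        print(row)
```

Ancestry estimation runs in the opposite direction: given only the genotype matrix, it recovers both `q` and `f`.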

The standard summary tool for the results of ancestry estimation has become the admixture barplot, a stacked barplot illustrating admixture proportions across individuals. In the genetic literature, these barplots are used to compare the ancestry profiles of distinct populations and even to inform the reconstruction of ancestral histories. Unfortunately, such usage can be extremely misleading, because there is no concrete metric of similarity when using a qualitative summary. It is difficult to know the error associated with ancestry estimates, and two similar-looking barplots may come from datasets that represent individuals with very different ancestry. Therefore, we need a tool that can summarize subtle differences in the underlying distribution of admixture.

This thesis calls attention to the need for a quantitative summary statistic for admixture that is concise, is informative about the error in the estimates, and allows ancestry to be compared across datasets. Here, we evaluate the two most common methods of obtaining summary statistics in the field of statistics: maximum likelihood estimation and the method of moments. However, these methods fail to achieve a high level of accuracy. To solve this problem, we propose a new summary method, the Hybrid estimator, and demonstrate that it outperforms the existing methods in accuracy. The goal of this thesis is not to replace the existing tool but to encourage the use of this summary alongside the admixture barplot, which will provide a more robust analysis of ancestry.
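The abstract does not say which distribution the summary statistic describes, and it gives no details of the Hybrid estimator, so the following is only an illustration of what a method-of-moments summary of admixture proportions could look like. It assumes, purely for the sake of the example, that each individual's proportion vector is a draw from a Dirichlet distribution with parameter vector `alpha`. The function names are hypothetical. The estimator matches the sample means and the sample variance of the first component, using the Dirichlet identity $\mathrm{Var}(q_1) = m_1 (1 - m_1) / (\alpha_0 + 1)$, where $m_1$ is the mean of the first component and $\alpha_0 = \sum_k \alpha_k$.

```python
import random


def sample_dirichlet(alpha, rng):
    """Draw one Dirichlet(alpha) vector by normalizing independent gamma draws."""
    draws = [rng.gammavariate(a, 1.0) for a in alpha]
    total = sum(draws)
    return [x / total for x in draws]


def dirichlet_method_of_moments(rows):
    """Method-of-moments estimate of Dirichlet parameters.

    rows: list of proportion vectors, one per individual (each sums to 1).
    Uses every component mean, plus the variance of the first component
    to recover the concentration alpha_0 = sum(alpha).
    """
    n = len(rows)
    d = len(rows[0])
    means = [sum(r[k] for r in rows) / n for k in range(d)]
    var_first = sum((r[0] - means[0]) ** 2 for r in rows) / (n - 1)
    # Solve Var(q_1) = m_1 (1 - m_1) / (alpha_0 + 1) for alpha_0.
    alpha_0 = means[0] * (1.0 - means[0]) / var_first - 1.0
    return [m * alpha_0 for m in means]


if __name__ == "__main__":
    rng = random.Random(0)
    true_alpha = [2.0, 3.0, 5.0]  # made-up values for the demonstration
    rows = [sample_dirichlet(true_alpha, rng) for _ in range(20000)]
    print(dirichlet_method_of_moments(rows))
```

A summary of this kind reduces a whole barplot to $d$ numbers that can be compared across datasets. A maximum likelihood fit of the same assumed distribution would need an iterative optimizer and is not shown here.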

