Conservation programs require rigorous evaluation to ensure the preservation of genetic diversity and viability of conservation populations. In this study, we conducted a comparative analysis of two indigenous Chinese...Conservation programs require rigorous evaluation to ensure the preservation of genetic diversity and viability of conservation populations. In this study, we conducted a comparative analysis of two indigenous Chinese chicken breeds, Gushi and Xichuan black-bone, using whole-genome SNPs to understand their genetic diversity, track changes over time and population structure. The breeds were divided into five conservation populations(GS1, 2010, ex-situ;GS2, 2019, ex-situ;GS3, 2019, in-situ;XB1, 2010, in-situ;and XB2, 2019, in-situ) based on conservation methods and generations. The genetic diversity indices of three conservation populations of Gushi chicken showed consistent trends, with the GS3 population under in-situ strategy having the highest diversity and GS2 under ex-situ strategy having the lowest. The degree of inbreeding of GS2 was higher than that of GS1 and GS3. Conserved populations of Xichuan black-bone chicken showed no obvious changes in genetic diversity between XB1 and XB2. In terms of population structure, the GS3 population were stratified relative to GS1 and GS2. According to the conservation priority, GS3 had the highest contribution to the total gene and allelic diversity in GS breed, whereas the contribution of XB1 and XB2 were similar. We also observed that the genetic diversity of GS2 was lower than GS3, which were from the same generation but under different conservation programs(in-situ and ex-situ). While XB1 and XB2 had similar levels of genetic diversity. Overall, our findings suggested that the conservation programs performed in ex-situ could slow down the occurrence of inbreeding events, but could not entirely prevent the loss of genetic diversity when the conserved population size was small, while in-situ conservation populations with large population size could maintain a relative high level of genetic diversity.展开更多
Background Breed identification is useful in a variety of biological contexts.Breed identification usually involves two stages,i.e.,detection of breed-informative SNPs and breed assignment.For both stages,there are se...Background Breed identification is useful in a variety of biological contexts.Breed identification usually involves two stages,i.e.,detection of breed-informative SNPs and breed assignment.For both stages,there are several methods proposed.However,what is the optimal combination of these methods remain unclear.In this study,using the whole genome sequence data available for 13 cattle breeds from Run 8 of the 1,000 Bull Genomes Project,we compared the combinations of three methods(Delta,FST,and In)for breed-informative SNP detection and five machine learning methods(KNN,SVM,RF,NB,and ANN)for breed assignment with respect to different reference population sizes and difference numbers of most breed-informative SNPs.In addition,we evaluated the accuracy of breed identification using SNP chip data of different densities.Results We found that all combinations performed quite well with identification accuracies over 95%in all scenarios.However,there was no combination which performed the best and robust across all scenarios.We proposed to inte-grate the three breed-informative detection methods,named DFI,and integrate the three machine learning methods,KNN,SVM,and RF,named KSR.We found that the combination of these two integrated methods outperformed the other combinations with accuracies over 99%in most cases and was very robust in all scenarios.The accuracies from using SNP chip data were only slightly lower than that from using sequence data in most cases.Conclusions The current study showed that the combination of DFI and KSR was the optimal strategy.Using sequence data resulted in higher accuracies than using chip data in most cases.However,the differences were gener-ally small.In view of the cost of genotyping,using chip data is also a good option for breed identification.展开更多
基金supported by the Key Research Project of the Shennong Laboratory,Henan Province,China(SN012022-05)the National Natural Science Foundation of China(32272866)+1 种基金the Young Elite Scientists Sponsorship Program by CAST(2021QNRC001)the Starting Foundation for Outstanding Young Scientists of Henan Agricultural University,China(30500664&30501280)。
文摘Conservation programs require rigorous evaluation to ensure the preservation of genetic diversity and viability of conservation populations. In this study, we conducted a comparative analysis of two indigenous Chinese chicken breeds, Gushi and Xichuan black-bone, using whole-genome SNPs to understand their genetic diversity, track changes over time and population structure. The breeds were divided into five conservation populations(GS1, 2010, ex-situ;GS2, 2019, ex-situ;GS3, 2019, in-situ;XB1, 2010, in-situ;and XB2, 2019, in-situ) based on conservation methods and generations. The genetic diversity indices of three conservation populations of Gushi chicken showed consistent trends, with the GS3 population under in-situ strategy having the highest diversity and GS2 under ex-situ strategy having the lowest. The degree of inbreeding of GS2 was higher than that of GS1 and GS3. Conserved populations of Xichuan black-bone chicken showed no obvious changes in genetic diversity between XB1 and XB2. In terms of population structure, the GS3 population were stratified relative to GS1 and GS2. According to the conservation priority, GS3 had the highest contribution to the total gene and allelic diversity in GS breed, whereas the contribution of XB1 and XB2 were similar. We also observed that the genetic diversity of GS2 was lower than GS3, which were from the same generation but under different conservation programs(in-situ and ex-situ). While XB1 and XB2 had similar levels of genetic diversity. Overall, our findings suggested that the conservation programs performed in ex-situ could slow down the occurrence of inbreeding events, but could not entirely prevent the loss of genetic diversity when the conserved population size was small, while in-situ conservation populations with large population size could maintain a relative high level of genetic diversity.
基金funded by National Key Research and Development Program of China(2021YFD1200404)the Yangzhou University Interdisciplinary Research Foundation for Animal Science Discipline of Targeted Support(yzuxk202016)the Project of Genetic Improvement for Agricultural Species(Dairy Cattle)of Shandong Province(2019LZGC011).
文摘Background Breed identification is useful in a variety of biological contexts.Breed identification usually involves two stages,i.e.,detection of breed-informative SNPs and breed assignment.For both stages,there are several methods proposed.However,what is the optimal combination of these methods remain unclear.In this study,using the whole genome sequence data available for 13 cattle breeds from Run 8 of the 1,000 Bull Genomes Project,we compared the combinations of three methods(Delta,FST,and In)for breed-informative SNP detection and five machine learning methods(KNN,SVM,RF,NB,and ANN)for breed assignment with respect to different reference population sizes and difference numbers of most breed-informative SNPs.In addition,we evaluated the accuracy of breed identification using SNP chip data of different densities.Results We found that all combinations performed quite well with identification accuracies over 95%in all scenarios.However,there was no combination which performed the best and robust across all scenarios.We proposed to inte-grate the three breed-informative detection methods,named DFI,and integrate the three machine learning methods,KNN,SVM,and RF,named KSR.We found that the combination of these two integrated methods outperformed the other combinations with accuracies over 99%in most cases and was very robust in all scenarios.The accuracies from using SNP chip data were only slightly lower than that from using sequence data in most cases.Conclusions The current study showed that the combination of DFI and KSR was the optimal strategy.Using sequence data resulted in higher accuracies than using chip data in most cases.However,the differences were gener-ally small.In view of the cost of genotyping,using chip data is also a good option for breed identification.
文摘目的:探讨健康中国汉族人群中basigin(BSG)基因的单核苷酸多态性(single nucleotide polymorphisms,SNPs)发生情况。方法:随机收集48例健康、无亲缘关系的中国汉族人外周血液并提取基因组DNA,设计引物对所有个体BSG基因的启动子区、外显子区和外显子内含子交界区的序列进行PCR扩增和正反向测序,通过判读测序峰图,明确SNPs的发生情况及其频率;通过Hardy-Weinberg平衡分析、单倍型推测和连锁不平衡分析,确定BSG基因位点的单倍型标签SNPs(haplotype tag SNPs,htSNPs);中性理论检验查明该基因位点SNPs频率分布是否符合选择中性。结果:共发现19个SNPs,其中包括2个新发现的SNPs;所有SNPs位点基因型分布均符合Hardy-Weinberg平衡。该基因位点共推测出4种常见单倍型域,确定9个SNPs为htSNPs。中性理论检验结果提示健康中国汉族人群BSG基因的SNPs分布符合中性进化假说。结论:首次对中国健康汉族人群BSG基因的SNPs进行了发掘,确定了其9个单倍型标签SNPs,为在汉族人群中研究该基因的遗传多态性与疾病易感性或药物反应性个体差异奠定了基础。