Supervised clustering or projection analysis is a staple technique in population genetic analysis. The utility of this technique depends critically on the reference panel. The most commonly used ...reference panel in the analysis of ancient DNA to date is based on the Human Origins array. We previously described a larger reference panel that captures more ancestries on the global level. Here, I reanalyzed DNA data from 279 ancient Eurasians using our reference panel. I found substantially more ancestral heterogeneity than has been reported. Reanalysis provides evidence against a resurgence of Western hunter-gatherer ancestry in the Middle to Late Neolithic and evidence for a common ancestor of farmers characterized by Western Asian ancestry, a transition of the spread of agriculture from demic to cultural diffusion, at least two migrations between the Pontic-Caspian steppes and Bronze Age Europe, and a sub-Saharan African component in Natufians that localizes to present-day southern Ethiopia.
A deeper appreciation of the complex architecture of African genomes is critical to the global effort to understand human history, biology and differential distribution of disease by geography and ...ancestry. Here, we report on how the growing engagement of African populations in genome science is providing new insights into the forces that shaped human genomes before and after the Out-of-Africa migrations. As a result of this human evolutionary history, African ancestry populations have the greatest genomic diversity in the world, and this diversity has important ramifications for genomic research. In the case of pharmacogenomics, for instance, variants of consequence are not limited to those identified in other populations, and diversity within African ancestry populations precludes summarizing risk across different African ethnic groups. Exposure of Africans to fatal pathogens, such as Plasmodium falciparum, Lassa Virus and Trypanosoma brucei rhodesiense, has resulted in elevated frequencies of alleles conferring survival advantages for infectious diseases, but that are maladaptive in modern-day environments. Illustrating with cardiometabolic traits, we show that while genomic research in African ancestry populations is still in early stages, there are already many examples of novel and African ancestry-specific disease loci that have been discovered. Furthermore, the shorter haplotypes in African genomes have facilitated fine-mapping of loci discovered in other human ancestry populations. Given the insights already gained from the interrogation of African genomes, it is imperative to continue and increase our efforts to describe genomic risk in and across African ancestry populations.
Familial Mediterranean fever (FMF) is an autoinflammatory disease caused by homozygous or compound heterozygous gain-of-function mutations in MEFV, which encodes pyrin, an inflammasome protein. ...Heterozygous carrier frequencies for multiple MEFV mutations are high in several Mediterranean populations, suggesting that they confer selective advantage. Among 2,313 Turkish people, we found extended haplotype homozygosity flanking FMF-associated mutations, indicating evolutionarily recent positive selection of FMF-associated mutations. Two pathogenic pyrin variants independently arose >1,800 years ago. Mutant pyrin interacts less avidly with Yersinia pestis virulence factor YopM than with wild-type human pyrin, thereby attenuating YopM-induced interleukin (IL)-1β suppression. Relative to healthy controls, leukocytes from patients with FMF harboring homozygous or compound heterozygous mutations and from asymptomatic heterozygous carriers released heightened IL-1β specifically in response to Y. pestis. Y. pestis-infected Mefv
FMF knock-in mice exhibited IL-1-dependent increased survival relative to wild-type knock-in mice. Thus, FMF mutations that were positively selected in Mediterranean populations confer heightened resistance to Y. pestis.
Genetic and archaeological studies have established a sub-Saharan African origin for anatomically modern humans with subsequent migrations out of Africa. Using the largest multi-locus data set known ...to date, we investigated genetic differentiation of early modern humans, human admixture and migration events, and relationships among ancestries and language groups. We compiled publicly available genome-wide genotype data on 5,966 individuals from 282 global samples, representing 30 primary language families. The best evidence supports 21 ancestries that delineate genetic structure of present-day human populations. Independent of self-identified ethno-linguistic labels, the vast majority (97.3%) of individuals have mixed ancestry, with evidence of multiple ancestries in 96.8% of samples and on all continents. The data indicate that continents, ethno-linguistic groups, races, ethnicities, and individuals all show substantial ancestral heterogeneity. We estimated correlation coefficients ranging from 0.522 to 0.962 between ancestries and language families or branches. Ancestry data support the grouping of Kwadi-Khoe, Kx'a, and Tuu languages, support the exclusion of Omotic languages from the Afroasiatic language family, and do not support the proposed Dené-Yeniseian language family as a genetically valid grouping. Ancestry data yield insight into a deeper past than linguistic data can, while linguistic data provide clarity to ancestry data.
Genome‐wide association studies (GWAS) are used to investigate genetic variants contributing to complex traits. Despite discovering many loci, a large proportion of “missing” heritability remains ...unexplained. Gene–gene interactions may help explain some of this gap. Traditionally, gene–gene interactions have been evaluated using parametric statistical methods such as linear and logistic regression, with multifactor dimensionality reduction (MDR) used to address sparseness of data in high dimensions. We propose a method for the analysis of gene–gene interactions across independent single‐nucleotide polymorphisms (SNPs) in two genes. Typical methods for this problem use statistics based on an asymptotic chi‐squared mixture distribution, which is not easy to use. Here, we propose a Kullback–Leibler‐type statistic, which follows an asymptotic, positive, normal distribution under the null hypothesis of no relationship between SNPs in the two genes, and normally distributed under the alternative hypothesis. The performance of the proposed method is evaluated by simulation studies, which show promising results. The method is also used to analyze real data and identifies gene–gene interactions among RAB3A, MADD, and PTPRN on type 2 diabetes (T2D) status.
Genome analysis of diverse human populations has contributed to the identification of novel genomic loci for diseases of major clinical and public health impact. Here, we report a genome-wide ...analysis of type 2 diabetes (T2D) in sub-Saharan Africans, an understudied ancestral group. We analyze ~18 million autosomal SNPs in 5,231 individuals from Nigeria, Ghana and Kenya. We identify a previously-unreported genome-wide significant locus: ZRANB3 (Zinc Finger RANBP2-Type Containing 3, lead SNP p = 2.831 × 10
). Knockdown or genomic knockout of the zebrafish ortholog results in reduction in pancreatic β-cell number which we demonstrate to be due to increased apoptosis in islets. siRNA transfection of murine Zranb3 in MIN6 β-cells results in impaired insulin secretion in response to high glucose, implicating Zranb3 in β-cell functional response to high glucose conditions. We also show transferability in our study of 32 established T2D loci. Our findings advance understanding of the genetics of T2D in non-European ancestry populations.
European-ancestry populations are recognized as stratified but not as admixed, implying that residual confounding by locus-specific ancestry can affect studies of association, polygenic adaptation, ...and polygenic risk scores. We integrate individual-level genome-wide data from ~19,000 European-ancestry individuals across 79 European populations and five European American cohorts. We generate a new reference panel that captures ancestral diversity missed by both the 1000 Genomes and Human Genome Diversity Projects. Both Europeans and European Americans are admixed at the subcontinental level, with admixture dates differing among subgroups of European Americans. After adjustment for both genome-wide and locus-specific ancestry, associations between a highly differentiated variant in LCT (rs4988235) and height or LDL-cholesterol were confirmed to be false positives whereas the association between LCT and body mass index was genuine. We provide formal evidence of subcontinental admixture in individuals with European ancestry, which, if not properly accounted for, can produce spurious results in genetic epidemiology studies.
Impaired glucose tolerance is a major risk factor for type 2 diabetes (T2D) and several cardiometabolic disorders. To identify genetic loci underlying fasting glucose levels, we conducted an analysis ...of 9,232 individuals of European ancestry who at enrollment were either nondiabetic or had untreated type 2 diabetes. Multivariable linear mixed models were used to test for associations between fasting glucose and 7.9 million SNPs, with adjustment for age, body mass index (BMI), sex, significant principal components of the genotypes, and cryptic relatedness. Three previously discovered loci were genome-wide significant, with the lead SNPs being rs1260326, a missense variant in GCKR (p = 1.06×10-8); rs560887, an intronic variant in G6PC2 (p = 3.39×10-11); and rs13266634, a missense variant in SLC30A8 (p = 4.28×10-10). Fine mapping, genome-wide conditional analysis, and functional annotation indicated that the three loci were independently associated with fasting glucose. Each copy of an alternate allele at any of these three SNPs was associated with a reduction of 0.012 mmol/L in fasting glucose levels (p = 8.0×10-28), and this association was replicated in trans-ethnic analysis of 14,303 individuals (p = 2.2×10-16). The three SNPs were jointly associated with significantly reduced T2D risk, with an odds ratio (95% CI) of 0.93 (0.88, 0.98) per protective allele. Our findings implicate additive effects across pathophysiological pathways involved in type 2 diabetes, including glycolysis, gluconeogenesis, and insulin secretion. Since none of the individuals homozygous for the alternate alleles at all three loci has T2D, it might be possible to use a genetic predictor of fasting glucose levels to identify individuals at low vs. high risk of developing type 2 diabetes.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
For samples of admixed individuals, it is possible to test for both ancestry effects via admixture mapping and genotype effects via association mapping. Here, we describe a joint test called BMIX ...that combines admixture and association statistics at single markers. We first perform high-density admixture mapping using local ancestry. We then perform association mapping using stratified regression, wherein for each marker genotypes are stratified by local ancestry. In both stages, we use generalized linear models, providing the advantage that the joint test can be used with any phenotype distribution with an appropriate link function. To define the alternative densities for admixture mapping and association mapping, we describe a method based on autocorrelation to empirically estimate the testing burdens of admixture mapping and association mapping. We then describe a joint test that uses the posterior probabilities from admixture mapping as prior probabilities for association mapping, capitalizing on the reduced testing burden of admixture mapping relative to association mapping. By simulation, we show that BMIX is potentially orders-of-magnitude more powerful than the MIX score, which is currently the most powerful frequentist joint test. We illustrate the gain in power through analysis of fasting plasma glucose among 922 unrelated, non-diabetic, admixed African Americans from the Howard University Family Study. We detected loci at 1q24 and 6q26 as genome-wide significant via admixture mapping; both loci have been independently reported from linkage analysis. Using the association data, we resolved the 1q24 signal into two regions. One region, upstream of the gene FAM78B, contains three binding sites for the transcription factor PPARG and two binding sites for HNF1A, both previously implicated in the pathology of type 2 diabetes. The fact that both loci showed ancestry effects may provide novel insight into the genetic architecture of fasting plasma glucose in individuals of African ancestry.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Abstract
Background
There is growing support for the use of genetic risk scores (GRS) in routine clinical settings. Due to the limited diversity of current genomic discovery samples, there are ...concerns that the predictive power of GRS will be limited in non-European ancestry populations. GRS for cardiometabolic traits were evaluated in sub-Saharan Africans in comparison with African Americans and European Americans.
Methods
We evaluated the predictive utility of GRS for 12 cardiometabolic traits in sub-Saharan Africans (AF; n = 5200), African Americans (AA; n = 9139) and European Americans (EUR; n = 9594). GRS were constructed as weighted sums of the number of risk alleles. Predictive utility was assessed using the additional phenotypic variance explained and the increase in discriminatory ability over traditional risk factors age, sex and body mass index (BMI), with adjustment for ancestry-derived principal components.
Results
Across all traits, GRS showed up to a 5-fold and 20-fold greater predictive utility in EUR relative to AA and AF, respectively. Predictive utility was most consistent for lipid traits, with percentage increase in explained variation attributable to GRS ranging from 10.6% to 127.1% among EUR, 26.6% to 65.8% among AA and 2.4% to 37.5% among AF. These differences were recapitulated in the discriminatory power, whereby the predictive utility of GRS was 4-fold greater in EUR relative to AA and up to 44-fold greater in EUR relative to AF. Obesity and blood pressure traits showed a similar pattern of greater predictive utility among EUR.
Conclusions
This work demonstrates the poorer performance of GRS in AF and highlights the need to improve representation of multiple ethnic populations in genomic studies to ensure equitable clinical translation of GRS.