Although genome-wide association studies (GWASs) have identified numerous loci associated with complex traits, imprecise modeling of the genetic relatedness within study samples may cause substantial inflation of test statistics and possibly spurious associations. Variance component approaches, such as efficient mixed-model association (EMMA), can correct for a wide range of sample structures by explicitly accounting for pairwise relatedness between individuals, using high-density markers to model the phenotype distribution; but such approaches are computationally impractical. We report here a variance component approach implemented in publicly available software, EMMA eXpedited (EMMAX), that reduces the computational time for analyzing large GWAS data sets from years to hours. We apply this method to two human GWAS data sets, performing association analysis for ten quantitative traits from the Northern Finland Birth Cohort and seven common diseases from the Wellcome Trust Case Control Consortium. We find that EMMAX outperforms both principal component analysis and genomic control in correcting for sample structure.
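The "inflation of test statistics" mentioned above is commonly summarized by the genomic inflation factor λ, the ratio of the observed median association statistic to the median expected under the null. The following is a minimal sketch of that calculation and of the genomic control rescaling that the abstract names as a comparison method. It is not the EMMAX implementation; the function names and the example statistics are hypothetical, and it assumes 1-degree-of-freedom chi-square statistics.

```python
from statistics import median

# Median of a chi-square distribution with 1 degree of freedom (approx. 0.4549).
CHI2_1DF_MEDIAN = 0.4549364


def genomic_inflation(chi2_stats):
    """Return lambda: observed median statistic / expected null median."""
    if not chi2_stats:
        raise ValueError("need at least one test statistic")
    return median(chi2_stats) / CHI2_1DF_MEDIAN


def genomic_control(chi2_stats):
    """Divide every statistic by lambda, but only when lambda exceeds 1."""
    lam = max(genomic_inflation(chi2_stats), 1.0)
    return [s / lam for s in chi2_stats]


# Hypothetical statistics, for illustration only (not from any study).
example_stats = [0.1, 0.3, 0.91, 2.0, 5.5]
lam = genomic_inflation(example_stats)
corrected = genomic_control(example_stats)
print(f"lambda = {lam:.3f}")
```

A λ close to 1 suggests little inflation. Genomic control applies one uniform correction to all markers, whereas a variance component model accounts for pairwise relatedness directly.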
Rare coding variation has historically provided the most direct connections between gene function and disease pathogenesis. By meta-analysing the whole exomes of 24,248 schizophrenia cases and 97,322 controls, we implicate ultra-rare coding variants (URVs) in 10 genes as conferring substantial risk for schizophrenia (odds ratios of 3-50, P < 2.14 × 10⁻⁶) and 32 genes at a false discovery rate of <5%. These genes have the greatest expression in central nervous system neurons and have diverse molecular functions that include the formation, structure and function of the synapse. The associations of the NMDA (N-methyl-D-aspartate) receptor subunit GRIN2A and AMPA (α-amino-3-hydroxy-5-methyl-4-isoxazole propionic acid) receptor subunit GRIA3 provide support for dysfunction of the glutamatergic system as a mechanistic hypothesis in the pathogenesis of schizophrenia. We observe an overlap of rare variant risk among schizophrenia, autism spectrum disorders, epilepsy and severe neurodevelopmental disorders, although different mutation types are implicated in some shared genes. Most genes described here, however, are not implicated in neurodevelopment. We demonstrate that genes prioritized from common variant analyses of schizophrenia are enriched in rare variant risk, suggesting that common and rare genetic risk factors converge at least partially on the same underlying pathogenic biological processes. Even after excluding significantly associated genes, schizophrenia cases still carry a substantial excess of URVs, which indicates that more risk genes await discovery using this approach.
Next-generation sequencing technology (NGS) enables the discovery of nearly all genetic variants present in a genome. A subset of these variants, however, may have poor sequencing quality due to limitations in NGS or variant callers. In genetic studies that analyze a large number of sequenced individuals, it is critical to detect and remove those variants with poor quality as they may cause spurious findings. In this paper, we present ForestQC, a statistical tool for performing quality control on variants identified from NGS data by combining a traditional filtering approach and a machine learning approach. Our software uses the information on sequencing quality, such as sequencing depth, genotyping quality, and GC contents, to predict whether a particular variant is likely to be false-positive. To evaluate ForestQC, we applied it to two whole-genome sequencing datasets where one dataset consists of related individuals from families while the other consists of unrelated individuals. Results indicate that ForestQC outperforms widely used methods for performing quality control on variants such as VQSR of GATK by considerably improving the quality of variants to be included in the analysis. ForestQC is also very efficient, and hence can be applied to large sequencing datasets. We conclude that combining a machine learning algorithm trained with sequencing quality information and the filtering approach is a practical approach to perform quality control on genetic variants from sequencing data.
Few studies have explored the impact of rare variants (minor allele frequency < 1%) on highly heritable plasma metabolites identified in metabolomic screens. The Finnish population provides an ideal opportunity for such explorations, given the multiple bottlenecks and expansions that have shaped its history, and the enrichment for many otherwise rare alleles that has resulted. Here, we report genetic associations for 1391 plasma metabolites in 6136 men from the late-settlement region of Finland. We identify 303 novel association signals, more than one third at variants rare or enriched in Finns. Many of these signals identify genes not previously implicated in metabolite genome-wide association studies and suggest mechanisms for diseases and disease-related traits.
Tourette syndrome (TS) is a model neuropsychiatric disorder thought to arise from abnormal development and/or maintenance of cortico-striato-thalamo-cortical circuits. TS is highly heritable, but its underlying genetic causes are still elusive, and no genome-wide significant loci have been discovered to date. We analyzed a European ancestry sample of 2,434 TS cases and 4,093 ancestry-matched controls for rare (< 1% frequency) copy-number variants (CNVs) using SNP microarray data. We observed an enrichment of global CNV burden that was prominent for large (> 1 Mb), singleton events (OR = 2.28, 95% CI 1.39–3.79, p = 1.2 × 10⁻³) and known, pathogenic CNVs (OR = 3.03, 95% CI 1.85–5.07, p = 1.5 × 10⁻⁵). We also identified two individual, genome-wide significant loci, each conferring a substantial increase in TS risk (NRXN1 deletions, OR = 20.3, 95% CI 2.6–156.2; CNTN6 duplications, OR = 10.1, 95% CI 2.3–45.4). Approximately 1% of TS cases carry one of these CNVs, indicating that rare structural variation contributes significantly to the genetic architecture of TS.
• Rare structural variants contribute significantly to the genetic architecture of TS.
• Increased global CNV burden is driven by large, rare, clinically relevant events.
• NRXN1 deletions and CNTN6 duplications confer a substantial increase in TS risk.
Tourette syndrome is highly genetic, but identifying definitive disease susceptibility genes has been challenging. Huang et al. report two genome-wide, significant, recurrent, rare copy-number variants (NRXN1 deletions and CNTN6 duplications), each conferring a substantial increase in TS risk.
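The odds ratios and 95% confidence intervals quoted for the CNV findings can be read with a simple 2×2 carrier table in mind. The sketch below computes an unadjusted odds ratio with a Wald (Woolf) interval on the log scale. The abstract does not state how its intervals were derived, so this is an illustration of the general quantity only. The function name and the carrier counts are hypothetical and are not taken from the study.

```python
import math


def odds_ratio_ci(case_carriers, case_noncarriers,
                  control_carriers, control_noncarriers, z=1.96):
    """Unadjusted odds ratio with a Wald confidence interval (z=1.96 gives ~95%)."""
    a, b = case_carriers, case_noncarriers
    c, d = control_carriers, control_noncarriers
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cells must be positive for the Wald interval")
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) from the four cell counts.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts for illustration only (not the study's data).
or_est, ci_low, ci_high = odds_ratio_ci(20, 980, 10, 990)
print(f"OR = {or_est:.2f}, 95% CI {ci_low:.2f}-{ci_high:.2f}")
```

With very few carriers, as is typical for rare CNVs, the interval becomes wide, which is consistent with ranges such as 2.6–156.2 reported above. Exact or regression-based methods are often preferred in that setting.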
The COVID-19 pandemic, caused by the coronavirus SARS-CoV-2, has devastated health infrastructure around the world. Both ACE2 (an entry receptor) and TMPRSS2 (used by the virus for spike protein priming) are key proteins to SARS-CoV-2 cell entry, enabling progression to COVID-19 in humans. Comparative genomic research into critical ACE2 binding sites, associated with the spike receptor binding domain, has suggested that African and Asian primates may also be susceptible to disease from SARS-CoV-2 infection. Savanna monkeys (Chlorocebus spp.) are a widespread non-human primate with well-established potential as a bi-directional zoonotic/anthroponotic agent due to high levels of human interaction throughout their range in sub-Saharan Africa and the Caribbean. To characterize potential functional variation in savanna monkey ACE2 and TMPRSS2, we inspected recently published genomic data from 245 savanna monkeys, including 163 wild monkeys from Africa and the Caribbean and 82 captive monkeys from the Vervet Research Colony (VRC). We found several missense variants. One missense variant in ACE2 (X:14,077,550; Asp30Gly), common in Ch. sabaeus, causes a change in amino acid residue that has been inferred to reduce binding efficiency of SARS-CoV-2, suggesting potentially reduced susceptibility. The remaining populations appear as susceptible as humans, based on these criteria for receptor usage. All missense variants observed in wild Ch. sabaeus populations are also present in the VRC, along with two splice acceptor variants (at X:14,065,076) not observed in the wild sample that are potentially disruptive to ACE2 function. The presence of these variants in the VRC suggests a promising model for SARS-CoV-2 infection and vaccine and therapy development.
In keeping with a One Health approach, characterizing actual susceptibility and potential for bi-directional zoonotic/anthroponotic transfer in savanna monkey populations may be an important consideration for controlling COVID-19 epidemics in communities with frequent human/non-human primate interactions that, in many cases, may have limited health infrastructure.
University campuses could become leaders in developing alternatives to policing for managing public health and safety, yet nearly all campuses rely on campus or local police to respond to mental health emergencies. Herein, we present the available evidence for campus mobile crisis intervention teams (MCITs) as an alternative to policing, consider what colleges and universities can learn from existing community MCIT models, and propose initial steps for the development and implementation of a campus MCIT.
Vervet monkeys are among the most widely distributed nonhuman primates, show considerable phenotypic diversity, and have long been an important biomedical model for a variety of human diseases and in vaccine research. Using whole-genome sequencing data from 163 vervets sampled from across Africa and the Caribbean, we find high diversity within and between taxa and clear evidence that taxonomic divergence was reticulate rather than following a simple branching pattern. A scan for diversifying selection across taxa identifies strong and highly polygenic selection signals affecting viral processes. Furthermore, selection scores are elevated in genes whose human orthologs interact with HIV and in genes that show a response to experimental simian immunodeficiency virus (SIV) infection in vervet monkeys but not in rhesus macaques, suggesting that part of the signal reflects taxon-specific adaptation to SIV.
The large and diverse population of Latin America is potentially a powerful resource for elucidating the genetic basis of complex traits through admixture mapping. However, no genome-wide characterization of admixture across Latin America has yet been attempted. Here, we report an analysis of admixture in thirteen Mestizo populations (i.e. in regions of mainly European and Native settlement) from seven countries in Latin America based on data for 678 autosomal and 29 X-chromosome microsatellites. We found extensive variation in Native American and European ancestry (and generally low levels of African ancestry) among populations and individuals, and evidence that admixture across Latin America has often involved predominantly European men and both Native and African women. An admixture analysis allowing for Native American population subdivision revealed a differentiation of the Native American ancestry amongst Mestizos. This observation is consistent with the genetic structure of pre-Columbian populations and with admixture having involved Natives from the area where the Mestizos examined are located. Our findings agree with available information on the demographic history of Latin America and have a number of implications for the design of association studies in populations from the region.
Genome-wide association studies (GWAS) of longitudinal birth cohorts enable joint investigation of environmental and genetic influences on complex traits. We report GWAS results for nine quantitative metabolic traits (triglycerides, high-density lipoprotein, low-density lipoprotein, glucose, insulin, C-reactive protein, body mass index, and systolic and diastolic blood pressure) in the Northern Finland Birth Cohort 1966 (NFBC1966), drawn from the most genetically isolated Finnish regions. We replicate most previously reported associations for these traits and identify nine new associations, several of which highlight genes with metabolic functions: high-density lipoprotein with NR1H3 (LXRA), low-density lipoprotein with AR and FADS1-FADS2, glucose with MTNR1B, and insulin with PANK1. Two of these new associations emerged after adjustment of results for body mass index. Gene-environment interaction analyses suggested additional associations, which will require validation in larger samples. The currently identified loci, together with quantified environmental exposures, explain little of the trait variation in NFBC1966. The association observed between low-density lipoprotein and an infrequent variant in AR suggests the potential of such a cohort for identifying associations with both common, low-impact and rarer, high-impact quantitative trait loci.