Abstract
Motivation
Most genetic variants implicated in complex diseases by genome-wide association studies (GWAS) are non-coding, making it challenging to understand the causative genes involved in ...disease. Integrating external information such as quantitative trait locus (QTL) mapping of molecular traits (e.g. expression, methylation) is a powerful approach to identify the subset of GWAS signals explained by regulatory effects. In particular, expression QTLs (eQTLs) help pinpoint the responsible gene among the GWAS regions that harbor many genes, while methylation QTLs (mQTLs) help identify the epigenetic mechanisms that impact gene expression which in turn affect disease risk. In this work, we propose multiple-trait-coloc (moloc), a Bayesian statistical framework that integrates GWAS summary data with multiple molecular QTL data to identify regulatory effects at GWAS risk loci.
Results
We applied moloc to schizophrenia (SCZ) and eQTL/mQTL data derived from human brain tissue and identified 52 candidate genes that influence SCZ through methylation. Our method can be applied to any GWAS and relevant functional data to help prioritize disease associated genes.
Availability and implementation: moloc is available for download as an R package (https://github.com/clagiamba/moloc). We also developed a web site to visualize the biological findings (icahn.mssm.edu/moloc). The browser allows searches by gene, methylation probe and scenario of interest.
Supplementary information
Supplementary data are available at Bioinformatics online.
Genetic studies of complex traits in animals have been hindered by the need to generate, maintain, and phenotype large panels of recombinant lines. We developed a new method, C. elegans eXtreme ...Quantitative Trait Locus (ceX-QTL) mapping, that overcomes this obstacle via bulk selection on millions of unique recombinant individuals. We use ceX-QTL to map a drug resistance locus with high resolution. We also map differences in gene expression in live worms and discovered that mutations in the co-chaperone sti-1 upregulate the transcription of HSP-90. Lastly, we use ceX-QTL to map loci that influence fitness genome-wide confirming previously reported causal variants and uncovering new fitness loci. ceX-QTL is fast, powerful and cost-effective, and will accelerate the study of complex traits in animals.
How variants with different frequencies contribute to trait variation is a central question in genetics. We use a unique model system to disentangle the contributions of common and rare variants to ...quantitative traits. We generated ~14,000 progeny from crosses among 16 diverse yeast strains and identified thousands of quantitative trait loci (QTLs) for 38 traits. We combined our results with sequencing data for 1011 yeast isolates to show that rare variants make a disproportionate contribution to trait variation. Evolutionary analyses revealed that this contribution is driven by rare variants that arose recently, and that negative selection has shaped the relationship between variant frequency and effect size. We leveraged the structure of the crosses to resolve hundreds of QTLs to single genes. These results refine our understanding of trait variation at the population level and suggest that studies of rare variants are a fertile ground for discovery of genetic effects.
Genetic regulation of gene expression underlies variation in disease risk and other complex traits. The effect of expression quantitative trait loci (eQTLs) varies across cell types; however, the ...complexity of mammalian tissues makes studying cell-type eQTLs highly challenging. We developed a novel approach in the model nematode
that uses single-cell RNA sequencing to map eQTLs at cellular resolution in a single one-pot experiment. We mapped eQTLs across cell types in an extremely large population of genetically distinct
individuals. We found cell-type-specific
eQTL hotspots that affect the expression of core pathways in the relevant cell types. Finally, we found single-cell-specific eQTL effects in the nervous system, including an eQTL with opposite effects in two individual neurons. Our results show that eQTL effects can be specific down to the level of single cells.
Increased adiposity is a hallmark of obesity and overweight, which affect 2.2 billion people world-wide. Understanding the genetic and molecular mechanisms that underlie obesity-related phenotypes ...can help to improve treatment options and drug development. Here we perform promoter Capture Hi-C in human adipocytes to investigate interactions between gene promoters and distal elements as a transcription-regulating mechanism contributing to these phenotypes. We find that promoter-interacting elements in human adipocytes are enriched for adipose-related transcription factor motifs, such as PPARG and CEBPB, and contribute to heritability of cis-regulated gene expression. We further intersect these data with published genome-wide association studies for BMI and BMI-related metabolic traits to identify the genes that are under genetic cis regulation in human adipocytes via chromosomal interactions. This integrative genomics approach identifies four cis-eQTL-eGene relationships associated with BMI or obesity-related traits, including rs4776984 and MAP2K5, which we further confirm by EMSA, and highlights 38 additional candidate genes.
ObjectiveMitochondria have an important role in the induction of the NLRP3 inflammasome response central in gout. The objective was to test whether mitochondrial genetic variation and copy number in ...New Zealand Māori and Pacific (Polynesian) people in Aotearoa New Zealand associate with susceptibility to gout.Methods437 whole mitochondrial genomes from Māori and Pacific people (predominantly men) from Aotearoa New Zealand (327 people with gout, 110 without gout) were sequenced. Mitochondrial DNA copy number variation was determined by assessing relative read depth using data produced from whole genome sequencing (32 cases, 43 controls) and targeted resequencing of urate loci (151 cases, 222 controls). Quantitative PCR was undertaken for replication of copy number findings in an extended sample set of 1159 Māori and Pacific men and women (612 cases, 547 controls).ResultsThere was relatively little mitochondrial genetic diversity, with around 96% of those sequenced in this study belonging to the B4a1a and derived sublineages. A B haplogroup heteroplasmy in hypervariable region I was found to associate with a higher risk of gout among the mitochondrial sequenced sample set (position 16181: OR=1.57, P=0.001). Increased copies of mitochondrial DNA were found to protect against gout risk with the effect being consistent when using hyperuricaemic controls across each of the three independent sample sets (OR=0.89, P=0.007; OR=0.90, P=0.002; OR=0.76, P=0.03). Paradoxically, an increase of mitochondrial DNA also associated with an increase in gout flare frequency in people with gout in the two larger sample sets used for the copy number analysis (β=0.003, P=7.1×10–7; β=0.08, P=1.2×10–4).ConclusionAssociation of reduced copy number with gout in hyperuricaemia was replicated over three Polynesian sample sets. Our data are consistent with emerging research showing that mitochondria are important for the colocalisation of the NLRP3 and ASC inflammasome subunits, a process essential for the generation of interleukin-1β in gout.
Reports the findings of complete mitochondrial genome sequencings of fourteen dogs from the colonisation era archaeological site of Wairau Bar. Source: National Library of New Zealand Te Puna ...Matauranga o Aotearoa, licensed by the Department of Internal Affairs for re-use under the Creative Commons Attribution 3.0 New Zealand Licence.
The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has caused global disruption of human health and activity. Being able to trace the early outbreak of SARS-CoV-2 within a locality can ...inform public health measures and provide insights to contain or prevent viral transmission. Investigation of the transmission history requires efficient sequencing methods and analytic strategies, which can be generally useful in the study of viral outbreaks.
The County of Los Angeles (hereafter, LA County) sustained a large outbreak of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). To learn about the transmission history, we carried out surveillance viral genome sequencing to determine 142 viral genomes from unique patients seeking care at the University of California, Los Angeles (UCLA) Health System. 86 of these genomes were from samples collected before April 19, 2020.
We found that the early outbreak in LA County, as in other international air travel hubs, was seeded by multiple introductions of strains from Asia and Europe. We identified a USA-specific strain, B.1.43, which was found predominantly in California and Washington State. While samples from LA County carried the ancestral B.1.43 genome, viral genomes from neighboring counties in California and from counties in Washington State carried additional mutations, suggesting a potential origin of B.1.43 in Southern California. We quantified the transmission rate of SARS-CoV-2 over time, and found evidence that the public health measures put in place in LA County to control the virus were effective at preventing transmission, but might have been undermined by the many introductions of SARS-CoV-2 into the region.
Our work demonstrates that genome sequencing can be a powerful tool for investigating outbreaks and informing the public health response. Our results reinforce the critical need for the USA to have coordinated inter-state responses to the pandemic.
While Phoenician culture and trade networks had a significant impact on Western civilizations, we know little about the Phoenicians themselves. In 1994, a Punic burial crypt was discovered on Byrsa ...Hill, near the entry to the National Museum of Carthage in Tunisia. Inside this crypt were the remains of a young man along with a range of burial goods, all dating to the late 6th century BCE. Here we describe the complete mitochondrial genome recovered from the Young Man of Byrsa and identify that he carried a rare European haplogroup, likely linking his maternal ancestry to Phoenician influenced locations somewhere on the North Mediterranean coast, the islands of the Mediterranean or the Iberian Peninsula. This result not only provides the first direct ancient DNA evidence of a Phoenician individual but the earliest evidence of a European mitochondrial haplogroup, U5b2c1, in North Africa.
Pausing of DNA polymerase can indicate the presence of a DNA structure that differs from the canonical double-helix. Here we detail a method to investigate how polymerase pausing in the Pacific ...Biosciences sequencer reads can be related to DNA sequences. The Pacific Biosciences sequencer uses optics to view a polymerase and its interaction with a single DNA molecule in real-time, offering a unique way to detect potential alternative DNA structures.
We have developed a new way to examine polymerase kinetics data and relate it to the DNA sequence by using a wavelet transform of read information from the sequencer. We use this method to examine how polymerase kinetics are related to nucleotide base composition. We then examine tandem repeat sequences known for their ability to form different DNA structures: (CGG)n and (CG)n repeats which can, respectively, form G-quadruplex DNA and Z-DNA. We find pausing around the (CGG)n repeat that may indicate the presence of G-quadruplexes in some of the sequencer reads. The (CG)n repeat does not appear to cause polymerase pausing, but its kinetics signature nevertheless suggests the possibility that alternative nucleotide conformations may sometimes be present.
We discuss the implications of using our method to discover DNA sequences capable of forming alternative structures. The analyses presented here can be reproduced on any Pacific Biosciences kinetics data for any DNA pattern of interest using an R package that we have made publicly available.