Long-read sequencing (LRS) promises to improve the characterization of structural variants (SVs). We generated LRS data from 3,622 Icelanders and identified a median of 22,636 SVs per individual (a ...median of 13,353 insertions and 9,474 deletions). We discovered a set of 133,886 reliably genotyped SV alleles and imputed them into 166,281 individuals to explore their effects on diseases and other traits. We discovered an association of a rare deletion in PCSK9 with lower low-density lipoprotein (LDL) cholesterol levels, compared to the population average. We also discovered an association of a multiallelic SV in ACAN with height; we found 11 alleles that differed in the number of a 57-bp-motif repeat and observed a linear relationship between the number of repeats carried and height. These results show that SVs can be accurately characterized at the population scale using LRS data in a genome-wide non-targeted approach and demonstrate how SVs impact phenotypes.
Long-read sequencing can enable the detection of base modifications, such as CpG methylation, in single molecules of DNA. The most commonly used methods for long-read sequencing are nanopore ...developed by Oxford Nanopore Technologies (ONT) and single molecule real-time (SMRT) sequencing developed by Pacific Bioscience (PacBio). In this study, we systematically compare the performance of CpG methylation detection from long-read sequencing.
We demonstrate that CpG methylation detection from 7179 nanopore-sequenced DNA samples is highly accurate and consistent with 132 oxidative bisulfite-sequenced (oxBS) samples, isolated from the same blood draws. We introduce quality filters for CpGs that further enhance the accuracy of CpG methylation detection from nanopore-sequenced DNA, while removing at most 30% of CpGs. We evaluate the per-site performance of CpG methylation detection across different genomic features and CpG methylation rates and demonstrate how the latest R10.4 flowcell chemistry and base-calling algorithms improve methylation detection from nanopore sequencing. Additionally, we show how the methylation detection of 50 SMRT-sequenced genomes compares to nanopore sequencing and oxBS.
This study provides the first systematic comparison of CpG methylation detection tools for long-read sequencing methods. We compare two commonly used computational methods for the detection of CpG methylation in a large number of nanopore genomes, including samples sequenced using the latest R10.4 nanopore flowcell chemistry and 50 SMRT sequenced samples. We provide insights into the strengths and limitations of each sequencing method as well as recommendations for standardization and evaluation of tools designed for genome-scale modified base detection using long-read sequencing.
Imprinting is the preferential expression of one parental allele over the other. It is controlled primarily through differential methylation of cytosine at CpG dinucleotides. Here we combine 285 ...methylomes and 11,617 transcriptomes from peripheral blood samples with parent-of-origin phased haplotypes, to produce a new map of imprinted methylation and gene expression patterns across the human genome. We demonstrate how imprinted methylation is a continuous rather than a binary characteristic. We describe at high resolution the parent-of-origin methylation pattern at the 15q11.2 Prader-Willi/Angelman syndrome locus, with nearly confluent stochastic paternal methylation punctuated by 'spikes' of maternal methylation. We find examples of polymorphic imprinted methylation unrelated (at VTRNA2-1 and PARD6G) or related (at CHRNE) to nearby SNP genotypes. We observe RNA isoform-specific imprinted expression patterns suggestive of a methylation-sensitive transcriptional elongation block. Finally, we gain new insights into parent-of-origin-specific effects on phenotypes at the DLK1/MEG3 and GNAS loci.
With the increasing incidence of prostate cancer, identifying common genetic variants that confer risk of the disease is important. Here we report such a variant on chromosome 8q24, a region ...initially identified through a study of Icelandic families. Allele −8 of the microsatellite DG8S737 was associated with prostate cancer in three case-control series of European ancestry from Iceland, Sweden and the US. The estimated odds ratio (OR) of the allele is 1.62 (P = 2.7 × 10−11). About 19% of affected men and 13% of the general population carry at least one copy, yielding a population attributable risk (PAR) of ∼8%. The association was also replicated in an African American case-control group with a similar OR, in which 41% of affected individuals and 30% of the population are carriers. This leads to a greater estimated PAR (16%) that may contribute to higher incidence of prostate cancer in African American men than in men of European ancestry.
Abstract
Background
The TNM system is used to assess prognosis after colorectal cancer (CRC) diagnosis. Other prognostic factors reported include histopathological assessments of the tumour, tumour ...mutations and proteins in the blood. As some of these factors are strongly correlated, it is important to evaluate the independent effects they may have on survival.
Methods
Tumour samples from 2162 CRC patients were visually assessed for amount of tumour stroma, severity of lymphocytic infiltrate at the tumour margins and the presence of lymphoid follicles. Somatic mutations in the tumour were assessed for 2134 individuals. Pre-surgical levels of 4963 plasma proteins were measured in 128 individuals. The associations between these features and prognosis were inspected by a Cox Proportional Hazards Model (CPH).
Results
Levels of stroma, lymphocytic infiltration and presence of lymphoid follicles all associate with prognosis, along with high tumour mutation burden, high microsatellite instability and
TP53
and
BRAF
mutations. The somatic mutations are correlated with the histopathology and none of the somatic mutations associate with survival in a multivariate analysis. Amount of stroma and lymphocytic infiltration associate with local invasion of tumours. Elevated levels of two plasma proteins, CA-125 and PPP1R1A, associate with a worse prognosis.
Conclusions
Tumour stroma and lymphocytic infiltration variables are strongly associated with prognosis of CRC and capture the prognostic effects of tumour mutation status. CA-125 and PPP1R1A may be useful prognostic biomarkers in CRC.
Spread of SARS-CoV-2 in the Icelandic Population Gudbjartsson, Daniel F; Helgason, Agnar; Jonsson, Hakon ...
The New England journal of medicine,
06/2020, Letnik:
382, Številka:
24
Journal Article
Recenzirano
Odprti dostop
Despite timely implementation of testing for SARS-CoV-2 virus, a contact-tracing scheme, and social-distancing measures, infection has spread in Iceland. However, there was no detected increase in ...the proportion of infected persons between March 13 and April 4, 2020.
Detailed knowledge of how diversity in the sequence of the human genome affects phenotypic diversity depends on a comprehensive and reliable characterization of both sequences and phenotypic ...variation. Over the past decade, insights into this relationship have been obtained from whole-exome sequencing or whole-genome sequencing of large cohorts with rich phenotypic data
. Here we describe the analysis of whole-genome sequencing of 150,119 individuals from the UK Biobank
. This constitutes a set of high-quality variants, including 585,040,410 single-nucleotide polymorphisms, representing 7.0% of all possible human single-nucleotide polymorphisms, and 58,707,036 indels. This large set of variants allows us to characterize selection based on sequence variation within a population through a depletion rank score of windows along the genome. Depletion rank analysis shows that coding exons represent a small fraction of regions in the genome subject to strong sequence conservation. We define three cohorts within the UK Biobank: a large British Irish cohort, a smaller African cohort and a South Asian cohort. A haplotype reference panel is provided that allows reliable imputation of most variants carried by three or more sequenced individuals. We identified 895,055 structural variants and 2,536,688 microsatellites, groups of variants typically excluded from large-scale whole-genome sequencing studies. Using this formidable new resource, we provide several examples of trait associations for rare variants with large effects not found previously through studies based on whole-exome sequencing and/or imputation.
Opportunities to directly study the founding of a human population and its subsequent evolutionary history are rare. Using genome sequence data from 27 ancient Icelanders, we demonstrate that they ...are a combination of Norse, Gaelic, and admixed individuals. We further show that these ancient Icelanders are markedly more similar to their source populations in Scandinavia and the British-Irish Isles than to contemporary Icelanders, who have been shaped by 1100 years of extensive genetic drift. Finally, we report evidence of unequal contributions from the ancient founders to the contemporary Icelandic gene pool. These results provide detailed insights into the making of a human population that has proven extraordinarily useful for the discovery of genotype-phenotype associations.
We report a prostate cancer genome-wide association follow-on study. We discovered four variants associated with susceptibility to prostate cancer in several European populations: rs10934853A (OR = ...1.12, P = 2.9 x 10(-10)) on 3q21.3; two moderately correlated (r2 = 0.07) variants, rs16902094G (OR = 1.21, P = 6.2 x 10(-15)) and rs445114T (OR = 1.14, P = 4.7 x 10(-10)), on 8q24.21; and rs8102476C (OR = 1.12, P = 1.6 x 10(-11)) on 19q13.2. We also refined a previous association signal on 11q13 with the SNP rs11228565A (OR = 1.23, P = 6.7 x 10(-12)). In a multivariate analysis using 22 prostate cancer risk variants typed in the Icelandic population, we estimated that carriers in the top 1.3% of the risk distribution are at a 2.5 times greater risk of developing the disease than members of the general population.
Celotno besedilo
Dostopno za:
DOBA, IJS, IZUM, KILJ, NUK, PILJ, PNG, SAZU, UILJ, UKNU, UL, UM, UPUK
Human populations have been shaped by catastrophes that may have left long-lasting signatures in their genomes. One notable example is the second plague pandemic that entered Europe in ca. 1,347 CE ...and repeatedly returned for over 300 years, with typical village and town mortality estimated at 10%–40%.1 It is assumed that this high mortality affected the gene pools of these populations. First, local population crashes reduced genetic diversity. Second, a change in frequency is expected for sequence variants that may have affected survival or susceptibility to the etiologic agent (Yersinia pestis).2 Third, mass mortality might alter the local gene pools through its impact on subsequent migration patterns. We explored these factors using the Norwegian city of Trondheim as a model, by sequencing 54 genomes spanning three time periods: (1) prior to the plague striking Trondheim in 1,349 CE, (2) the 17th–19th century, and (3) the present. We find that the pandemic period shaped the gene pool by reducing long distance immigration, in particular from the British Isles, and inducing a bottleneck that reduced genetic diversity. Although we also observe an excess of large FST values at multiple loci in the genome, these are shaped by reference biases introduced by mapping our relatively low genome coverage degraded DNA to the reference genome. This implies that attempts to detect selection using ancient DNA (aDNA) datasets that vary by read length and depth of sequencing coverage may be particularly challenging until methods have been developed to account for the impact of differential reference bias on test statistics.
•The second plague pandemic homogenized ancestry in Trondheim•Gaelic ancestry is sharply reduced in post-pandemic Trondheim•Pervasive reference bias taints frequency differences observed between populations
Gopalakrishnan et al. investigate the genomic signatures of the second plague pandemic on the residents of Trondheim in Norway. They find that the pandemic resulted in a sharp reduction in Gaelic ancestry and also find evidence of differential reference bias among their ancient samples, which reduces the reliability of selection analyses.