Genotypic causes of a phenotypic trait are typically determined via randomized controlled intervention experiments. Such experiments are often prohibitive with respect to durations and costs, and ...informative prioritization of experiments is desirable. We therefore consider predicting stable rankings of genes (covariates), according to their total causal effects on a phenotype (response), from observational data. Since causal effects are generally non-identifiable from observational data only, we use a method that can infer lower bounds for the total causal effect under some assumptions. We validated our method, which we call Causal Stability Ranking (CStaR), in two situations. First, we performed knock-out experiments with Arabidopsis thaliana according to a predicted ranking based on observational gene expression data, using flowering time as phenotype of interest. Besides several known regulators of flowering time, we found almost half of the tested top ranking mutants to have a significantly changed flowering time. Second, we compared CStaR to established regression-based methods on a gene expression dataset of Saccharomyces cerevisiae. We found that CStaR outperforms these established methods. Our method allows for efficient design and prioritization of future intervention experiments, and due to its generality it can be used for a broad spectrum of applications.
Spread of SARS-CoV-2 in the Icelandic Population Gudbjartsson, Daniel F; Helgason, Agnar; Jonsson, Hakon ...
The New England journal of medicine,
06/2020, Letnik:
382, Številka:
24
Journal Article
Recenzirano
Odprti dostop
During the current worldwide pandemic, coronavirus disease 2019 (Covid-19) was first diagnosed in Iceland at the end of February. However, data are limited on how SARS-CoV-2, the virus that causes ...Covid-19, enters and spreads in a population.
We targeted testing to persons living in Iceland who were at high risk for infection (mainly those who were symptomatic, had recently traveled to high-risk countries, or had contact with infected persons). We also carried out population screening using two strategies: issuing an open invitation to 10,797 persons and sending random invitations to 2283 persons. We sequenced SARS-CoV-2 from 643 samples.
As of April 4, a total of 1221 of 9199 persons (13.3%) who were recruited for targeted testing had positive results for infection with SARS-CoV-2. Of those tested in the general population, 87 (0.8%) in the open-invitation screening and 13 (0.6%) in the random-population screening tested positive for the virus. In total, 6% of the population was screened. Most persons in the targeted-testing group who received positive tests early in the study had recently traveled internationally, in contrast to those who tested positive later in the study. Children under 10 years of age were less likely to receive a positive result than were persons 10 years of age or older, with percentages of 6.7% and 13.7%, respectively, for targeted testing; in the population screening, no child under 10 years of age had a positive result, as compared with 0.8% of those 10 years of age or older. Fewer females than males received positive results both in targeted testing (11.0% vs. 16.7%) and in population screening (0.6% vs. 0.9%). The haplotypes of the sequenced SARS-CoV-2 viruses were diverse and changed over time. The percentage of infected participants that was determined through population screening remained stable for the 20-day duration of screening.
In a population-based study in Iceland, children under 10 years of age and females had a lower incidence of SARS-CoV-2 infection than adolescents or adults and males. The proportion of infected persons identified through population screening did not change substantially during the screening period, which was consistent with a beneficial effect of containment efforts. (Funded by deCODE Genetics-Amgen.).
Human evolutionary history is rich with the interbreeding of divergent populations. Most humans outside of Africa trace about 2% of their genomes to admixture from Neanderthals, which occurred 50-60 ...thousand years ago
. Here we examine the effect of this event using 14.4 million putative archaic chromosome fragments that were detected in fully phased whole-genome sequences from 27,566 Icelanders, corresponding to a range of 56,388-112,709 unique archaic fragments that cover 38.0-48.2% of the callable genome. On the basis of the similarity with known archaic genomes, we assign 84.5% of fragments to an Altai or Vindija Neanderthal origin and 3.3% to Denisovan origin; 12.2% of fragments are of unknown origin. We find that Icelanders have more Denisovan-like fragments than expected through incomplete lineage sorting. This is best explained by Denisovan gene flow, either into ancestors of the introgressing Neanderthals or directly into humans. A within-individual, paired comparison of archaic fragments with syntenic non-archaic fragments revealed that, although the overall rate of mutation was similar in humans and Neanderthals during the 500 thousand years that their lineages were separate, there were differences in the relative frequencies of mutation types-perhaps due to different generation intervals for males and females. Finally, we assessed 271 phenotypes, report 5 associations driven by variants in archaic fragments and show that the majority of previously reported associations are better explained by non-archaic variants.
Asthma is one of the most common chronic diseases affecting both children and adults. We report a genome-wide association meta-analysis of 69,189 cases and 702,199 controls from Iceland and UK ...biobank. We find 88 asthma risk variants at 56 loci, 19 previously unreported, and evaluate their effect on other asthma and allergic phenotypes. Of special interest are two low frequency variants associated with protection against asthma; a missense variant in TNFRSF8 and 3' UTR variant in TGFBR1. Functional studies show that the TNFRSF8 variant reduces TNFRSF8 expression both on cell surface and in soluble form, acting as loss of function. eQTL analysis suggests that the TGFBR1 variant acts through gain of function and together with an intronic variant in a downstream gene, SMAD3, points to defective TGFβR1 signaling as one of the biological perturbations increasing asthma risk. Our results increase the number of asthma variants and implicate genes with known role in T cell regulation, inflammation and airway remodeling in asthma pathogenesis.
Gallstones are responsible for one of the most common diseases in the Western world and are commonly treated with cholecystectomy. We perform a meta-analysis of two genome-wide association studies of ...gallstone disease in Iceland and the UK, totaling 27,174 cases and 736,838 controls, uncovering 21 novel gallstone-associated variants at 20 loci. Two distinct low frequency missense variants in SLC10A2, encoding the apical sodium-dependent bile acid transporter (ASBT), associate with an increased risk of gallstone disease (Pro290Ser: OR = 1.36 1.25-1.49, P = 2.1 × 10
, MAF = 1%; Val98Ile: OR = 1.15 1.10-1.20, P = 1.8 × 10
, MAF = 4%). We demonstrate that lower bile acid transport by ASBT is accompanied by greater risk of gallstone disease and highlight the role of the intestinal compartment of the enterohepatic circulation of bile acids in gallstone disease susceptibility. Additionally, two low frequency missense variants in SERPINA1 and HNF4A and 17 common variants represent novel associations with gallstone disease.
Kidney stone disease is a complex disorder with a strong genetic component. We conducted a genome-wide association study of 28.3 million sequence variants detected through whole-genome sequencing of ...2,636 Icelanders that were imputed into 5,419 kidney stone cases, including 2,172 cases with a history of recurrent kidney stones, and 279,870 controls. We identify sequence variants associating with kidney stones at ALPL (rs1256328T, odds ratio (OR)=1.21, P=5.8 × 10(-10)) and a suggestive association at CASR (rs7627468A, OR=1.16, P=2.0 × 10(-8)). Focusing our analysis on coding sequence variants in 63 genes with preferential kidney expression we identify two rare missense variants SLC34A1 p.Tyr489Cys (OR=2.38, P=2.8 × 10(-5)) and TRPV5 p.Leu530Arg (OR=3.62, P=4.1 × 10(-5)) associating with recurrent kidney stones. We also observe associations of the identified kidney stone variants with biochemical traits in a large population set, indicating potential biological mechanism.
Microsatellites are polymorphic tracts of short tandem repeats with one to six base-pair (bp) motifs and are some of the most polymorphic variants in the genome. Using 6084 Icelandic parent-offspring ...trios we estimate 63.7 (95% CI: 61.9-65.4) microsatellite de novo mutations (mDNMs) per offspring per generation, excluding one bp repeats motifs (homopolymers) the estimate is 48.2 mDNMs (95% CI: 46.7-49.6). Paternal mDNMs occur at longer repeats than maternal ones, which are in turn larger with a mean size of 3.4 bp vs 3.1 bp for paternal ones. mDNMs increase by 0.97 (95% CI: 0.90-1.04) and 0.31 (95% CI: 0.25-0.37) per year of father's and mother's age at conception, respectively. Here, we find two independent coding variants that associate with the number of mDNMs transmitted to offspring; The minor allele of a missense variant (allele frequency (AF) = 1.9%) in MSH2, a mismatch repair gene, increases transmitted mDNMs from both parents (effect: 13.1 paternal and 7.8 maternal mDNMs). A synonymous variant (AF = 20.3%) in NEIL2, a DNA damage repair gene, increases paternally transmitted mDNMs (effect: 4.4 mDNMs). Thus, the microsatellite mutation rate in humans is in part under genetic control.
Features of the QRS complex of the electrocardiogram, reflecting ventricular depolarisation, associate with various physiologic functions and several pathologic conditions. We test 32.5 million ...variants for association with ten measures of the QRS complex in 12 leads, using 405,732 electrocardiograms from 81,192 Icelanders. We identify 190 associations at 130 loci, the majority of which have not been reported before, including associations with 21 rare or low-frequency coding variants. Assessment of genes expressed in the heart yields an additional 13 rare QRS coding variants at 12 loci. We find 51 unreported associations between the QRS variants and echocardiographic traits and cardiovascular diseases, including atrial fibrillation, complete AV block, heart failure and supraventricular tachycardia. We demonstrate the advantage of in-depth analysis of the QRS complex in conjunction with other cardiovascular phenotypes to enhance our understanding of the genetic basis of myocardial mass, cardiac conduction and disease.
Creatine kinase (CK) and lactate dehydrogenase (LDH) are widely used markers of tissue damage. To search for sequence variants influencing serum levels of CK and LDH, 28.3 million sequence variants ...identified through whole-genome sequencing of 2,636 Icelanders were imputed into 63,159 and 98,585 people with CK and LDH measurements, respectively. Here we describe 13 variants associating with serum CK and 16 with LDH levels, including four that associate with both. Among those, 15 are non-synonymous variants and 12 have a minor allele frequency below 5%. We report sequence variants in genes encoding the enzymes being measured (CKM and LDHA), as well as in genes linked to muscular (ANO5) and immune/inflammatory function (CD163/CD163L1, CSF1, CFH, HLA-DQB1, LILRB5, NINJ1 and STAB1). A number of the genes are linked to the mononuclear/phagocyte system and clearance of enzymes from the serum. This highlights the variety in the sources of normal diversity in serum levels of enzymes.