The implementation of high-throughput sequencing (HTS) technologies in research and diagnostic laboratories has linked many new genes to rare bleeding, thrombotic, and platelet disorders (BTPD), and ...revealed multiple genetic variants linked to those disorders, many of them being of uncertain pathogenicity when considering the accepted evidence (variant consequence, frequency in control datasets, number of reported patients, prediction models, and functional assays). The sequencing effort has also resulted in resources for gathering disease-causing variants associated with specific genes, but for BTPD, such well-curated databases exist only for a few genes. On the other hand, submissions by individuals or diagnostic laboratories to the variant database ClinVar are hampered by the lack of a submission process tailored to capture the specific features of hemostatic diseases. As we move toward the implementation of HTS in the diagnosis of BTPD, the Scientific and Standardization Committee for Genetics in Thrombosis and Haemostasis has developed and tested a REDCap-based interface, aimed at the community, to submit curated genetic variants for diagnostic-grade BTPD genes. Here, we describe the use of the interface and the initial submission of 821 variants from 30 different centers covering 14 countries. This open-access variant resource will be shared with the community to improve variant classification and regular bulk data transfer to ClinVar.
Summary Background Obesity is a major health problem that is determined by interactions between lifestyle and environmental and genetic factors. Although associations between several genetic variants ...and body-mass index (BMI) have been identified, little is known about epigenetic changes related to BMI. We undertook a genome-wide analysis of methylation at CpG sites in relation to BMI. Methods 479 individuals of European origin recruited by the Cardiogenics Consortium formed our discovery cohort. We typed their whole-blood DNA with the Infinium HumanMethylation450 array. After quality control, methylation levels were tested for association with BMI. Methylation sites showing an association with BMI at a false discovery rate q value of 0·05 or less were taken forward for replication in a cohort of 339 unrelated white patients of northern European origin from the MARTHA cohort. Sites that remained significant in this primary replication cohort were tested in a second replication cohort of 1789 white patients of European origin from the KORA cohort. We examined whether methylation levels at identified sites also showed an association with BMI in DNA from adipose tissue (n=635) and skin (n=395) obtained from white female individuals participating in the MuTHER study. Finally, we examined the association of methylation at BMI-associated sites with genetic variants and with gene expression. Findings 20 individuals from the discovery cohort were excluded from analyses after quality-control checks, leaving 459 participants. After adjustment for covariates, we identified an association (q value ≤0·05) between methylation at five probes across three different genes and BMI. The associations with three of these probes—cg22891070, cg27146050, and cg16672562, all of which are in intron 1 of HIF3A —were confirmed in both the primary and second replication cohorts. For every 0·1 increase in methylation β value at cg22891070, BMI was 3·6% (95% CI 2·4–4·9) higher in the discovery cohort, 2·7% (1·2–4·2) higher in the primary replication cohort, and 0·8% (0·2–1·4) higher in the second replication cohort. For the MuTHER cohort, methylation at cg22891070 was associated with BMI in adipose tissue (p=1·72 × 10−5 ) but not in skin (p=0·882). We observed a significant inverse correlation (p=0·005) between methylation at cg22891070 and expression of one HIF3A gene-expression probe in adipose tissue. Two single nucleotide polymorphisms—rs8102595 and rs3826795—had independent associations with methylation at cg22891070 in all cohorts. However, these single nucleotide polymorphisms were not significantly associated with BMI. Interpretation Increased BMI in adults of European origin is associated with increased methylation at the HIF3A locus in blood cells and in adipose tissue. Our findings suggest that perturbation of hypoxia inducible transcription factor pathways could have an important role in the response to increased weight in people. Funding The European Commission, National Institute for Health Research, British Heart Foundation, and Wellcome Trust.
Pulmonary arterial hypertension is a devastating disease with high mortality. Familial cases of pulmonary arterial hypertension are usually characterized by autosomal dominant transmission with ...reduced penetrance, and some familial cases have unknown genetic causes.
We studied a family in which multiple members had pulmonary arterial hypertension without identifiable mutations in any of the genes known to be associated with the disease, including BMPR2, ALK1, ENG, SMAD9, and CAV1. Three family members were studied with whole-exome sequencing. Additional patients with familial or idiopathic pulmonary arterial hypertension were screened for the mutations in the gene that was identified on whole-exome sequencing. All variants were expressed in COS-7 cells, and channel function was studied by means of patch-clamp analysis.
We identified a novel heterozygous missense variant c.608 G→A (G203D) in KCNK3 (the gene encoding potassium channel subfamily K, member 3) as a disease-causing candidate gene in the family. Five additional heterozygous missense variants in KCNK3 were independently identified in 92 unrelated patients with familial pulmonary arterial hypertension and 230 patients with idiopathic pulmonary arterial hypertension. We used in silico bioinformatic tools to predict that all six novel variants would be damaging. Electrophysiological studies of the channel indicated that all these missense mutations resulted in loss of function, and the reduction in the potassium-channel current was remedied by the application of the phospholipase inhibitor ONO-RS-082.
Our study identified the association of a novel gene, KCNK3, with familial and idiopathic pulmonary arterial hypertension. Mutations in this gene produced reduced potassium-channel current, which was successfully remedied by pharmacologic manipulation. (Funded by the National Institutes of Health.)
Background Systemic iron status has been implicated in atherosclerosis and thrombosis. The aim of this study was to investigate the effect of genetically determined iron status on carotid ...intima-media thickness, carotid plaque, and venous thromboembolism using Mendelian randomization. Methods and Results Genetic instrumental variables for iron status were selected from a genome-wide meta-analysis of 48 972 subjects. Genetic association estimates for carotid intima-media thickness and carotid plaque were obtained using data from 71 128 and 48 434 participants, respectively, and estimates for venous thromboembolism were obtained using data from a study incorporating 7507 cases and 52 632 controls. Conventional 2-sample summary data Mendelian randomization was performed for the main analysis. Higher genetically determined iron status was associated with increased risk of venous thromboembolism. Odds ratios per SD increase in biomarker levels were 1.37 (95% CI 1.14-1.66) for serum iron, 1.25 (1.09-1.43) for transferrin saturation, 1.92 (1.28-2.88) for ferritin, and 0.76 (0.63-0.92) for serum transferrin (with higher transferrin levels representing lower iron status). In contrast, higher iron status was associated with lower risk of carotid plaque. Corresponding odds ratios were 0.85 (0.73-0.99) for serum iron and 0.89 (0.80-1.00) for transferrin saturation, with concordant trends for serum transferrin and ferritin that did not reach statistical significance. There was no Mendelian randomization evidence of an effect of iron status on carotid intima-media thickness. Conclusions These findings support previous work to suggest that higher genetically determined iron status is protective against some forms of atherosclerotic disease but increases the risk of thrombosis related to stasis of blood.
Pulmonary veno-occlusive disease (PVOD) is a rare and devastating cause of pulmonary hypertension that is characterized histologically by widespread fibrous intimal proliferation of septal veins and ...preseptal venules and is frequently associated with pulmonary capillary dilatation and proliferation. PVOD is categorized into a separate pulmonary arterial hypertension-related group in the current classification of pulmonary hypertension. PVOD presents either sporadically or as familial cases with a seemingly recessive mode of transmission. Using whole-exome sequencing, we detected recessive mutations in EIF2AK4 (also called GCN2) that cosegregated with PVOD in all 13 families studied. We also found biallelic EIF2AK4 mutations in 5 of 20 histologically confirmed sporadic cases of PVOD. All mutations, either in a homozygous or compound-heterozygous state, disrupted the function of the gene. These findings point to EIF2AK4 as the major gene that is linked to PVOD development and contribute toward an understanding of the complex genetic architecture of pulmonary hypertension.
Dilated cardiomyopathy (DCM) is a heart disease characterized by left ventricular dilatation and systolic dysfunction. In 30% of cases, pathogenic variants, essentially private to each patient, are ...identified in at least one of almost 50 reported genes. Thus, while costly, exons capture-based Next Generation Sequencing (NGS) of a targeted gene panel appears as the best strategy to genetically diagnose DCM. Here, we report a NGS strategy applied to pools of 8 DNAs from DCM patients and validate its robustness for rare variants detection at 4-fold reduced cost. Our pipeline uses Freebayes to detect variants with the expected 1/16 allele frequency. From the whole set of detected rare variants in 96 pools we set the variants quality parameters optimizing true positives calling. When compared to simplex DNA sequencing in a shared subset of 50 DNAs, 96% of SNVs/InsDel were accurately identified in pools. Extended to the 384 DNAs included in the study, we detected 100 variants (ACMG class 4 and 5), mostly in well-known morbid gene causing DCM such as TTN, MYH7, FLNC, and TNNT2. To conclude, we report an original pool-sequencing NGS method accurately detecting rare variants. This innovative approach is cost-effective for genetic diagnostic in rare diseases.
Technical variation plays an important role in microarray-based gene expression studies, and batch effects explain a large proportion of this noise. It is therefore mandatory to eliminate technical ...variation while maintaining biological variability. Several strategies have been proposed for the removal of batch effects, although they have not been evaluated in large-scale longitudinal gene expression data. In this study, we aimed at identifying a suitable method for batch effect removal in a large study of microarray-based longitudinal gene expression. Monocytic gene expression was measured in 1092 participants of the Gutenberg Health Study at baseline and 5-year follow up. Replicates of selected samples were measured at both time points to identify technical variability. Deming regression, Passing-Bablok regression, linear mixed models, non-linear models as well as ReplicateRUV and ComBat were applied to eliminate batch effects between replicates. In a second step, quantile normalization prior to batch effect correction was performed for each method. Technical variation between batches was evaluated by principal component analysis. Associations between body mass index and transcriptomes were calculated before and after batch removal. Results from association analyses were compared to evaluate maintenance of biological variability. Quantile normalization, separately performed in each batch, combined with ComBat successfully reduced batch effects and maintained biological variability. ReplicateRUV performed perfectly in the replicate data subset of the study, but failed when applied to all samples. All other methods did not substantially reduce batch effects in the replicate data subset. Quantile normalization plus ComBat appears to be a valuable approach for batch correction in longitudinal gene expression data.
Summary
Venous thromboembolism (VTE) has a strong genetic component. This review summarizes what is known at the seventeen genes that are now well established to harbour VTE‐associated genetic ...variants. In addition, it discusses additional candidate genes that deserve further validation before being claimed as VTE associated genes. Finally, several research strategies are briefly described to identify other molecular determinants of the disease.
Genetics of Venous Thrombosis: update in 2015 Morange, Pierre-Emmanuel; Suchon, Pierre; Trégouët, David-Alexandre
Thrombosis and haemostasis,
11/2015, Volume:
114, Issue:
5
Journal Article
Peer reviewed
Venous thrombosis (VT) is a common multifactorial disease with a genetic component that was first suspected nearly 60 years ago. In this review, we document the genetic determinants of the disease, ...and update recent findings delivered by the application of high-throughput genotyping and sequencing technologies. To date, 17 genes have been robustly demonstrated to harbour genetic variations associated with VT risk: ABO, F2, F5, F9, F11, FGG, GP6, KNG1, PROC, PROCR, PROS1, SERPINC1, SLC44A2, STXBP5, THBD, TSPAN15 and VWF. The common polymorphisms are estimated to account only for a modest part (~5 %) of the VT heritability. Much remains to be done to fully disentangle the exact genetic (and epigenetic) architecture of the disease. A large suite of powerful tools and research strategies can be deployed on the large collections of patients that have already been assembled (and additional are ongoing).
Many human traits are highly correlated. This correlation can be leveraged to improve the power of genetic association tests to identify markers associated with one or more of the traits. Principal ...component analysis (PCA) is a useful tool that has been widely used for the multivariate analysis of correlated variables. PCA is usually applied as a dimension reduction method: the few top principal components (PCs) explaining most of total trait variance are tested for association with a predictor of interest, and the remaining components are not analyzed. In this study we review the theoretical basis of PCA and describe the behavior of PCA when testing for association between a SNP and correlated traits. We then use simulation to compare the power of various PCA-based strategies when analyzing up to 100 correlated traits. We show that contrary to widespread practice, testing only the top PCs often has low power, whereas combining signal across all PCs can have greater power. This power gain is primarily due to increased power to detect genetic variants with opposite effects on positively correlated traits and variants that are exclusively associated with a single trait. Relative to other methods, the combined-PC approach has close to optimal power in all scenarios considered while offering more flexibility and more robustness to potential confounders. Finally, we apply the proposed PCA strategy to the genome-wide association study of five correlated coagulation traits where we identify two candidate SNPs that were not found by the standard approach.