Burkitt lymphoma (BL) is an aggressive, MYC-driven lymphoma comprising 3 distinct clinical subtypes: sporadic BLs that occur worldwide, endemic BLs that occur predominantly in sub-Saharan Africa, and ...immunodeficiency-associated BLs that occur primarily in the setting of HIV. In this study, we comprehensively delineated the genomic basis of BL through whole-genome sequencing (WGS) of 101 tumors representing all 3 subtypes of BL to identify 72 driver genes. These data were additionally informed by CRISPR screens in BL cell lines to functionally annotate the role of oncogenic drivers. Nearly every driver gene was found to have both coding and non-coding mutations, highlighting the importance of WGS for identifying driver events. Our data implicate coding and non-coding mutations in IGLL5, BACH2, SIN3A, and DNMT1. Epstein-Barr virus (EBV) infection was associated with higher mutation load, with type 1 EBV showing a higher mutational burden than type 2 EBV. Although sporadic and immunodeficiency-associated BLs had similar genetic profiles, endemic BLs manifested more frequent mutations in BCL7A and BCL6 and fewer genetic alterations in DNMT1, SNTB2, and CTCF. Silencing mutations in ID3 were a common feature of all 3 subtypes of BL. In vitro, mass spectrometry–based proteomics demonstrated that the ID3 protein binds primarily to TCF3 and TCF4. In vivo knockout of ID3 potentiated the effects of MYC, leading to rapid tumorigenesis and tumor phenotypes consistent with those observed in the human disease.
•We investigated all 3 subtypes of BL by WGS and transcriptome sequencing.•Experimental validation through CRISPR screening and mouse models provides a better functional understanding of BL genetic drivers.
Display omitted
Understanding mechanisms of hepatocellular damage may lead to new treatments for liver disease, and genome-wide association studies (GWAS) of alanine aminotransferase (ALT) and aspartate ...aminotransferase (AST) serum activities have proven useful for investigating liver biology. Here we report 100 loci associating with both enzymes, using GWAS across 411,048 subjects in the UK Biobank. The rare missense variant SLC30A10 Thr95Ile (rs188273166) associates with the largest elevation of both enzymes, and this association replicates in the DiscovEHR study. SLC30A10 excretes manganese from the liver to the bile duct, and rare homozygous loss of function causes the syndrome hypermanganesemia with dystonia-1 (HMNDYT1) which involves cirrhosis. Consistent with hematological symptoms of hypermanganesemia, SLC30A10 Thr95Ile carriers have increased hematocrit and risk of iron deficiency anemia. Carriers also have increased risk of extrahepatic bile duct cancer. These results suggest that genetic variation in SLC30A10 adversely affects more individuals than patients with diagnosed HMNDYT1.
Sequencing of large cohorts offers an unprecedented opportunity to identify rare genetic variants and to find novel contributors to human disease. We used gene-based collapsing tests to identify ...genes associated with glucose, HbA1c and type 2 diabetes (T2D) diagnosis in 379,066 exome-sequenced participants in the UK Biobank. We identified associations for variants in GCK, HNF1A and PDX1, which are known to be involved in Mendelian forms of diabetes. Notably, we uncovered novel associations for GIGYF1, a gene not previously implicated by human genetics in diabetes. GIGYF1 predicted loss of function (pLOF) variants associated with increased levels of glucose (0.77 mmol/L increase, p = 4.42 × 10
) and HbA1c (4.33 mmol/mol, p = 1.28 × 10
) as well as T2D diagnosis (OR = 4.15, p = 6.14 × 10
). Multiple rare variants contributed to these associations, including singleton variants. GIGYF1 pLOF also associated with decreased cholesterol levels as well as an increased risk of hypothyroidism. The association of GIGYF1 pLOF with T2D diagnosis replicated in an independent cohort from the Geisinger Health System. In addition, a common variant association for glucose and T2D was identified at the GIGYF1 locus. Our results highlight the role of GIGYF1 in regulating insulin signaling and protecting from diabetes.
Non-Hodgkin lymphomas (NHLs) are a diverse group of entities, both clinically and molecularly. Here, we review the evolution of classification schemes in B-cell lymphoma, noting the now standard WHO ...classification system that is based on immune cell-of-origin and molecular phenotypes. We review how lymphomas arise throughout the B-cell development process as well as the molecular and clinical features of prominent B-cell lymphomas. We provide an overview of the major progress that has occurred over the past decade in terms of our molecular understanding of these diseases. We discuss treatment options available and focus on a number of the diverse research tools that have been employed to improve our understanding of these diseases. We discuss the problem of heterogeneity in lymphomas and anticipate that the near future will bring significant advances that provide a measurable impact on NHL outcomes.
Background
Severe alpha‐1‐antitrypsin deficiency (AATD), phenotype PiZZ, was associated with venous thromboembolism (VTE) in a case‐control study.
Objectives
This study aimed to determine the genetic ...variation in the SERPINA1 gene and a possible thrombotic risk of these variants in a population‐based cohort study.
Patients/Methods
The coding sequence of SERPINA1 was analyzed for the Z (rs28929474), S (rs17580), and other qualifying variants in 28,794 subjects without previous VTE (born 1923–1950, 60% women), who participated in the Malmö Diet and Cancer study (1991–1996). Individuals were followed from baseline until the first event of VTE, death, or 2018.
Results
Resequencing the coding sequence of SERPINA1 identified 84 variants in the total study population, 21 synonymous, 62 missense, and 1 loss‐of‐function variant. Kaplan‐Meier analysis showed that homozygosity for the Z allele increased the risk of VTE whereas heterozygosity showed no effect. The S (rs17580) variant was not associated with VTE. Thirty‐one rare variants were qualifying and included in collapsing analysis using the following selection criteria, loss of function, in frame deletion or non‐benign (PolyPhen‐2) missense variants with minor allele frequency (MAF) <0.1%. Combining the rare qualifying variants with the Z variant showed that carrying two alleles (ZZ or compound heterozygotes) showed increased risk. Cox regression analysis revealed an adjusted hazard ratio of 4.5 (95% confidence interval 2.0–10.0) for combinations of the Z variant and rare qualifying variants. One other variant (rs141620200; MAF = 0.002) showed an increased risk of VTE.
Conclusions
The SERPINA1 ZZ genotype and compound heterozygotes for severe AATD are rare but associated with VTE in a population‐based Swedish study.
Genome-wide association studies have identified hundreds of single nucleotide variations (formerly single nucleotide polymorphisms) associated with several cancers, but the predictive ability of ...polygenic risk scores (PRSs) is unclear, especially among non-Whites.
PRSs were derived from genome-wide significant single-nucleotide variations for 15 cancers in 20,079 individuals in an academic biobank. We evaluated the improvement in discriminatory accuracy by including cancer-specific PRS in patients of genetically-determined African and European ancestry.
Among the individuals of European genetic ancestry, PRSs for breast, colon, melanoma, and prostate were significantly associated with their respective cancers. Among the individuals of African genetic ancestry, PRSs for breast, colon, prostate, and thyroid were significantly associated with their respective cancers. The area under the curve of the model consisting of age, sex, and principal components was 0.621 to 0.710, and it increased by 1% to 4% with the inclusion of PRS in individuals of European genetic ancestry. In individuals of African genetic ancestry, area under the curve was overall higher in the model without the PRS (0.723-0.810) but increased by <1% with the inclusion of PRS for most cancers.
PRS moderately increased the ability to discriminate the cancer status in individuals of European but not African ancestry. Further large-scale studies are needed to identify ancestry-specific genetic factors in non-White populations to incorporate PRS into cancer risk assessment.
The advent of new genomic approaches, particularly next generation sequencing (NGS) has resulted in explosive growth of biological data. As the size of biological data keeps growing at exponential ...rates, new methods for data management and data processing are becoming essential in bioinformatics and computational biology. Indeed, data analysis has now become the central challenge in genomics.NGS has provided rich tools for defining genomic alterations that cause cancer. The processing time and computing requirements have now become a serious bottleneck to the characterization and analysis of these genomic alterations. Moreover, as the adoption of NGS continues to increase, the computing power required often exceeds what any single institution can provide, leading to major restraints in the type and number of analyses that can be performed.Cloud computing represents a potential solution to this problem. On a cloud platform, computing resources can be available on-demand, thus allowing users to implement scalable and highly parallel methods. However, few centralized frameworks exist to allow the average researcher the ability to apply bioinformatics workflows using cloud resources. Moreover, bioinformatics approaches are associated with multiple processing challenges, such as the variability in the methods or data used and the reproducibility requirements of the research analysis.Here, we present CloudConductor, a software system that is specifically designed to harness the power of cloud computing to perform complex analysis pipelines on large biological datasets. CloudConductor was designed with five central features in mind: scalability, modularity, parallelism, reproducibility and platform agnosticism.We demonstrate the processing power afforded by CloudConductor on a real-world genomics problem. Using CloudConductor, we processed and analyzed 101 whole genome tumor-normal paired samples from Burkitt lymphoma subtypes to identify novel genomic alterations. We identified a total of 72 driver genes associated with the disease. Somatic events were identified in both coding and non-coding regions of nearly all driver genes, notably in genes IGLL5, BACH2, SIN3A, and DNMT1. We have developed the analysis framework by implementing a graphical user interface, a back-end database system, a data loader and a workflow management system.In this thesis, we develop the concepts and describe an implementation of automated cloud-based infrastructure to analyze genomics data, creating a fast and efficient analysis resource for genomics researchers.
Background Five classic thrombophilias have been recognized: factor V Leiden (rs6025), the prothrombin G20210A variant (rs1799963), and protein C, protein S, and antithrombin deficiencies. This study ...aimed to determine the thrombotic risk of classic thrombophilias in a cohort of middle-aged and older adults. Methods and Results Factor V Leiden, prothrombin G20210A and protein-coding variants in the
(protein C),
(protein S), and
(antithrombin) anticoagulant genes were determined in 29 387 subjects (born 1923-1950, 60% women) who participated in the Malmö Diet and Cancer study (1991-1996). The Human Gene Mutation Database was used to define 68 disease-causing mutations. Patients were followed up from baseline until the first event of venous thromboembolism (VTE), death, or Dec 31, 2018. Carriership (n=908, 3.1%) for disease-causing mutations in the
,
, and
genes was associated with incident VTE: Hazard ratio (HR) was 1.6 (95% CI, 1.3-1.9). Variants not in Human Gene Mutation Database were not linked to VTE (HR, 1.1; 95% CI, 0.8-1.5). Heterozygosity for rs6025 and rs1799963 was associated with incident VTE: HR, 1.8 (95% CI, 1.6-2.0) and HR, 1.6 (95% CI, 1.3-2.0), respectively. The HR for carrying 1 classical thrombophilia variant was 1.7 (95% CI, 1.6-1.9). HR was 3.9 (95% CI, 3.1-5.0) for carriers of ≥2 thrombophilia variants. Conclusions The 5 classic thrombophilias are associated with a dose-graded risk of VTE in middle-aged and older adults. Disease-causing variants in the
,
, and
genes were more common than the rs1799963 variant but the conferred genetic risk was comparable with the rs6025 and rs1799963 variants.
Follicular Lymphoma (FL) is the most common indolent lymphoma derived from light zone germinal center B cells and characterized by a t(14;18) translocation resulting in upregulation of BCL2 in over ...80% of cases. This translocation alone is not sufficient for tumorogenesis, and must be combined with additional genetic mutations to transform B cells. FL is incurable and the disease course can be highly varied, with survival ranging from a few months to decades following diagnosis and treatment with standard chemoimmunotherapy. The heterogeneity of FL poses major challenges to identifying the association of genetic alterations and clinical outcome.
Current WHO guidelines recommend establishing grade for each FL case with grade 3 thought to be more aggressive than 1 and 2. The genetic basis and clinical implications of grade in FL are unclear. Recent sequencing studies have identified many genes found to be recurrently mutated in FL including KMT2D and CREBBP. However, the degree to which genetic alterations cooperate with each other or contribute to clinical outcome is unclear. Based on the observed mutational rates in follicular lymphoma, we estimated 900 cases were needed to comprehensively delineate the genetic alterations that underlie histologic grade and clinical outcome.
Accordingly, we enrolled a cohort of 1042 patients with newly diagnosed FL. All treated patients received rituximab-containing standard regimens. To go beyond the identification of gene-coding events, we developed a very large panel of 110 Mbp covering exonic (~40Mbp) and non-exonic regions (~70Mbp) of interest to enable a wide range of genomic analysis including mutation calling in both coding and non-coding regions, rearrangement detection, viral identification, and copy number analysis. In addition to the whole exome, we extended coverage to include introns, promoters, and untranslated regions of all known driver genes in cancer. We included the entirety of the immunoglobulin loci, T-cell receptor loci and CD3 loci to detect clonotypes and rearrangements. We also included lymphoma-relevant long non-coding RNAs, microRNAs, enhancers, and breakpoint-prone regions. For viral detection, we targeted the genomes of eight cancer-related viruses: Epstein-Barr virus, human papillomavirus, human immunodeficiency virus, hepatitis B, hepatitis C, Kaposi's sarcoma-associated herpesvirus, human T-lymphotropic virus, and Merkel cell polyomavirus. In addition, to enable high resolution identification of copy number variation (CNV) calls, the entire genome was tiled with probes spaced 10kb apart.
DNA and RNA were extracted from all tumors and their paired normal samples, prepared into DNA and RNA sequencing libraries and subjected to sequencing on the Illumina platform to a targeted coverage of 150X. Somatic events were identified and further filtered to identify driver events in both coding and non-coding regions.
FLs demonstrated a significant degree of genetic heterogeneity with over 100 genes mutated with a frequency of at least 2%. Nearly 100% of FL cases had a mutation in at least one chromatin-modifying gene. The most frequently mutated genes in follicular lymphoma were KMT2D, BCL2, IGLL5 and CREBBP. In addition, we identified frequent mutations in SPEN, BIRC6 and SETD2. To our knowledge, this is the first description of alterations in these genes in FL. Transcriptome analysis indicated a strong correlation between BIRC6 mutations and the previously described immune response 2 signature that is associated with a poor prognosis. We further performed unbiased clustering of genetic alterations in these FL cases. We identified a cluster that was specifically enriched in BCL6 and TP53 alterations and was strongly associated with grade 3 FLs which are predicted to have poorer outcomes with low intensity therapies. We further examined the genetic profiles of 1001 DLBCLs in comparison to this cohort of FLs. Our data indicate a continuum of highly overlapping genetic alterations with DLBCL displaying more complex patterns that included alterations in MYC, TP53 and CDKN2A (mainly copy number losses), indicating shared pathogenetic mechanisms underlying FL and DLBCL, particularly those germinal center B cell origin.
Koff:Burroughs Wellcome Fund: Research Funding; V Foundation: Research Funding; Lymphoma Research Foundation: Research Funding; American Association for Cancer Research: Research Funding. Leppä:Roche: Honoraria, Research Funding; Takeda: Honoraria, Membership on an entity's Board of Directors or advisory committees, Research Funding; Bayer: Research Funding; Celgene: Membership on an entity's Board of Directors or advisory committees, Research Funding; Janssen-Cilag: Research Funding; Novartis: Membership on an entity's Board of Directors or advisory committees. Gang:ROCHE: Membership on an entity's Board of Directors or advisory committees; Novartis: Membership on an entity's Board of Directors or advisory committees. Hsi:Abbvie: Research Funding; Eli Lilly: Research Funding; Cleveland Clinic&Abbvie Biotherapeutics Inc: Patents & Royalties: US8,603,477 B2; Jazz: Consultancy. Flowers:AbbVie: Consultancy, Research Funding; Denovo Biopharma: Consultancy; BeiGene: Consultancy, Research Funding; Burroughs Wellcome Fund: Research Funding; Eastern Cooperative Oncology Group: Research Funding; National Cancer Institute: Research Funding; V Foundation: Research Funding; Optimum Rx: Consultancy; Millenium/Takeda: Research Funding; TG Therapeutics: Research Funding; Gilead: Consultancy, Research Funding; Celgene: Consultancy, Research Funding; Karyopharm: Consultancy; AstraZeneca: Consultancy; Pharmacyclics/Janssen: Consultancy, Research Funding; Spectrum: Consultancy; Bayer: Consultancy; Acerta: Research Funding; Genentech, Inc./F. Hoffmann-La Roche Ltd: Consultancy, Research Funding. Neff:Enzyvant: Consultancy; EUSA Pharma: Honoraria, Membership on an entity's Board of Directors or advisory committees. Fedoriw:Alexion Pharmaceuticals: Other: Consultant and Speaker. Reddy:Genentech: Research Funding; BMS: Consultancy, Research Funding; Celgene: Consultancy; KITE Pharma: Consultancy; Abbvie: Consultancy. Mason:Sysmex: Honoraria. Behdad:Loxo-Bayer: Membership on an entity's Board of Directors or advisory committees; Thermo Fisher: Membership on an entity's Board of Directors or advisory committees; Pfizer: Other: Speaker. Burton:Bristol-Myers Squibb: Honoraria, Membership on an entity's Board of Directors or advisory committees; Roche: Honoraria, Membership on an entity's Board of Directors or advisory committees, Other: Travel; Celgene: Membership on an entity's Board of Directors or advisory committees; Takeda: Honoraria, Membership on an entity's Board of Directors or advisory committees. Dave:Data Driven Bioscience: Equity Ownership.
Tissue factor is the main initiator of blood coagulation, and tissue factor pathway inhibitor (TFPI) is the primary inhibitor of the initiation of blood coagulation.
The genetic variation of TFPI and ...the relation to venous thromboembolism (VTE), that is, venous thrombosis and pulmonary embolism, remains to be clarified. This exome sequencing study aimed to determine the molecular epidemiology of the TFPI gene and the relation to VTE in a large population‐based cohort of middle‐aged and older adults.
The exomes of TFPI were analyzed for variants in 28,794 subjects without previous VTE (born 1923–1950, 60% women), who participated in the Malmö Diet and Cancer Study (1991–1996). Patients were followed until the first event of VTE, death, or 2018. Qualifying variants were defined as loss‐of‐function or nonbenign (PolyPhen‐2) missense variants with minor allele frequency less than 0.1%.
No common variant was associated with VTE. Nine rare variants (two loss‐of‐function and seven nonbenign missense) were classified as qualifying and included in collapsing analysis. Prevalence of qualifying variants was 0.09%. Five individuals with VTE compared to 17 individuals without VTE carried one qualifying variant. Cox multivariate regression analysis adjusted for age, sex, body mass index, systolic blood pressure, smoking and alcohol consumption, rs6025, rs1799963, and ancestry showed a hazard ratio of 2.9 (95% CI, 1.2–7.1) for rare qualifying variants.
Rare qualifying TFPI variants were associated with VTE, suggesting that rare variants in TFPI contribute to the development of VTE. The qualifying TFPI gene variants were very rare, suggesting a constrained gene.
Display omitted