Benchmark small variant calls are required for developing, optimizing and assessing the performance of sequencing and bioinformatics methods. Here, as part of the Genome in a Bottle (GIAB) ...Consortium, we apply a reproducible, cloud-based pipeline to integrate multiple short- and linked-read sequencing datasets and provide benchmark calls for human genomes. We generate benchmark calls for one previously analyzed GIAB sample, as well as six genomes from the Personal Genome Project. These new genomes have broad, open consent, making this a 'first of its kind' resource that is available to the community for multiple downstream applications. We produce 17% more benchmark single nucleotide variations, 176% more indels and 12% larger benchmark regions than previously published GIAB benchmarks. We demonstrate that this benchmark reliably identifies errors in existing callsets and highlight challenges in interpreting performance metrics when using benchmarks that are not perfect or comprehensive. Finally, we identify strengths and weaknesses of callsets by stratifying performance according to variant type and genome context.
Standardized benchmarking approaches are required to assess the accuracy of variants called from sequence data. Although variant-calling tools and the metrics used to assess their performance ...continue to improve, important challenges remain. Here, as part of the Global Alliance for Genomics and Health (GA4GH), we present a benchmarking framework for variant calling. We provide guidance on how to match variant calls with different representations, define standard performance metrics, and stratify performance by variant type and genome context. We describe limitations of high-confidence calls and regions that can be used as truth sets (for example, single-nucleotide variant concordance of two methods is 99.7% inside versus 76.5% outside high-confidence regions). Our web-based app enables comparison of variant calls against truth sets to obtain a standardized performance report. Our approach has been piloted in the PrecisionFDA variant-calling challenges to identify the best-in-class variant-calling methods within high-confidence regions. Finally, we recommend a set of best practices for using our tools and evaluating the results.
A new study highlights the biases and inaccuracies of polygenic risk scores (PRS) when predicting disease risk in individuals from populations other than those used in their derivation. The design ...bias of workhorse tools used for research, particularly genotyping arrays, contributes to these distortions. To avoid further inequities in health outcomes, the inclusion of diverse populations in research, unbiased genotyping, and methods of bias reduction in PRS are critical.
Genomics for the world Bustamante, Carlos D; Burchard, Esteban González; De la Vega, Francisco M
Nature (London),
07/2011, Volume:
475, Issue:
7355
Journal Article
Peer reviewed
Open access
If we do not, a biased picture will emerge of which variants are important, and genomic medicine will largely benefit a privileged few. Since the 1970s, geneticists have known that most of the ...genetic variance between individuals stems from differences in DNA sequence (genetic variants).
Endocrine-resistant HR+/HER2- breast cancer (BC) and triple-negative BC (TNBC) are of interest for molecularly informed treatment due to their aggressive natures and limited treatment profiles. ...Patients of African Ancestry (AA) experience higher rates of TNBC and mortality than European Ancestry (EA) patients, despite lower overall BC incidence. Here, we compare the molecular landscapes of AA and EA patients with HR+/HER2- BC and TNBC in a real-world cohort to promote equity in precision oncology by illuminating the heterogeneity of potentially druggable genomic and transcriptomic pathways.
De-identified records from patients with TNBC or HR+/HER2- BC in the Tempus Database were randomly selected (N = 5000), with most having stage IV disease. Mutations, gene expression, and transcriptional signatures were evaluated from next-generation sequencing data. Genetic ancestry was estimated from DNA-seq. Differences in mutational prevalence, gene expression, and transcriptional signatures between AA and EA were compared. EA patients were used as the reference population for log fold-changes (logFC) in expression.
After applying inclusion criteria, 3433 samples were evaluated (n = 623 AA and n = 2810 EA). Observed patterns of dysregulated pathways demonstrated significant heterogeneity among the two groups. Notably, PIK3CA mutations were significantly lower in AA HR+/HER2- tumors (AA = 34% vs. EA = 42%, P < 0.05) and the overall cohort (AA = 28% vs. EA = 37%, P = 2.08e-05). Conversely, KMT2C mutation was significantly more frequent in AA than EA TNBC (23% vs. 12%, P < 0.05) and HR+/HER2- (24% vs. 15%, P = 3e-03) tumors. Across all subtypes and stages, over 8000 genes were differentially expressed between the two ancestral groups including RPL10 (logFC = 2.26, P = 1.70e-162), HSPA1A (logFC = - 2.73, P = 2.43e-49), ATRX (logFC = - 1.93, P = 5.89e-83), and NUTM2F (logFC = 2.28, P = 3.22e-196). Ten differentially expressed gene sets were identified among stage IV HR+/HER2- tumors, of which four were considered relevant to BC treatment and were significantly enriched in EA: ERBB2_UP.V1_UP (P = 3.95e-06), LTE2_UP.V1_UP (P = 2.90e-05), HALLMARK_FATTY_ACID_METABOLISM (P = 0.0073), and HALLMARK_ANDROGEN_RESPONSE (P = 0.0074).
We observed significant differences in mutational spectra, gene expression, and relevant transcriptional signatures between patients with genetically determined African and European ancestries, particularly within the HR+/HER2- BC and TNBC subtypes. These findings could guide future development of treatment strategies by providing opportunities for biomarker-informed research and, ultimately, clinical decisions for precision oncology care in diverse populations.
In vitro cancer cultures, including three-dimensional organoids, typically contain exclusively neoplastic epithelium but require artificial reconstitution to recapitulate the tumor microenvironment ...(TME). The co-culture of primary tumor epithelia with endogenous, syngeneic tumor-infiltrating lymphocytes (TILs) as a cohesive unit has been particularly elusive. Here, an air-liquid interface (ALI) method propagated patient-derived organoids (PDOs) from >100 human biopsies or mouse tumors in syngeneic immunocompetent hosts as tumor epithelia with native embedded immune cells (T, B, NK, macrophages). Robust droplet-based, single-cell simultaneous determination of gene expression and immune repertoire indicated that PDO TILs accurately preserved the original tumor T cell receptor (TCR) spectrum. Crucially, human and murine PDOs successfully modeled immune checkpoint blockade (ICB) with anti-PD-1- and/or anti-PD-L1 expanding and activating tumor antigen-specific TILs and eliciting tumor cytotoxicity. Organoid-based propagation of primary tumor epithelium en bloc with endogenous immune stroma should enable immuno-oncology investigations within the TME and facilitate personalized immunotherapy testing.
Display omitted
•Air-liquid interface (ALI) patient-derived tumor organoids (PDO) retain immune cells•5′ V(D)J and RNA-seq from the same single cells allows robust immune characterization•T cell receptor repertoire is highly conserved between tumor and PDO•ALI PDOs functionally recapitulate the PD-1/PD-L1-dependent immune checkpoint
The tumor-immune microenvironment is modeled using a patient-derived organoid approach that preserves the original tumor T cell receptor spectrum and successfully models immune checkpoint blockade.
Abstract
The incidence and mortality of early onset colorectal cancer (EOCRC) is rising; outcomes appear to differ by race and ethnicity. We aimed to assess differences in mutational landscape and ...gene expression of EOCRC by racial and ethnic groups (non-Hispanic Asian, non-Hispanic Black, non-Hispanic White, White Hispanic) using data from the American Association for Cancer Research Project GENIE (10.2) and University of Texas Southwestern, the latter enriched in Hispanic patients. All statistical tests were 2-sided. Of 1752 EOCRC patients, non-Hispanic Black patients had higher rates of KRAS mutations (60.9%; P = .001, q = 0.015), and non-Hispanic White and non-Hispanic Black patients had higher rates of APC mutations (77.1% and 76.6% among non-Hispanic White and non-Hispanic Black patients, respectively; P = .001, q = 0.015) via the Fisher exact test with Benjamini-Hochberg correction. Using R packages DESeq2 and clusterProfiler, we found that White Hispanic patients had increased expression of genes involved in oxidative phosphorylation (P < .001, q = 0.025). Genomic profiling has the potential to identify novel diagnostics and influence individualized treatment options to address the currently limited prognosis of EOCRC.
We performed a genome-wide association study of 19,779 nonsynonymous SNPs in 735 individuals with Crohn disease and 368 controls. A total of 7,159 of these SNPs were informative. We followed up on ...all 72 SNPs with P ≤ 0.01 with an allele-based disease association test in 380 independent Crohn disease trios, 498 Crohn disease singleton cases and 1,032 controls. Disease association of rs2241880 in the autophagy-related 16-like 1 gene (ATG16L1) was replicated in these samples (P = 4.0 × 10−8) and confirmed in a UK case-control sample (P = 0.0004). By haplotype and regression analysis, we found that marker rs2241880, a coding SNP (T300A), carries virtually all the disease risk exerted by the ATG16L1 locus. The ATG16L1 gene encodes a protein in the autophagosome pathway that processes intracellular bacteria. We found a statistically significant interaction with respect to Crohn disease risk between rs2241880 and the established CARD15 susceptibility variants (P = 0.039). Together with the lack of association between rs2241880 and ulcerative colitis (P > 0.4), these data suggest that the underlying biological process may be specific to Crohn disease.
Clinical management of human cancer is dependent on the accurate monitoring of residual and recurrent tumors. The evaluation of patient-specific translocations in leukemias and lymphomas has ...revolutionized diagnostics for these diseases. We have developed a method, called personalized analysis of rearranged ends (PARE), which can identify translocations in solid tumors. Analysis of four colorectal and two breast cancers with massively parallel sequencing revealed an average of nine rearranged sequences (range, 4 to 15) per tumor. Polymerase chain reaction with primers spanning the breakpoints was able to detect mutant DNA molecules present at levels lower than 0.001% and readily identified mutated circulating DNA in patient plasma samples. This approach provides an exquisitely sensitive and broadly applicable approach for the development of personalized biomarkers to enhance the clinical management of cancer patients.