Structural variants (SVs), including small insertion and deletion variants (indels), are challenging to detect through standard alignment-based variant calling methods. Sequence assembly offers a ...powerful approach to identifying SVs, but is difficult to apply at scale genome-wide for SV detection due to its computational complexity and the difficulty of extracting SVs from assembly contigs. We describe SvABA, an efficient and accurate method for detecting SVs from short-read sequencing data using genome-wide local assembly with low memory and computing requirements. We evaluated SvABA's performance on the NA12878 human genome and in simulated and real cancer genomes. SvABA demonstrates superior sensitivity and specificity across a large spectrum of SVs and substantially improves detection performance for variants in the 20-300 bp range, compared with existing methods. SvABA also identifies complex somatic rearrangements with chains of short (<1000 bp) templated-sequence insertions copied from distant genomic regions. We applied SvABA to 344 cancer genomes from 11 cancer types and found that short templated-sequence insertions occur in ∼4% of all somatic rearrangements. Finally, we demonstrate that SvABA can identify sites of viral integration and cancer driver alterations containing medium-sized (50-300 bp) SVs.
We review chimeric antigen receptor (CAR) T-cell therapy for solid tumors. We discuss patient selection factors and aspects of clinical management. We describe challenges including physical and ...molecular barriers to trafficking CAR-Ts, an immunosuppressive tumor microenvironment, and difficulty finding cell surface target antigens. The application of new approaches in synthetic biology and cellular engineering toward solid tumor CAR-Ts is described. Finally, we summarize reported and ongoing clinical trials of CAR-T therapies for select disease sites such as head and neck (including thyroid cancer), lung, central nervous system (glioblastoma, neuroblastoma, glioma), sarcoma, genitourinary (prostate, renal, bladder, kidney), breast and ovarian cancer.
New technologies and analysis methods are enabling genomic structural variants (SVs) to be detected with ever-increasing accuracy, resolution and comprehensiveness. To help translate these methods to ...routine research and clinical practice, we developed a sequence-resolved benchmark set for identification of both false-negative and false-positive germline large insertions and deletions. To create this benchmark for a broadly consented son in a Personal Genome Project trio with broadly available cells and DNA, the Genome in a Bottle Consortium integrated 19 sequence-resolved variant calling methods from diverse technologies. The final benchmark set contains 12,745 isolated, sequence-resolved insertion (7,281) and deletion (5,464) calls ≥50 base pairs (bp). The Tier 1 benchmark regions, for which any extra calls are putative false positives, cover 2.51 Gbp and 5,262 insertions and 4,095 deletions supported by ≥1 diploid assembly. We demonstrate that the benchmark set reliably identifies false negatives and false positives in high-quality SV callsets from short-, linked- and long-read sequencing and optical mapping.
Determining how somatic copy number alterations (SCNAs) promote cancer is an important goal. We characterized SCNA patterns in 4,934 cancers from The Cancer Genome Atlas Pan-Cancer data set. ...Whole-genome doubling, observed in 37% of cancers, was associated with higher rates of every other type of SCNA, TP53 mutations, CCNE1 amplifications and alterations of the PPP2R complex. SCNAs that were internal to chromosomes tended to be shorter than telomere-bounded SCNAs, suggesting different mechanisms underlying their generation. Significantly recurrent focal SCNAs were observed in 140 regions, including 102 without known oncogene or tumor suppressor gene targets and 50 with significantly mutated genes. Amplified regions without known oncogenes were enriched for genes involved in epigenetic regulation. When levels of genomic disruption were accounted for, 7% of region pairs were anticorrelated, and these regions tended to encompass genes whose proteins physically interact, suggesting related functions. These results provide insights into mechanisms of generation and functional consequences of cancer-related SCNAs.
We developed VariantBam, a C ++ read filtering and profiling tool for use with BAM, CRAM and SAM sequencing files. VariantBam provides a flexible framework for extracting sequencing reads or ...read-pairs that satisfy combinations of rules, defined by any number of genomic intervals or variant sites. We have implemented filters based on alignment data, sequence motifs, regional coverage and base quality. For example, VariantBam achieved a median size reduction ratio of 3.1:1 when applied to 10 lung cancer whole genome BAMs by removing large tags and selecting for only high-quality variant-supporting reads and reads matching a large dictionary of sequence motifs. Thus VariantBam enables efficient storage of sequencing data while preserving the most relevant information for downstream analysis.
VariantBam and full documentation are available at github.com/jwalabroad/VariantBam
rameen@broadinstitute.org
Supplementary data are available at Bioinformatics online.
Nearly all prostate cancer deaths are from metastatic castration-resistant prostate cancer (mCRPC), but there have been few whole-genome sequencing (WGS) studies of this disease state. We performed ...linked-read WGS on 23 mCRPC biopsy specimens and analyzed cell-free DNA sequencing data from 86 patients with mCRPC. In addition to frequent rearrangements affecting known prostate cancer genes, we observed complex rearrangements of the AR locus in most cases. Unexpectedly, these rearrangements include highly recurrent tandem duplications involving an upstream enhancer of AR in 70%–87% of cases compared with <2% of primary prostate cancers. A subset of cases displayed AR or MYC enhancer duplication in the context of a genome-wide tandem duplicator phenotype associated with CDK12 inactivation. Our findings highlight the complex genomic structure of mCRPC, nominate alterations that may inform prostate cancer treatment, and suggest that additional recurrent events in the non-coding mCRPC genome remain to be discovered.
Display omitted
•Linked-read genome sequencing of mCRPC resolves haplotypes and rearrangements•CDK12 inactivation is associated with a global tandem duplication phenotype•A majority of cases have duplications of an enhancer of the androgen receptor•Progression on androgen pathway inhibitors is associated with gains in AR and AR enhancer
Linked-read genome sequencing data from patients highlight that amplification of an enhancer upstream of the androgen receptor locus is a key feature of metastatic castration-resistant prostate cancer.
Cancers may resist single-agent targeted therapies when the flux of cellular growth signals is shifted from one pathway to another. Blockade of multiple pathways may be necessary for effective ...inhibition of tumor growth. We document a case in which a patient with anaplastic thyroid carcinoma (ATC) failed to respond to either mTOR/PI3K or combined RAF/MEK inhibition but experienced a dramatic response when both drug regimens were combined.
Multi-region whole-exome sequencing of five diagnostic and four autopsy tumor biopsies was performed. Meta-analysis of DNA and RNA sequencing studies of ATC was performed.
Sequencing revealed truncal BRAF and PIK3CA mutations, which are known to activate the MAPK and PI3K/AKT pathways, respectively. Meta-analysis demonstrated 10.3% cooccurrence of MAPK and PI3K pathway alterations in ATC. These tumors display a separate transcriptional profile from other ATCs, consistent with a novel subgroup of ATC.
BRAF and PIK3CA mutations define a distinct subset of ATC. Blockade of the MAPK and PI3K pathways appears necessary for tumor response in this subset of ATC. This identification of synergistic activity between targeted agents may inform clinical trial design in ATC.
.
High-grade meningiomas frequently recur and are associated with high rates of morbidity and mortality. To determine the factors that promote the development and evolution of these tumors, we analyzed ...the genomes of 134 high-grade meningiomas and compared this information with data from 587 previously published meningiomas. High-grade meningiomas had a higher mutation burden than low-grade meningiomas but did not harbor any statistically significant mutated genes aside from
. High-grade meningiomas also possessed significantly elevated rates of chromosomal gains and losses, especially among tumors with monosomy 22. Meningiomas previously treated with adjuvant radiation had significantly more copy number alterations than radiation-induced or radiation-naïve meningiomas. Across serial recurrences, genomic disruption preceded the emergence of nearly all mutations, remained largely uniform across time, and when present in low-grade meningiomas, correlated with subsequent progression to a higher grade. In contrast to the largely stable copy number alterations, mutations were strikingly heterogeneous across tumor recurrences, likely due to extensive geographic heterogeneity in the primary tumor. While high-grade meningiomas harbored significantly fewer overtly targetable alterations than low-grade meningiomas, they contained numerous mutations that are predicted to be neoantigens, suggesting that immunologic targeting may be of therapeutic value.
Renal medullary carcinomas (RMCs) are rare kidney cancers that occur in adolescents and young adults of African ancestry. Although RMC is associated with the sickle cell trait and somatic loss of the ...tumor suppressor, SMARCB1, the ancestral origins of RMC remain unknown. Further, characterization of structural variants (SVs) involving SMARCB1 in RMC remains limited.
We used linked-read genome sequencing to reconstruct germline and somatic haplotypes in 15 unrelated patients with RMC registered on the Children's Oncology Group (COG) AREN03B2 study between 2006 and 2017 or from our prior study. We performed fine-mapping of the HBB locus and assessed the germline for cancer predisposition genes. Subsequently, we assessed the tumor samples for mutations outside of SMARCB1 and integrated RNA sequencing to interrogate the structural variants at the SMARCB1 locus.
We find that the haplotype of the sickle cell mutation in patients with RMC originated from three geographical regions in Africa. In addition, fine-mapping of the HBB locus identified the sickle cell mutation as the sole candidate variant. We further identify that the SMARCB1 structural variants are characterized by blunt or 1-bp homology events.
Our findings suggest that RMC does not arise from a single founder population and that the HbS allele is a strong candidate germline allele which confers risk for RMC. Furthermore, we find that the SVs that disrupt SMARCB1 function are likely repaired by non-homologous end-joining. These findings highlight how haplotype-based analyses using linked-read genome sequencing can be applied to identify potential risk variants in small and rare disease cohorts and provide nucleotide resolution to structural variants.
Recent studies have detailed the genomic landscape of primary endometrial cancers, but the evolution of these cancers into metastases has not been characterized. We performed whole-exome sequencing ...of 98 tumor biopsies including complex atypical hyperplasias, primary tumors and paired abdominopelvic metastases to survey the evolutionary landscape of endometrial cancer. We expanded and reanalyzed The Cancer Genome Atlas (TCGA) data, identifying new recurrent alterations in primary tumors, including mutations in the estrogen receptor cofactor gene NRIP1 in 12% of patients. We found that likely driver events were present in both primary and metastatic tissue samples, with notable exceptions such as ARID1A mutations. Phylogenetic analyses indicated that the sampled metastases typically arose from a common ancestral subclone that was not detected in the primary tumor biopsy. These data demonstrate extensive genetic heterogeneity in endometrial cancers and relative homogeneity across metastatic sites.