An estimated 15% or more of the cancer burden worldwide is attributable to known infectious agents. We screened colorectal carcinoma and matched normal tissue specimens using RNA-seq followed by host ...sequence subtraction and found marked over-representation of Fusobacterium nucleatum sequences in tumors relative to control specimens. F. nucleatum is an invasive anaerobe that has been linked previously to periodontitis and appendicitis, but not to cancer. Fusobacteria are rare constituents of the fecal microbiota, but have been cultured previously from biopsies of inflamed gut mucosa. We obtained a Fusobacterium isolate from a frozen tumor specimen; this showed highest sequence similarity to a known gut mucosa isolate and was confirmed to be invasive. We verified overabundance of Fusobacterium sequences in tumor versus matched normal control tissue by quantitative PCR analysis from a total of 99 subjects (p = 2.5 × 10(-6)), and we observed a positive association with lymph node metastasis.
Due to advances in sequencing technology, somatically mutated cancer antigens, or neoantigens, are now readily identifiable and have become compelling targets for immunotherapy. In particular, ...neoantigen-targeted vaccines have shown promise in several pre-clinical and clinical studies. However, to date, neoantigen-targeted vaccine studies have involved tumors with exceptionally high mutation burdens. It remains unclear whether neoantigen-targeted vaccines will be broadly applicable to cancers with intermediate to low mutation burdens, such as ovarian cancer. To address this, we assessed whether a derivative of the murine ovarian tumor model ID8 could be targeted with neoantigen vaccines. We performed whole exome and transcriptome sequencing on ID8-G7 cells. We identified 92 somatic mutations, 39 of which were transcribed, missense mutations. For the 17 top predicted MHC class I binding mutations, we immunized mice subcutaneously with synthetic long peptide vaccines encoding the relevant mutation. Seven of 17 vaccines induced robust mutation-specific CD4 and/or CD8 T cell responses. However, none of the vaccines prolonged survival of tumor-bearing mice in either the prophylactic or therapeutic setting. Moreover, none of the neoantigen-specific T cell lines recognized ID8-G7 tumor cells in vitro, indicating that the corresponding mutations did not give rise to bonafide MHC-presented epitopes. Additionally, bioinformatic analysis of The Cancer Genome Atlas data revealed that only 12% (26/220) of HGSC cases had a ≥90% likelihood of harboring at least one authentic, naturally processed and presented neoantigen versus 51% (80/158) of lung cancers. Our findings highlight the limitations of applying neoantigen-targeted vaccines to tumor types with intermediate/low mutation burdens.
Cytotoxic CD8
T cells recognize and eliminate infected or malignant cells that present peptide epitopes derived from intracellularly processed antigens on their surface. However, comprehensive ...profiling of specific major histocompatibility complex (MHC)-bound peptide epitopes that are naturally processed and capable of eliciting a functional T cell response has been challenging. Here, we report a method for deep and unbiased T cell epitope profiling, using in vitro co-culture of CD8
T cells together with target cells transduced with high-complexity, epitope-encoding minigene libraries. Target cells that are subject to cytotoxic attack from T cells in co-culture are isolated prior to apoptosis by fluorescence-activated cell sorting, and characterized by sequencing the encoded minigenes. We then validate this highly parallelized method using known murine T cell receptor/peptide-MHC pairs and diverse minigene-encoded epitope libraries. Our data thus suggest that this epitope profiling method allows unambiguous and sensitive identification of naturally processed and MHC-presented peptide epitopes.
The immunoglobulin heavy-chain locus (IGH) encodes variable (IGHV), diversity (IGHD), joining (IGHJ), and constant (IGHC) genes and is responsible for antibody heavy-chain biosynthesis, which is ...vital to the adaptive immune response. Programmed V-(D)-J somatic rearrangement and the complex duplicated nature of the locus have impeded attempts to reconcile its genomic organization based on traditional B-lymphocyte derived genetic material. As a result, sequence descriptions of germline variation within IGHV are lacking, haplotype inference using traditional linkage disequilibrium methods has been difficult, and the human genome reference assembly is missing several expressed IGHV genes. By using a hydatidiform mole BAC clone resource, we present the most complete haplotype of IGHV, IGHD, and IGHJ gene regions derived from a single chromosome, representing an alternate assembly of ∼1 Mbp of high-quality finished sequence. From this we add 101 kbp of previously uncharacterized sequence, including functional IGHV genes, and characterize four large germline copy-number variants (CNVs). In addition to this germline reference, we identify and characterize eight CNV-containing haplotypes from a panel of nine diploid genomes of diverse ethnic origin, discovering previously unmapped IGHV genes and an additional 121 kbp of insertion sequence. We genotype four of these CNVs by using PCR in 425 individuals from nine human populations. We find that all four are highly polymorphic and show considerable evidence of stratification (Fst = 0.3–0.5), with the greatest differences observed between African and Asian populations. These CNVs exhibit weak linkage disequilibrium with SNPs from two commercial arrays in most of the populations tested.
Sequence verification is essential for plasmids used as critical reagents or therapeutic products. Typically, high-quality plasmid sequence is achieved through capillary-based Sanger sequencing, ...requiring customized sets of primers for each plasmid. This process can become expensive, particularly for applications where the validated sequence needs to be produced within a regulated and quality-controlled environment for downstream clinical research applications.
Here, we describe a cost-effective and accurate plasmid sequencing and consensus generation procedure using the Oxford Nanopore Technologies' MinION device as an alternative to capillary-based plasmid sequencing options. This procedure can verify the identity of a pure population of plasmid, either confirming it matches the known and expected sequence, or identifying mutations present in the plasmid if any exist. We use a full MinION flow cell per plasmid, maximizing available data and allowing for stringent quality filters. Pseudopairing reads for consensus base calling reduces read error rates from 5.3 to 0.53%, and our pileup consensus approach provides per-base counts and confidence scores, allowing for interpretation of the certainty of the resulting consensus sequences. For pure plasmid samples, we demonstrate 100% accuracy in the resulting consensus sequence, and the sensitivity to detect small mutations such as insertions, deletions, and single nucleotide variants. In test cases where the sequenced pool of plasmids contains subclonal templates, detection sensitivity is similar to that of traditional capillary sequencing.
Our pipeline can provide significant cost savings compared to outsourcing clinical-grade sequencing of plasmids, making generation of high-quality plasmid sequence for clinical sequence verification more accessible. While other long-read-based methods offer higher-throughput and less cost, our pipeline produces complete and accurate sequence verification for cases where absolute sequence accuracy is required.
Immunotherapies have revolutionized cancer treatment. In particular, immune checkpoint therapy (ICT) leads to durable responses in some patients with some cancers. However, the majority of treated ...patients do not respond. Understanding immune mechanisms that underlie responsiveness to ICT will help identify predictive biomarkers of response and develop treatments to convert non-responding patients to responding ones. ICT primarily acts at the level of adaptive immunity. The specificity of adaptive immune cells, such as T and B cells, is determined by antigen-specific receptors. T cell repertoires can be comprehensively profiled by high-throughput sequencing at the bulk and single-cell level. T cell receptor (TCR) sequencing allows for sensitive tracking of dynamic changes in antigen-specific T cells at the clonal level, giving unprecedented insight into the mechanisms by which ICT alters T cell responses. Here, we review how the repertoire influences response to ICT and conversely how ICT affects repertoire diversity. We will also explore how changes to the repertoire in different anatomical locations can better correlate and perhaps predict treatment outcome. We discuss the advantages and limitations of current metrics used to characterize and represent TCR repertoire diversity. Discovery of predictive biomarkers could lie in novel analysis approaches, such as network analysis of amino acids similarities between TCR sequences. Single-cell sequencing is a breakthrough technology that can link phenotype with specificity, identifying T cell clones that are crucial for successful ICT. The field of immuno-sequencing is rapidly developing and cross-disciplinary efforts are required to maximize the analysis, application, and validation of sequencing data. Unravelling the dynamic behavior of the TCR repertoire during ICT will be highly valuable for tracking and understanding anti-tumor immunity, biomarker discovery, and ultimately for the development of novel strategies to improve patient outcomes.
Mucosal infiltration by certain bacterial species may contribute to the development and progression of colorectal cancer (CRC). There is considerable variation in reported detection rates in human ...CRC samples and the extent to which bacterial infiltration varies across regions of the primary tumour is unknown. This study aimed to determine if there is an optimal site for bacterial detection within CRC tumours.
Presence of target bacterial species was assessed by quantitative real-time PCR (qPCR) in 42 human CRC tumours. Abundance in primary tumour regions, normal epithelium and at metastatic sites was investigated in an expanded cohort of 51 patients. Species presence/absence was confirmed by diversity profiling in five patients. Correlation with total bacterial load and clinicopathological features was assessed.
Fusobacterium nucleatum and Bacteroides fragilis were detected in tumours from 43% and 24% of patients, respectively (17% positive for both species). The optimal detection site was the tumour luminal surface (TLS). Patients testing positive at the TLS frequently tested negative at other sites, including central tumour and invasive margin. F. nucleatum was detected at a higher frequency in tumour versus normal epithelium (p < 0.01) and was associated with more advanced disease (p = 0.01). Detection of both species correlated with total bacterial load. However, corroboration of qPCR results via diversity profiling suggests detection of these species may indicate a specific microbial signature.
This study supports a role for F. nucleatum in CRC development. Presence of F. nucleatum and B. fragilis varies across primary tumour regions, with the TLS representing the optimal site for bacterial detection.
Numerous cancers have been linked to microorganisms. Given that colorectal cancer is a leading cause of cancer deaths and the colon is continuously exposed to a high diversity of microbes, the ...relationship between gut mucosal microbiome and colorectal cancer needs to be explored. Metagenomic studies have shown an association between Fusobacterium species and colorectal carcinoma. Here, we have extended these studies with deeper sequencing of a much larger number (n = 130) of colorectal carcinoma and matched normal control tissues. We analyzed these data using co-occurrence networks in order to identify microbe-microbe and host-microbe associations specific to tumors.
We confirmed tumor over-representation of Fusobacterium species and observed significant co-occurrence within individual tumors of Fusobacterium, Leptotrichia and Campylobacter species. This polymicrobial signature was associated with over-expression of numerous host genes, including the gene encoding the pro-inflammatory chemokine Interleukin-8. The tumor-associated bacteria we have identified are all Gram-negative anaerobes, recognized previously as constituents of the oral microbiome, which are capable of causing infection. We isolated a novel strain of Campylobacter showae from a colorectal tumor specimen. This strain is substantially diverged from a previously sequenced oral Campylobacter showae isolate, carries potential virulence genes, and aggregates with a previously isolated tumor strain of Fusobacterium nucleatum.
A polymicrobial signature of Gram-negative anaerobic bacteria is associated with colorectal carcinoma tissue.
Frogs play important ecological roles, and several species are important model organisms for scientific research. The globally distributed Ranidae (true frogs) are the largest frog family, and have ...substantial evolutionary distance from the model laboratory Xenopus frog species. Unfortunately, there are currently no genomic resources for the former, important group of amphibians. More widely applicable amphibian genomic data is urgently needed as more than two-thirds of known species are currently threatened or are undergoing population declines. We report a 5.8 Gbp (NG50 = 69 kbp) genome assembly of a representative North American bullfrog (Rana Lithobates catesbeiana). The genome contains over 22,000 predicted protein-coding genes and 6,223 candidate long noncoding RNAs (lncRNAs). RNA-Seq experiments show thyroid hormone causes widespread transcriptional change among protein-coding and putative lncRNA genes. This initial bullfrog draft genome will serve as a key resource with broad utility including amphibian research, developmental biology, and environmental research.