Advances in high-throughput sequencing technologies now allow for large-scale characterization of B cell immunoglobulin (Ig) repertoires. The high germline and somatic diversity of the Ig repertoire ...presents challenges for biologically meaningful analysis, which requires specialized computational methods. We have developed a suite of utilities, Change-O, which provides tools for advanced analyses of large-scale Ig repertoire sequencing data. Change-O includes tools for determining the complete set of Ig variable region gene segment alleles carried by an individual (including novel alleles), partitioning of Ig sequences into clonal populations, creating lineage trees, inferring somatic hypermutation targeting models, measuring repertoire diversity, quantifying selection pressure, and calculating sequence chemical properties. All Change-O tools utilize a common data format, which enables the seamless integration of multiple analyses into a single workflow.
Change-O is freely available for non-commercial use and may be downloaded from http://clip.med.yale.edu/changeo.
steven.kleinstein@yale.edu.
Individual variation in germline and expressed B-cell immunoglobulin (Ig) repertoires has been associated with aging, disease susceptibility, and differential response to infection and vaccination. ...Repertoire properties can now be studied at large-scale through next-generation sequencing of rearranged Ig genes. Accurate analysis of these repertoire-sequencing (Rep-Seq) data requires identifying the germline variable (V), diversity (D), and joining (J) gene segments used by each Ig sequence. Current V(D)J assignment methods work by aligning sequences to a database of known germline V(D)J segment alleles. However, existing databases are likely to be incomplete and novel polymorphisms are hard to differentiate from the frequent occurrence of somatic hypermutations in Ig sequences. Here we develop a Tool for Ig Genotype Elucidation via Rep-Seq (TIgGER). TIgGER analyzes mutation patterns in Rep-Seq data to identify novel V segment alleles, and also constructs a personalized germline database containing the specific set of alleles carried by a subject. This information is then used to improve the initial V segment assignments from existing tools, like IMGT/HighV-QUEST. The application of TIgGER to Rep-Seq data from seven subjects identified 11 novel V segment alleles, including at least one in every subject examined. These novel alleles constituted 13% of the total number of unique alleles in these subjects, and impacted 3% of V(D)J segment assignments. These results reinforce the highly polymorphic nature of human Ig V genes, and suggest that many novel alleles remain to be discovered. The integration of TIgGER into Rep-Seq processing pipelines will increase the accuracy of V segment assignments, thus improving B-cell repertoire analyses.
High-throughput immunoglobulin sequencing promises new insights into the somatic hypermutation and antigen-driven selection processes that underlie B-cell affinity maturation and adaptive immunity. ...The ability to estimate positive and negative selection from these sequence data has broad applications not only for understanding the immune response to pathogens, but is also critical to determining the role of somatic hypermutation in autoimmunity and B-cell cancers. Here, we develop a statistical framework for Bayesian estimation of Antigen-driven SELectIoN (BASELINe) based on the analysis of somatic mutation patterns. Our approach represents a fundamental advance over previous methods by shifting the problem from one of simply detecting selection to one of quantifying selection. Along with providing a more intuitive means to assess and visualize selection, our approach allows, for the first time, comparative analysis between groups of sequences derived from different germline V(D)J segments. Application of this approach to next-generation sequencing data demonstrates different selection pressures for memory cells of different isotypes. This framework can easily be adapted to analyze other types of DNA mutation patterns resulting from a mutator that displays hot/cold-spots, substitution preference or other intrinsic biases.
Abstract
Summary
Antibody haplotype inference (chromosomal phasing) may have clinical implications for the identification of genetic predispositions to diseases. Yet, our knowledge of the genomic ...loci encoding for the variable regions of the antibody is only partial, mostly due to the challenge of aligning short reads from genome sequencing to these highly repetitive loci. A powerful approach to infer the content of these loci relies on analyzing repertoires of rearranged V(D)J sequences. We present here RAbHIT, an R Haplotype Antibody Inference Tool, that implements a novel algorithm to infer V(D)J haplotypes by adapting a Bayesian framework. RAbHIT offers inference of haplotype and gene deletions. It may be applied to sequences from naïve and non-naïve B-cells, sequenced by different library preparation protocols.
Availability and implementation
RAbHIT is freely available for academic use from comprehensive R archive network (CRAN) (https://cran.r-project.org/web/packages/rabhit/) under CC BY-SA 4.0 license.
Supplementary information
Supplementary data are available at Bioinformatics online.
Analysis of antibody repertoires by high-throughput sequencing is of major importance in understanding adaptive immune responses. Our knowledge of variations in the genomic loci ...encoding immunoglobulin genes is incomplete, resulting in conflicting VDJ gene assignments and biased genotype and haplotype inference. Haplotypes can be inferred using IGHJ6 heterozygosity, observed in one third of the people. Here, we propose a robust novel method for determining VDJ haplotypes by adapting a Bayesian framework. Our method extends haplotype inference to IGHD- and IGHV-based analysis, enabling inference of deletions and copy number variations in the entire population. To test this method, we generated a multi-individual data set of naive B-cell repertoires, and found allele usage bias, as well as a mosaic, tiled pattern of deleted IGHD and IGHV genes. The inferred haplotypes may have clinical implications for genetic disease predispositions. Our findings expand the knowledge that can be extracted from antibody repertoire sequencing data.
Cancer immunotherapy has made enormous progress in offering safer and more effective treatments for the disease. Specifically, programmed death ligand 1 antibody (αPDL1), designed to perform immune ...checkpoint blockade (ICB), is now considered a pillar in cancer immunotherapy. However, due to the complexity and heterogeneity of tumors, as well as the diversity in patient response, ICB therapy only has a 30% success rate, at most; moreover, the efficacy of ICB can be evaluated only two months after start of treatment. Therefore, early identification of potential responders and nonresponders to therapy, using noninvasive means, is crucial for improving treatment decisions. Here, we report a straightforward approach for fast, image-guided prediction of therapeutic response to ICB. In a colon cancer mouse model, we demonstrate that the combination of computed tomography imaging and gold nanoparticles conjugated to αPDL1 allowed prediction of therapeutic response, as early as 48 h after treatment. This was achieved by noninvasive measurement of nanoparticle accumulation levels within the tumors. Moreover, we show that the nanoparticles efficiently prevented tumor growth with only a fifth of the standard dosage of clinical care. This technology may be developed into a powerful tool for early and noninvasive patient stratification as responders or nonresponders.
Computational modeling of signal propagation in neurons is critical to our understanding of basic principles underlying brain organization and activity. Exploring these models is used to address ...basic neuroscience questions as well as to gain insights for clinical applications. The seminal Hodgkin Huxley model is a common theoretical framework to study brain activity. It was mainly used to investigate the electrochemical and physical properties of neurons. The influence of neuronal structure on activity patterns was explored, however, the rich dynamics observed in neurons with different morphologies is not yet fully understood. Here, we study signal propagation in fundamental building blocks of neuronal branching trees, unbranched and branched axons. We show how these simple axonal elements can code information on spike trains, and how asymmetric responses can emerge in axonal branching points. This asymmetric phenomenon has been observed experimentally but until now lacked theoretical characterization. Together, our results suggest that axonal morphological parameters are instrumental in activity modulation and information coding. The insights gained from this work lay the ground for better understanding the interplay between function and form in real-world complex systems. It may also supply theoretical basis for the development of novel therapeutic approaches to damaged nervous systems.
Driven by dramatic technological improvements, large-scale characterization of lymphocyte receptor repertoires via high-throughput sequencing is now feasible. Although promising, the high germline ...and somatic diversity, especially of B-cell immunoglobulin repertoires, presents challenges for analysis requiring the development of specialized computational pipelines. We developed the REpertoire Sequencing TOolkit (pRESTO) for processing reads from high-throughput lymphocyte receptor studies. pRESTO processes raw sequences to produce error-corrected, sorted and annotated sequence sets, along with a wealth of metrics at each step. The toolkit supports multiplexed primer pools, single- or paired-end reads and emerging technologies that use single-molecule identifiers. pRESTO has been tested on data generated from Roche and Illumina platforms. It has a built-in capacity to parallelize the work between available processors and is able to efficiently process millions of sequences generated by typical high-throughput projects.
pRESTO is freely available for academic use. The software package and detailed tutorials may be downloaded from http://clip.med.yale.edu/presto.
The long lasting debate initiated by Gilovich, Vallone and Tversky in Formula: see text is revisited: does a "hot hand" phenomenon exist in sports? Hereby we come back to one of the cases analyzed by ...the original study, but with a much larger data set: all free throws taken during five regular seasons (Formula: see text) of the National Basketball Association (NBA). Evidence supporting the existence of the "hot hand" phenomenon is provided. However, while statistical traces of this phenomenon are observed in the data, an open question still remains: are these non random patterns a result of "success breeds success" and "failure breeds failure" mechanisms or simply "better" and "worse" periods? Although free throws data is not adequate to answer this question in a definite way, we speculate based on it, that the latter is the dominant cause behind the appearance of the "hot hand" phenomenon in the data.
The increasing worldwide prevalence of Hepatocellular carcinoma (HCC), characterized by resistance to conventional chemotherapy, poor prognosis and eventually mortality, place it as a prime target ...for new modes of prevention and treatment. Hepatitis C Virus (HCV) is the predominant risk factor for HCC in the US and Europe. Multiple epidemiological studies showed that sustained virological responses (SVR) following treatment with the powerful direct acting antivirals (DAAs), which have replaced interferon-based regimes, do not eliminate tumor development. We aimed to identify an HCV-specific pathogenic mechanism that persists post SVR following DAAs treatment. We demonstrate that HCV infection induces genome-wide epigenetic changes by performing chromatin immunoprecipitation followed by next-generation sequencing (ChIP-seq) for histone post-translational modifications that are epigenetic markers for active and repressed chromatin. The changes in histone modifications correlate with reprogramed host gene expression and alter signaling pathways known to be associated with HCV life cycle and HCC. These epigenetic alterations require the presence of HCV RNA or/and expression of the viral proteins in the cells. Importantly, the epigenetic changes induced following infection persist as an "epigenetic signature" after virus eradication by DAAs treatment, as detected using in vitro HCV infection models. These observations led to the identification of an 8 gene signature that is associated with HCC development and demonstrate persistent epigenetic alterations in HCV infected and post SVR liver biopsy samples. The epigenetic signature was reverted in vitro by drugs that inhibit epigenetic modifying enzyme and by the EGFR inhibitor, Erlotinib. This epigenetic "scarring" of the genome, persisting following HCV eradication, suggest a novel mechanism for the persistent pathogenesis of HCV after its eradication by DAAs. Our study offers new avenues for prevention of the persistent oncogenic effects of chronic hepatitis infections using specific drugs to revert the epigenetic changes to the genome.