Systems biology relies on data sets in which the same group of proteins is consistently identified and precisely quantified across multiple samples, a requirement that is only partially achieved by ...current proteomics approaches. Selected reaction monitoring (SRM)—also called multiple reaction monitoring—is emerging as a technology that ideally complements the discovery capabilities of shotgun strategies by its unique potential for reliable quantification of analytes of low abundance in complex mixtures. In an SRM experiment, a predefined precursor ion and one of its fragments are selected by the two mass filters of a triple quadrupole instrument and monitored over time for precise quantification. A series of transitions (precursor/fragment ion pairs) in combination with the retention time of the targeted peptide can constitute a definitive assay. Typically, a large number of peptides are quantified during a single LC‐MS experiment. This tutorial explains the application of SRM for quantitative proteomics, including the selection of proteotypic peptides and the optimization and validation of transitions. Furthermore, normalization and various factors affecting sensitivity and accuracy are discussed.
The advent of next generation sequencing (NGS) has altered the face of genotyping the human leukocyte antigen (HLA) system in clinical, stem cell donor registry, and research contexts. NGS has led to ...a dramatically increased sequencing throughput at high accuracy, while being more time and cost efficient than precursor technologies. This has led to a broader and deeper profiling of the key genes in the human immunogenetic make-up. The rapid evolution of sequencing technologies is evidenced by the development of varied short-read sequencing platforms with differing read lengths and sequencing capacities to long-read sequencing platforms capable of profiling full genes without fragmentation. Concomitantly, there has been development of a diverse set of computational analyses and software tools developed to deal with the various strengths and limitations of the sequencing data generated by the different sequencing platforms. This review surveys the different modalities involved in generating NGS HLA profiling sequence data. It systematically describes various computational approaches that have been developed to achieve HLA genotyping to different degrees of resolution. At each stage, this review enumerates the drawbacks and advantages of each of the platforms and analysis approaches, thus providing a comprehensive picture of the current state of HLA genotyping technologies.
High resolution HLA genotyping of donors and recipients is a crucially important prerequisite for haematopoetic stem-cell transplantation and relies heavily on the quality and completeness of ...immunogenetic reference sequence databases of allelic variation.
Here, we report on DR2S, an R package that leverages the strengths of two sequencing technologies-the accuracy of next-generation sequencing with the read length of third-generation sequencing technologies like PacBio's SMRT sequencing or ONT's nanopore sequencing-to reconstruct fully-phased high-quality full-length haplotype sequences. Although optimised for HLA and KIR genes, DR2S is applicable to all loci with known reference sequences provided that full-length sequencing data is available for analysis. In addition, DR2S integrates supporting tools for easy visualisation and quality control of the reconstructed haplotype to ensure suitability for submission to public allele databases.
DR2S is a largely automated workflow designed to create high-quality fully-phased reference allele sequences for highly polymorphic gene regions such as HLA or KIR. It has been used by biologists to successfully characterise and submit more than 500 HLA alleles and more than 500 KIR alleles to the IPD-IMGT/HLA and IPD-KIR databases.
A close match of the HLA alleles between donor and recipient is an important prerequisite for successful unrelated hematopoietic stem cell transplantation. To increase the chances of finding an ...unrelated donor, registries recruit many hundred thousands of volunteers each year. Many registries with limited resources have had to find a trade-off between cost and resolution and extent of typing for newly recruited donors in the past. Therefore, we have taken advantage of recent improvements in NGS to develop a workflow for low-cost, high-resolution HLA typing.
We have established a straightforward three-step workflow for high-throughput HLA typing: Exons 2 and 3 of HLA-A, -B, -C, -DRB1, -DQB1 and -DPB1 are amplified by PCR on Fluidigm Access Array microfluidic chips. Illumina sequencing adapters and sample specific tags are directly incorporated during PCR. Upon pooling and cleanup, 384 samples are sequenced in a single Illumina MiSeq run. We developed "neXtype" for streamlined data analysis and HLA allele assignment. The workflow was validated with 1140 samples typed at 6 loci. All neXtype results were concordant with the Sanger sequences, demonstrating error-free typing of more than 6000 HLA loci. Current capacity in routine operation is 12,000 samples per week.
The workflow presented proved to be a cost-efficient alternative to Sanger sequencing for high-throughput HLA typing. Despite the focus on cost efficiency, resolution exceeds the current standards of Sanger typing for donor registration.
The detection and quantification of plasma (serum) proteins at or below the ng/ml concentration range are of critical importance for the discovery and evaluation of new protein biomarkers. This has ...been achieved either by the development of high sensitivity ELISA or other immunoassays for specific proteins or by the extensive fractionation of the plasma proteome followed by the mass spectrometric analysis of the resulting fractions. The first approach is limited by the high cost and time investment for assay development and the requirement of a validated target. The second, although reasonably comprehensive and unbiased, is limited by sample throughput. Here we describe a method for the detection of plasma proteins at concentrations in the ng/ml or sub-ng/ml range and their accurate quantification over 5 orders of magnitude. The method is based on the selective isolation of N-glycosites from the plasma proteome and the detection and quantification of targeted peptides in a quadrupole linear ion trap instrument operated in the multiple reaction monitoring (MRM) mode. The unprecedented sensitivity of the mass spectrometric analysis of minimally fractionated plasma samples is the result of the significantly reduced sample complexity of the isolated N-glycosites compared with whole plasma proteome digests and the selectivity of the MRM process. Precise quantification was achieved via stable isotope dilution by adding 13C- and/or 15N-labeled reference analytes. We also demonstrate the possibility of significantly expanding the number of MRM measurements during one single LC-MS run without compromising sensitivity by including elution time constraints for the targeted transitions, thus allowing quantification of large sets of peptides in a single analysis.
Homozygous carriers of CCR5-Δ32, a gene variant of CC-type chemokine receptor 5 (CCR5), are highly resistant to infections with human immunodeficiency virus type 1 (HIV-1) and therefore preferred ...stem cell donors for HIV-infected patients. We analyzed CCR5 typing data of 1,333,035 potential hematopoietic stem cell donors enlisted with three national DKMS donor centers. Allele and genotype frequencies were determined for 87 countries of origin as self-assessed by the donors. CCR5-Δ32 allele frequencies ranged from 16.4% in the Norwegian sample to 0 in donors from Ethiopia. The highest CCR5-Δ32/Δ32 genotype frequency was found in the sample from the Faroe Islands (2.3%), whereas in 27 samples, predominantly of donors from Africa, Asia and South America, none of the individuals carried this genotype. The characteristic CCR5-Δ32 allele frequency decline from Northern to Southeastern Eurasia supports findings of earlier studies. With available HLA haplotype frequency information for the patient’s ethnicity, our data allows upfront estimation of the probability that an HLA-matched donor with CCR5-Δ32/Δ32 genotype can be found for a patient in need of hematopoietic stem cell transplantation.
The killer-cell immunoglobulin-like receptor (KIR) complex on chromosome 19 encodes receptors that modulate the activity of natural killer cells, and variation in these genes has been linked to ...infectious and autoimmune disease, as well as having bearing on pregnancy and transplant outcomes. The medical relevance and high variability of KIR genes makes short-read sequencing an attractive technology for interrogating the region, providing a high-throughput, high-fidelity sequencing method that is cost-effective. However, because this gene complex is characterized by extensive nucleotide polymorphism, structural variation including gene fusions and deletions, and a high level of homology between genes, its interrogation at high resolution has been thwarted by bioinformatic challenges, with most studies limited to examining presence or absence of specific genes. Here, we present the PING (Pushing Immunogenetics to the Next Generation) pipeline, which incorporates empirical data, novel alignment strategies and a custom alignment processing workflow to enable high-throughput KIR sequence analysis from short-read data. PING provides KIR gene copy number classification functionality for all KIR genes through use of a comprehensive alignment reference. The gene copy number determined per individual enables an innovative genotype determination workflow using genotype-matched references. Together, these methods address the challenges imposed by the structural complexity and overall homology of the KIR complex. To determine copy number and genotype determination accuracy, we applied PING to European and African validation cohorts and a synthetic dataset. PING demonstrated exceptional copy number determination performance across all datasets and robust genotype determination performance. Finally, an investigation into discordant genotypes for the synthetic dataset provides insight into misaligned reads, advancing our understanding in interpretation of short-read sequencing data in complex genomic regions. PING promises to support a new era of studies of KIR polymorphism, delivering high-resolution KIR genotypes that are highly accurate, enabling high-quality, high-throughput KIR genotyping for disease and population studies.
Systems biology conceptualizes biological systems as dynamic networks of interacting elements, whereby functionally important properties are thought to emerge from the structure of such networks. ...Owing to the ubiquitous role of complexes of interacting proteins in biological systems, their subunit composition and temporal and spatial arrangement within the cell are of particular interest. 'Visual proteomics' attempts to localize individual macromolecular complexes inside of intact cells by template matching reference structures into cryo-electron tomograms. Here we combined quantitative mass spectrometry and cryo-electron tomography to detect, count and localize specific protein complexes in the cytoplasm of the human pathogen Leptospira interrogans. We describe a scoring function for visual proteomics and assess its performance and accuracy under realistic conditions. We discuss current and general limitations of the approach, as well as expected improvements in the future.
Several studies suggest that harnessing natural killer (NK) cell reactivity mediated through killer cell immunoglobulin-like receptors (KIRs) could reduce the risk of relapse after allogeneic ...hematopoietic cell transplantation. Based on one promising model, information on KIR2DS1 and KIR3DL1 and their cognate ligands can be used to classify donors as KIR-advantageous or KIR-disadvantageous. This study was aimed at externally validating this model in unrelated donor hematopoietic cell transplantation. The impact of the predictor on overall survival (OS) and relapse incidence was tested in a Cox regression model adjusted for patient age, a modified disease risk index, Karnofsky performance status, donor age, HLA match, sex match, cytomegalovirus match, conditioning intensity, type of T-cell depletion, and graft type. Data from 2222 patients with acute myeloid leukemia or myelodysplastic syndrome were analyzed. KIR genes were typed by using high-resolution amplicon-based next-generation sequencing. In univariable analyses and subgroup analyses, OS and the cumulative incidence of relapse of patients with a KIR-advantageous donor were comparable to patients with a KIR-disadvantageous donor. The adjusted hazard ratio from the multivariable Cox regression model was 0.99 (Wald test, P = .93) for OS and 1.04 (Wald test, P = .78) for relapse incidence. We also tested the impact of activating donor KIR2DS1 and inhibition by KIR3DL1 separately but found no significant impact on OS and the risk of relapse. Thus, our study shows that the proposed model does not universally predict NK-mediated disease control. Deeper knowledge of NK-mediated alloreactivity is necessary to predict its contribution to graft-versus-leukemia reactions and to eventually use KIR genotype information for donor selection.
•KIR2DS1/KIR3DL1 genotype-informed selection of unrelated stem cell donors is not ready for routine use.•Deeper knowledge of NK-mediated alloreactivity is necessary to predict and optimize its contribution to graft-versus-leukemia reactions.
Display omitted
A key barrier to the realization of personalized medicine for cancer is the identification of biomarkers. Here we describe a two-stage strategy for the discovery of serum biomarker signatures ...corresponding to specific cancer-causing mutations and its application to prostate cancer (PCa) in the context of the commonly occurring phosphatase and tensin homolog (PTEN) tumor-suppressor gene inactivation. In the first stage of our approach, we identified 775 N-linked glycoproteins from sera and prostate tissue of wild-type and Pten-null mice. Using label-free quantitative proteomics, we showed that Pten inactivation leads to measurable perturbations in the murine prostate and serum glycoproteome. Following bioinformatic prioritization, in a second stage we applied targeted proteomics to detect and quantify 39 human ortholog candidate biomarkers in the sera of PCa patients and control individuals. The resulting proteomic profiles were analyzed by machine learning to build predictive regression models for tissue PTEN status and diagnosis and grading of PCa. Our approach suggests a general path to rational cancer biomarker discovery and initial validation guided by cancer genetics and based on the integration of experimental mouse models, proteomics-based technologies, and computational modeling.