FDA proactively invests in tools to support innovation of emerging technologies, such as infectious disease next generation sequencing (ID-NGS). Here, we introduce FDA-ARGOS quality-controlled ...reference genomes as a public database for diagnostic purposes and demonstrate its utility on the example of two use cases. We provide quality control metrics for the FDA-ARGOS genomic database resource and outline the need for genome quality gap filling in the public domain. In the first use case, we show more accurate microbial identification of Enterococcus avium from metagenomic samples with FDA-ARGOS reference genomes compared to non-curated GenBank genomes. In the second use case, we demonstrate the utility of FDA-ARGOS reference genomes for Ebola virus target sequence comparison as part of a composite validation strategy for ID-NGS diagnostic tests. The use of FDA-ARGOS as an in silico target sequence comparator tool combined with representative clinical testing could reduce the burden for completing ID-NGS clinical trials.
To evaluate the utility of transcript profiling for prediction of protein expression levels, we compared profiles across the NCI-60 cancer cell panel, which represents nine tissues of origin. For ...that analysis, we present here two new NCI-60 transcript profile data sets (A based on Affymetrix HG-U95 and HG-U133A chips; Affymetrix, Santa Clara, CA) and one new protein profile data set (based on reverse-phase protein lysate arrays). The data sets are available online at http://discover.nci.nih.gov in the CellMiner program package. Using the new transcript data in combination with our previously published cDNA array and Affymetrix HU6800 data sets, we first developed a "consensus set" of transcript profiles based on the four different microarray platforms. Using that set, we found that 65% of the genes showed statistically significant transcript-protein correlation, and the correlations were generally higher than those reported previously for panels of mammalian cells. Using the predictive analysis of microarray nearest shrunken centroid algorithm for functional prediction of tissue of origin, we then found that (a) the consensus mRNA set did better than did data from any of the individual mRNA platforms and (b) the protein data seemed to do somewhat better (P = 0.027) on a gene-for-gene basis in this particular study than did the consensus mRNA data, but both did well. Analysis based on the Gene Ontology showed protein levels of structure-related genes to be well predicted by mRNA levels (mean r = 0.71). Because the transcript-based technologies are more mature and are currently able to assess larger numbers of genes at one time, they continue to be useful, even when the ultimate aim is information about proteins.
In an effort to develop a genomics-based approach to the prediction of drug response, we have developed an algorithm for classification of cell line chemosensitivity based on gene expression profiles ...alone. Using oligonucleotide microarrays, the expression levels of 6,817 genes were measured in a panel of 60 human cancer cell lines (the NCI-60) for which the chemosensitivity profiles of thousands of chemical compounds have been determined. We sought to determine whether the gene expression signatures of untreated cells were sufficient for the prediction of chemosensitivity. Gene expression-based classifiers of sensitivity or resistance for 232 compounds were generated and then evaluated on independent sets of data. The classifiers were designed to be independent of the cells' tissue of origin. The accuracy of chemosensitivity prediction was considerably better than would be expected by chance. Eighty-eight of 232 expression-based classifiers performed accurately (with P < 0.05) on an independent test set, whereas only 12 of the 232 would be expected to do so by chance. These results suggest that at least for a subset of compounds genomic approaches to chemosensitivity prediction are feasible.
The acceptance of microarray technology in regulatory decision-making is being challenged by the existence of various platforms and data analysis methods. A recent report (E. Marshall, Science, 306, ...630-631, 2004), by extensively citing the study of Tan et al. (Nucleic Acids Res., 31, 5676-5684, 2003), portrays a disturbingly negative picture of the cross-platform comparability, and, hence, the reliability of microarray technology.
We reanalyzed Tan's dataset and found that the intra-platform consistency was low, indicating a problem in experimental procedures from which the dataset was generated. Furthermore, by using three gene selection methods (i.e., p-value ranking, fold-change ranking, and Significance Analysis of Microarrays (SAM)) on the same dataset we found that p-value ranking (the method emphasized by Tan et al.) results in much lower cross-platform concordance compared to fold-change ranking or SAM. Therefore, the low cross-platform concordance reported in Tan's study appears to be mainly due to a combination of low intra-platform consistency and a poor choice of data analysis procedures, instead of inherent technical differences among different platforms, as suggested by Tan et al. and Marshall.
Our results illustrate the importance of establishing calibrated RNA samples and reference datasets to objectively assess the performance of different microarray platforms and the proficiency of individual laboratories as well as the merits of various data analysis procedures. Thus, we are progressively coordinating the MAQC project, a community-wide effort for microarray quality control.
L-Asparaginase (l-ASP), a bacterial enzyme used since the 1970s to treat acute lymphoblastic leukemia, selectively starves cells that cannot synthesize sufficient asparagine for their own needs. ...Molecular profiling of the NCI-60 cancer cell lines using five different microarray platforms showed strong negative correlations of asparagine synthetase (ASNS) expression and DNA copy number with sensitivity to l-ASP in the leukemia and ovarian cancer cell subsets. To assess whether the ovarian relationship is causal, we used RNA interference to silence ASNS in three ovarian lines and observed 4- to 5-fold potentiation of sensitivity to l-ASP with two of the lines. For OVCAR-8, the line that expresses the least ASNS, the potentiation was >500-fold. Significantly, that potentiation was >700-fold in the multidrug-resistant derivative OVCAR-8/ADR, showing that the causal relationship between ASNS expression and l-ASP activity survives development of classical multidrug resistance. Tissue microarrays confirmed low ASNS expression in a subset of clinical ovarian cancers as well as other tumor types. Overall, this pharmacogenomic/pharmacoproteomic study suggests the use of l-ASP for treatment of a subset of ovarian cancers (and perhaps other tumor types), with ASNS as a biomarker for patient selection.
E-cadherin (E-cad) is a transmembrane adhesion glycoprotein, the expression of which is often reduced in invasive or metastatic tumors. To assess E-cad's distribution among different types of cancer ...cells, we used bisulfite-sequencing for detailed, base-by-base measurement of CpG methylation in E-cad's promoter region in the NCI-60 cell lines. The mean methylation levels of the cell lines were distributed bimodally, with values pushed toward either the high or low end of the methylation scale. The 38 epithelial cell lines showed substantially lower (28%) mean methylation levels compared with the nonepithelial cell lines (58%). The CpG site at -143 with respect to the transcriptional start was commonly methylated at intermediate levels, even in cell lines with low overall DNA methylation. We also profiled the NCI-60 cell lines using Affymetrix U133 microarrays and found E-cad expression to be correlated with E-cad methylation at highly statistically significant levels. Above a threshold of approximately 20% to 30% mean methylation, the expression of E-cad was effectively silenced. Overall, this study provides a type of detailed analysis of methylation that can also be applied to other cancer-related genes. As has been shown in recent years, DNA methylation status can serve as a biomarker for use in choosing therapy.
In 2015, Zika virus (ZIKV) appeared as an emerging pathogen, generating a global and urgent need for accurate diagnostic devices. During this public health crisis, several nucleic acid testing ...(NAT)–based Zika assays were submitted to the US Food and Drug Administration (FDA) for Emergency Use Authorization. The FDA’s Center for Devices and Radiological Health, in collaboration with the FDA's Center for Biologics Evaluation and Research, responded to this Zika emergency by developing and producing a reference panel (RP) for Zika RNA (Zika FDA-RP) suitable for performance assessment of ZIKV NAT-based in vitro diagnostic devices. Reference panels are a fundamental tool for performance assessment of molecular tests. The panel is composed of five vials: two different heat-inactivated ZIKV strains (PRVABC59 and FSS13025) in concentrated stocks and three blinded concentrations prepared from those strains. The Zika FDA-RP was shared with developers who had devices in the final stages of validation. In vitro diagnostic developers tested the Zika FDA-RP using the FDA-provided protocol. Depending on sample type, 85% (12/14) of the NAT assays had analytical sensitivities between 500 and 5000 RNA NAT-detectable units/mL (NDUs/mL). One device showed better performance (100 NDUs/mL), and another one showed lower performance (10,000 to 30,000 NDUs/mL). Vials of the Zika FDA-RP are available on request to developers who have interacted with the FDA through the review process.
In this paper we report exploratory analyses of high-density oligonucleotide array data from the Affymetrix GeneChip system with the objective of improving upon currently used measures of gene ...expression. Our analyses make use of three data sets: a small experimental study consisting of five MGU74A mouse GeneChip arrays, part of the data from an extensive spike-in study conducted by Gene Logic and Wyeth's Genetics Institute involving 95 HG-U95A human GeneChip arrays; and part of a dilution study conducted by Gene Logic involving 75 HG-U95A GeneChip arrays. We display some familiar features of the perfect match and mismatch probe (PM and MM) values of these data, and examine the variance-mean relationship with probe-level data from probes believed to be defective, and so delivering noise only. We explain why we need to normalize the arrays to one another using probe level intensities. We then examine the behavior of the PM and MM using spike-in data and assess three commonly used summary measures: Affymetrix's (i) average difference (AvDiff) and (ii) MAS 5.0 signal, and (iii) the Li and Wong multiplicative model-based expression index (MBEI). The exploratory data analyses of the probe level data motivate a new summary measure that is a robust multi-array average (RMA) of background-adjusted, normalized, and log-transformed PM values. We evaluate the four expression summary measures using the dilution study data, assessing their behavior in terms of bias, variance and (for MBEI and RMA) model fit. Finally, we evaluate the algorithms in terms of their ability to detect known levels of differential expression using the spike-in data. We conclude that there is no obvious downside to using RMA and attaching a standard error (SE) to this quantity using a linear model which removes probe-specific affinities.
We have previously isolated by expression/selection cloning plasmids containing human cDNAs that rendered MCF-7 breast cancer cells resistant to immunotoxins, Pseudomonas exotoxin (PE), and ...diphtheria toxin (DT) Brinkmann et al. (1995) Mol. Med. 1, 206−216. Here we describe that one of these resistant plasmids, which contains an antisense cDNA fragment homologous to the yeast chromosome segregation gene CSE1 CAS; Brinkmann et al. (1995) Proc. Natl. Acad. Sci. U.S.A. 92, 10427−10431, reduces the intracellular content of the human CSE1 homologue CAS protein. CAS reduction confers resistance not only to the ADP-ribosylating toxins PE and DT, but also to tumor necrosis factor α and β. The resistance was observed as reduced apoptosis. CAS antisense did not affect the cell death induced by staurosporine, cycloheximide, or etoposide. The observation that CAS antisense can interfere with apoptosis mediated by TNF and ADP-ribosylating toxins suggests that CAS may play a role in selected pathways of apoptosis.