The National Cancer Institute conducted the Biospecimen Pre-analytical Variables (BPV) study to determine the effects of formalin fixation and delay to fixation (DTF) on the analysis of nucleic ...acids. By performing whole transcriptome sequencing and small RNA profiling on matched snap-frozen and FFPE specimens exposed to different delays to fixation, this study aimed to determine acceptable delays to fixation and proper workflow for accurate and reliable Next-Generation Sequencing (NGS) analysis of FFPE specimens. In comparison to snap-freezing, formalin fixation changed the relative proportions of intronic/exonic/untranslated RNA captured by RNA-seq for most genes. The effects of DTF on NGS analysis were negligible. In 80% of specimens, a subset of RNAs was found to differ between snap-frozen and FFPE specimens in a consistent manner across tissue groups; this subset was unaffected in the remaining 20% of specimens. In contrast, miRNA expression was generally stable across various formalin fixation protocols, but displayed increased variability following a 12 h delay to fixation.
Nicotine is a known risk factor for cancer development and has been shown to alter gene expression in cells and tissue upon exposure. We used Illumina® Next Generation Sequencing (NGS) technology to ...gain unbiased biological insight into the transcriptome of normal epithelial cells (MCF-10A) to nicotine exposure. We generated expression data from 54,699 transcripts using triplicates of control and nicotine stressed cells. As a result, we identified 138 differentially expressed transcripts, including 39 uncharacterized genes. Additionally, 173 transcripts that are primarily associated with DNA replication, recombination, and repair showed evidence for alternative splicing. We discovered the greatest nicotine stress response by HPCAL4 (up-regulated by 4.71 fold) and NPAS3 (down-regulated by -2.73 fold); both are genes that have not been previously implicated in nicotine exposure but are linked to cancer. We also discovered significant down-regulation (-2.3 fold) and alternative splicing of NEAT1 (lncRNA) that may have an important, yet undiscovered regulatory role. Gene ontology analysis revealed nicotine exposure influenced genes involved in cellular and metabolic processes. This study reveals previously unknown consequences of nicotine stress on the transcriptome of normal breast epithelial cells and provides insight into the underlying biological influence of nicotine on normal cells, marking the foundation for future studies.
Androgen signaling through androgen receptor (AR) is critical for prostate tumorigenesis. Given that AR-mediated gene regulation is enhanced by AR coregulators, inactivation of those coregulators is ...emerging as a promising therapy for prostate cancer (PCa). Here, we show that the N-acetyltransferase arrest-defect 1 protein (ARD1) functions as a unique AR regulator in PCa cells. ARD1 is up-regulated in human PCa cell lines and primary tumor biopsies. The expression of ARD1 was augmented by treatment with synthetic androgen (R1881) unless AR is deficient or is inhibited by AR-specific siRNA or androgen inhibitor bicalutamide (Casodex). Depletion of ARD1 by shRNA suppressed PCa cell proliferation, anchorage-independent growth, and xenograft tumor formation in SCID mice, suggesting that AR-dependent ARD1 expression is biologically germane. Notably, ARD1 was critical for transcriptionally regulating a number of AR target genes that are involved in prostate tumorigenesis. Furthermore, ARD1 interacted physically with and acetylated the AR protein in vivo and in vitro. Because AR–ARD1 interaction facilitated the AR binding to its targeted promoters for gene transcription, we propose that ARD1 functions as a unique AR regulator and forms a positive feedback loop for AR-dependent prostate tumorigenesis. Disruption of AR–ARD1 interactions may be a potent intervention for androgen-dependent PCa therapy.
Although the connection between cancer and cigarette smoke is well established, nicotine is not characterized as a carcinogen. Here, we used exome sequencing to identify nicotine and oxidative ...stress-induced somatic mutations in normal human epithelial cells and its correlation with cancer. We identified over 6,400 SNVs, indels and microsatellites in each of the stress exposed cells relative to the control, of which, 2,159 were consistently observed at all nicotine doses. These included 429 nsSNVs including 158 novel and 79 cancer-associated. Over 80% of consistently nicotine induced variants overlap with variations detected in oxidative stressed cells, indicating that nicotine induced genomic alterations could be mediated through oxidative stress. Nicotine induced mutations were distributed across 1,585 genes, of which 49% were associated with cancer. MUC family genes were among the top mutated genes. Analysis of 591 lung carcinoma tumor exomes from The Cancer Genome Atlas (TCGA) revealed that 20% of non-small-cell lung cancer tumors in smokers have mutations in at least one of the MUC4, MUC6 or MUC12 genes in contrast to only 6% in non-smokers. These results indicate that nicotine induces genomic variations, promotes instability potentially mediated by oxidative stress, implicating nicotine in carcinogenesis, and establishes MUC genes as potential targets.
Several studies have demonstrated that unmapped reads in next generation sequencing data could be used to identify infectious agents or structural variants, but there has been no intensive effort to ...analyze and classify all non-human sequences found in individual large data sets. To identify commonality in non-human sequences by infectious agents and putative contamination events, we analyzed non-human sequences in 150 genomic sequencing data files from the 1000 Genomes Project and observed that 0.13% of reads on average showed similarities to non-human genomes. We compared results among different sample groups divided based on ethnicities, sequencing centers and enrichment methods (whole genome sequencing vs. exome sequencing) and found that sequencing centers had specific signatures of contaminating genomes as ‘time stamps’. We also observed many unmapped reads that falsely indicated contamination because of the high similarity of human sequences to sequences in non-human genome assemblies such as mouse and Nicotiana.
•An analysis of non-human sequences in sequencing data from the 1000 Genomes Project•Identification of commonality in non-human sequences in human genome sequencing data•Comparison of different ethnicities, sequencing centers and sequence methods
Simple tandem repeats are highly variable genetic elements and widespread in genomes of many organisms. Next-generation sequencing technologies have enabled a robust comparison of large numbers of ...simple tandem repeat loci; however, analysis of their variation using traditional sequence analysis approaches still remains limiting and problematic due to variants occurring in repeat sequences confusing alignment programs into mapping sequence reads to incorrect loci when the sequence reads are significantly different from the reference sequence.
We have developed a program, ReviSTER, which is an automated pipeline using a 'local mapping reference reconstruction method' to revise mismapped or partially misaligned reads at simple tandem repeat loci. RevisSTER estimates alleles of repeat loci using a local alignment method and creates temporary local mapping reference sequences, and finally remaps reads to the local mapping references. Using this approach, ReviSTER was able to successfully revise reads misaligned to repeat loci from both simulated data and real data.
ReviSTER is open-source software available at http://revister.sourceforge.net.
garner@vbi.vt.edu
Supplementary data are available at Bioinformatics online.
A singular genome used for inference into population-based studies is a standard method in genomics. Recent studies show that spontaneous genomic variants can propagate into new generations and these ...changes can contribute to individual cell aging with environmental and evolutionary elements contributing to cumulative genomic variation. However, the contribution of aging to genomic changes in tissue samples remains uncharacterized. Here, we report the impact of aging on individual human exomes and their implications. We found the human genome to be dynamic, acquiring a varying number of mutations with age (5,000 to 50,000 in 9 to 16 years). This equates to a variation rate of 9.6x10(-7) to 8.4x10(-6) bp(-1) year(-1) for nonsynonymous single nucleotide variants and 2.0x10(-4) to 1.0x10(-3) locus(-1) year(-1) for microsatellite loci in these individuals. These mutations span across 3,000 to 13,000 genes, which commonly showed association with Wnt signaling and Gonadotropin releasing hormone receptor pathways, and indicated for individuals a specific and significant enrichment for increased risk for diabetes, kidney failure, cancer, Rheumatoid arthritis, and Alzheimer's disease--conditions usually associated with aging. The results suggest that "age" is an important variable while analyzing an individual human genome to extract individual-specific clinically significant information necessary for personalized genomics.
Inflammation and immune activation of the cervicovaginal mucosa are considered factors that increase susceptibility to HIV infection. Therefore, it is essential to screen candidate anti-HIV ...microbicides for potential mucosal immunomodulatory/inflammatory effects prior to further clinical development. The goal of this study was to develop an in vitro method for preclinical evaluation of the inflammatory potential of new candidate microbicides using a microarray gene expression profiling strategy.
To this end, we compared transcriptomes of human vaginal cells (Vk2/E6E7) treated with well-characterized pro-inflammatory (PIC) and non-inflammatory (NIC) compounds. PICs included compounds with different mechanisms of action. Gene expression was analyzed using Affymetrix U133 Plus 2 arrays. Data processing was performed using GeneSpring 11.5 (Agilent Technologies, Santa Clara, CA).
Microarraray comparative analysis allowed us to generate a panel of 20 genes that were consistently deregulated by PICs compared to NICs, thus distinguishing between these two groups. Functional analysis mapped 14 of these genes to immune and inflammatory responses. This was confirmed by the fact that PICs induced NFkB pathway activation in Vk2 cells. By testing microbicide candidates previously characterized in clinical trials we demonstrated that the selected PIC-associated genes properly identified compounds with mucosa-altering effects. The discriminatory power of these genes was further demonstrated after culturing vaginal cells with vaginal bacteria. Prevotella bivia, prevalent bacteria in the disturbed microbiota of bacterial vaginosis, induced strong upregulation of seven selected PIC-associated genes, while a commensal Lactobacillus gasseri associated to vaginal health did not cause any changes.
In vitro evaluation of the immunoinflammatory potential of microbicides using the PIC-associated genes defined in this study could help in the initial screening of candidates prior to entering clinical trials. Additional characterization of these genes can provide further insight into the cervicovaginal immunoinflammatory and mucosal-altering processes that facilitate or limit HIV transmission with implications for the design of prevention strategies.