We introduce new statistical methods for analyzing genomic data sets that measure many effects in many conditions (for example, gene expression changes under many treatments). These new methods ...improve on existing methods by allowing for arbitrary correlations in effect sizes among conditions. This flexible approach increases power, improves effect estimates and allows for more quantitative assessments of effect-size heterogeneity compared to simple shared or condition-specific assessments. We illustrate these features through an analysis of locally acting variants associated with gene expression (cis expression quantitative trait loci (eQTLs)) in 44 human tissues. Our analysis identifies more eQTLs than existing approaches, consistent with improved power. We show that although genetic effects on expression are extensively shared among tissues, effect sizes can still vary greatly among tissues. Some shared eQTLs show stronger effects in subsets of biologically related tissues (for example, brain-related tissues), or in only one tissue (for example, testis). Our methods are widely applicable, computationally tractable for many conditions and available online.
Summary
Arabidopsis thaliana is a long established model species for plant molecular biology, genetics and genomics, and studies of A. thaliana gene function provide the basis for formulating ...hypotheses and designing experiments involving other plants, including economically important species. A comprehensive understanding of the A. thaliana genome and a detailed and accurate understanding of the expression of its associated genes is therefore of great importance for both fundamental research and practical applications. Such goal is reliant on the development of new genetic and genomic resources, involving new methods of data acquisition and analysis. We present here the genome‐wide analysis of A. thaliana gene expression profiles across different organs and developmental stages using high‐throughput transcriptome sequencing. The expression of 25 706 protein‐coding genes, as well as their stability and their spatiotemporal specificity, was assessed in 79 organs and developmental stages. A search for alternative splicing events identified 37 873 previously unreported splice junctions, approximately 30% of them occurred in intergenic regions. These potentially represent novel spliced genes that are not included in the TAIR10 database. These data are housed in an open‐access web‐based database, TraVA (Transcriptome Variation Analysis, http://travadb.org/), which allows visualization and analysis of gene expression profiles and differential gene expression between organs and developmental stages.
Significance Statement
The development and update of genetic and genomic resources, involving new methods of data acquisition and analysis, is necessary even for such long established model organisms such as Arabidopsis thaliana. We present a transcriptome map of A. thaliana created using high‐throughput RNA sequencing. It covers 79 organs and developmental stages; the analysis allowed the determination of global characteristics of the A. thaliana transcriptome and to add splice junctions and predicted genes to our current knowledge of the A. thaliana genome.
Significance Genes with different expression between males and females (sex-biased genes) show rapid rates of sequence and expression divergence in a range of taxa. These characteristics have led ...many to assume that sex-biased genes are the product of sexual selection and sexual conflict, but this assumption remains to be rigorously tested. Using a phylogenetically controlled analysis of birds that exhibit diverse levels of sexual selection, we show a rapid turnover in sex-biased gene expression primarily through evolution of male expression levels and that the degree of sexual selection predicts the proportion of male-biased genes but does not account for rates of coding sequence evolution. We also discuss the impact of allometry on gene expression studies, an issue rarely discussed in the literature.
The profound and pervasive differences in gene expression observed between males and females, and the unique evolutionary properties of these genes in many species, have led to the widespread assumption that they are the product of sexual selection and sexual conflict. However, we still lack a clear understanding of the connection between sexual selection and transcriptional dimorphism, often termed sex-biased gene expression. Moreover, the relative contribution of sexual selection vs. drift in shaping broad patterns of expression, divergence, and polymorphism remains unknown. To assess the role of sexual selection in shaping these patterns, we assembled transcriptomes from an avian clade representing the full range of sexual dimorphism and sexual selection. We use these species to test the links between sexual selection and sex-biased gene expression evolution in a comparative framework. Through ancestral reconstruction of sex bias, we demonstrate a rapid turnover of sex bias across this clade driven by sexual selection and show it to be primarily the result of expression changes in males. We use phylogenetically controlled comparative methods to demonstrate that phenotypic measures of sexual selection predict the proportion of male-biased but not female-biased gene expression. Although male-biased genes show elevated rates of coding sequence evolution, consistent with previous reports in a range of taxa, there is no association between sexual selection and rates of coding sequence evolution, suggesting that expression changes may be more important than coding sequence in sexual selection. Taken together, our results highlight the power of sexual selection to act on gene expression differences and shape genome evolution.
Rising demand for food and bioenergy makes it imperative to breed for increased crop yield. Vegetative plant growth could be driven by resource acquisition or developmental programs. Metabolite ...profiling in 94 Arabidopsis accessions revealed that biomass correlates negatively with many metabolites, especially starch. Starch accumulates in the light and is degraded at night to provide a sustained supply of carbon for growth. Multivariate analysis revealed that starch is an integrator of the overall metabolic response. We hypothesized that this reflects variation in a regulatory network that balances growth with the carbon supply. Transcript profiling in 21 accessions revealed coordinated changes of transcripts of more than 70 carbon-regulated genes and identified 2 genes (myo-inositol-1-phosphate synthase, a Kelch-domain protein) whose transcripts correlate with biomass. The impact of allelic variation at these 2 loci was shown by association mapping, identifying them as candidate lead genes with the potential to increase biomass production.
Advances in transcriptome sequencing allow for simultaneous interrogation of differentially expressed genes from multiple species originating from a single RNA sample, termed dual or multi-species ...transcriptomics. Compared to single-species differential expression analysis, the design of multi-species differential expression experiments must account for the relative abundances of each organism of interest within the sample, often requiring enrichment methods and yielding differences in total read counts across samples. The analysis of multi-species transcriptomics datasets requires modifications to the alignment, quantification, and downstream analysis steps compared to the single-species analysis pipelines. We describe best practices for multi-species transcriptomics and differential gene expression.
Histone acetylation and deacetylation play crucial roles in the modification of chromatin structure and regulation of gene expression in eukaryotes. Histone acetyltransferases (HATs) and histone ...deacetylases (HDACs) assist to maintain the balance of chromatin acetylation status. Previous studies showed that plant HDACs are key regulators involved in response to development and stresses. In this study, we examined the expression pattern and function of HDA705, a member of the RPD3/HDA1-type HDAC in rice. Overexpression of HDA705 in rice decreased ABA and salt stress resistance during seed germination. Delayed seed germination of HDA705 overexpression lines was associated with down-regulated expression of GA biosynthetic genes and up-regulation of ABA biosynthetic genes. Moreover, overexpression of HDA705 in rice enhanced osmotic stress resistance during the seedling stage. Our findings demonstrate that HDA705 may play a role in regulating seed germination and the response to abiotic stresses in rice.
•HDA705 is a member of the RPD3/HDA1-type histone deacetylases in rice.•Lower germination rates were detected for HDA705 OX seeds compared to the WT.•Overexpression of HDA705 decreases seed germination under ABA and salt stresses.•Overexpression of HDA705 enhance osmotic resistance during the seedling stage.
Differential mRNA expression studies implicitly assume that changes in mRNA expression have biological meaning, most likely mediated by corresponding changes in protein levels. Yet studies into ...mRNA-protein correspondence have shown notoriously poor correlation between mRNA and protein expression levels, creating concern for inferences from only mRNA expression data. However, none of these studies have examined in particular differentially expressed mRNA. Here, we examined this question in an ovarian cancer xenograft model. We measured protein and mRNA expression for twenty-nine genes in four drug-treatment conditions and in untreated controls. We identified mRNAs differentially expressed between drug-treated xenografts and controls, then analysed mRNA-protein expression correlation across a five-point time-course within each of the four experimental conditions. We evaluated correlations between mRNAs and their protein products for mRNAs differentially expressed within an experimental condition compared to those that are not. We found that differentially expressed mRNAs correlate significantly better with their protein product than non-differentially expressed mRNAs. This result increases confidence for the use of differential mRNA expression for biological discovery in this system, as well as providing optimism for the usefulness of inferences from mRNA expression in general.
We summarise various ways of performing dimensionality reduction on high-dimensional microarray data. Many different feature selection and feature extraction methods exist and they are being widely ...used. All these methods aim to remove redundant and irrelevant features so that classification of new instances will be more accurate. A popular source of data is microarrays, a biological platform for gathering gene expressions. Analysing microarrays can be difficult due to the size of the data they provide. In addition the complicated relations among the different genes make analysis more difficult and removing excess features can improve the quality of the results. We present some of the most popular methods for selecting significant features and provide a comparison between them. Their advantages and disadvantages are outlined in order to provide a clearer idea of when to use each one of them for saving computational time and resources.
Tomato (Solanum lycopersicum) is an established model for studying fruit biology; however, most studies of tomato fruit growth and ripening are based on homogenized pericarp, and do not consider the ...internal tissues, or the expression signatures of individual cell and tissue types. We present a spatiotemporally resolved transcriptome analysis of tomato fruit ontogeny, using laser microdissection (LM) or hand dissection coupled with RNA-Seq analysis. Regulatory and structural gene networks, including families of transcription factors and hormone synthesis and signaling pathways, are defined across tissue and developmental spectra. The ripening program is revealed as comprising gradients of gene expression, initiating in internal tissues then radiating outward, and basipetally along a latitudinal axis. We also identify spatial variations in the patterns of epigenetic control superimposed on ripening gradients. Functional studies elucidate previously masked regulatory phenomena and relationships, including those associated with fruit quality traits, such as texture, color, aroma, and metabolite profiles.
Although in vitro models have been a cornerstone of anti-cancer drug development, their direct applicability to clinical cancer research has been uncertain. Using a state-of-the-art Taqman-based ...quantitative RT-PCR assay, we investigated the multidrug resistance (MDR) transcriptome of six cancer types, in established cancer cell lines (grown in monolayer, 3D scaffold, or in xenograft) and clinical samples, either containing >75% tumor cells or microdissected. The MDR transcriptome was determined a priori based on an extensive curation of the literature published during the last three decades, which led to the enumeration of 380 genes. No correlation was found between clinical samples and established cancer cell lines. As expected, we found up-regulation of genes that would facilitate survival across all cultured cancer cell lines evaluated. More troubling, however, were data showing that all of the cell lines, grown either in vitro or in vivo, bear more resemblance to each other, regardless of the tissue of origin, than to the clinical samples they are supposed to model. Although cultured cells can be used to study many aspects of cancer biology and response of cells to drugs, this study emphasizes the necessity for new in vitro cancer models and the use of primary tumor models in which gene expression can be manipulated and small molecules tested in a setting that more closely mimics the in vivo cancer microenvironment so as to avoid radical changes in gene expression profiles brought on by extended periods of cell culture.