To characterize the role of the circadian clock in mouse physiology and behavior, we used RNA-seq and DNA arrays to quantify the transcriptomes of 12 mouse organs over time. We found 43% of all ...protein coding genes showed circadian rhythms in transcription somewhere in the body, largely in an organ-specific manner. In most organs, we noticed the expression of many oscillating genes peaked during transcriptional "rush hours" preceding dawn and dusk. Looking at the genomic landscape of rhythmic genes, we saw that they clustered together, were longer, and had more spiiceforms than nonoscillating genes. Systems-level analysis revealed intricate rhythmic orchestration of gene pathways throughout the body. We also found oscillations in the expression of more than 1,000 known and novel noncoding RNAs (ncRNAs). Supporting their potential role in mediating clock function, ncRNAs conserved between mouse and human showed rhythmic expression in similar proportions as protein coding genes. Importantly, we also found that the majority of best-selling drugs and World Health Organization essential medicines directly target the products of rhythmic genes. Many of these drugs have short half-lives and may benefit from timed dosage. In sum, this study highlights critical, systemic, and surprising roles of the mammalian circadian clock and provides a blueprint for advancement in chronotherapy.
Circadian disruption has multiple pathological consequences, but the underlying mechanisms are largely unknown. To address such mechanisms, we subjected transformed cultured cells to chronic ...circadian desynchrony (CCD), mimicking a chronic jet-lag scheme, and assayed a range of cellular functions. The results indicated a specific circadian clock-dependent increase in cell proliferation. Transcriptome analysis revealed up-regulation of G1/S phase transition genes (myelocytomatosis oncogene cellular homolog Myc, cyclin D1/3, chromatin licensing and DNA replication factor 1 Cdt1), concomitant with increased phosphorylation of the retinoblastoma (RB) protein by cyclin-dependent kinase (CDK) 4/6 and increased G1-S progression. Phospho-RB (Ser807/811) was found to oscillate in a circadian fashion and exhibit phase-shifted rhythms in circadian desynchronized cells. Consistent with circadian regulation, a CDK4/6 inhibitor approved for cancer treatment reduced growth of cultured cells and mouse tumors in a time-of-day-specific manner. Our study identifies a mechanism that underlies effects of circadian disruption on tumor growth and underscores the use of treatment timed to endogenous circadian rhythms.
CircaDB (http://circadb.org) is a new database of circadian transcriptional profiles from time course expression experiments from mice and humans. Each transcript's expression was evaluated by three ...separate algorithms, JTK_Cycle, Lomb Scargle and DeLichtenberg. Users can query the gene annotations using simple and powerful full text search terms, restrict results to specific data sets and provide probability thresholds for each algorithm. Visualizations of the data are intuitive charts that convey profile information more effectively than a table of probabilities. The CircaDB web application is open source and available at http://github.com/itmat/circadb.
The blood-brain barrier (BBB) is critical for neural function. We report here circadian regulation of the BBB in mammals. Efflux of xenobiotics by the BBB oscillates in mice, with highest levels ...during the active phase and lowest during the resting phase. This oscillation is abrogated in circadian clock mutants. To elucidate mechanisms of circadian regulation, we profiled the transcriptome of brain endothelial cells; interestingly, we detected limited circadian regulation of transcription, with no evident oscillations in efflux transporters. We recapitulated the cycling of xenobiotic efflux using a human microvascular endothelial cell line to find that the molecular clock drives cycling of intracellular magnesium through transcriptional regulation of TRPM7, which appears to contribute to the rhythm in efflux. Our findings suggest that considering circadian regulation may be important when therapeutically targeting efflux transporter substrates to the CNS.
Alternative splicing (AS) can critically affect gene function and disease, yet mapping splicing variations remains a challenge. Here, we propose a new approach to define and quantify mRNA splicing in ...units of local splicing variations (LSVs). LSVs capture previously defined types of alternative splicing as well as more complex transcript variations. Building the first genome wide map of LSVs from twelve mouse tissues, we find complex LSVs constitute over 30% of tissue dependent transcript variations and affect specific protein families. We show the prevalence of complex LSVs is conserved in humans and identify hundreds of LSVs that are specific to brain subregions or altered in Alzheimer's patients. Amongst those are novel isoforms in the Camk2 family and a novel poison exon in Ptbp1, a key splice factor in neurogenesis. We anticipate the approach presented here will advance the ability to relate tissue-specific splice variation to genetic variation, phenotype, and disease.
Physiological and behavioral circadian rhythms are driven by a conserved transcriptional/translational negative feedback loop in mammals. Although most core clock factors are transcription factors, ...post-transcriptional control introduces delays that are critical for circadian oscillations. Little work has been done on circadian regulation of translation, so to address this deficit we conducted ribosome profiling experiments in a human cell model for an autonomous clock. We found that most rhythmic gene expression occurs with little delay between transcription and translation, suggesting that the lag in the accumulation of some clock proteins relative to their mRNAs does not arise from regulated translation. Nevertheless, we found that translation occurs in a circadian fashion for many genes, sometimes imposing an additional level of control on rhythmically expressed mRNAs and, in other cases, conferring rhythms on noncycling mRNAs. Most cyclically transcribed RNAs are translated at one of two major times in a 24-h day, while rhythmic translation of most noncyclic RNAs is phased to a single time of day. Unexpectedly, we found that the clock also regulates the formation of cytoplasmic processing (P) bodies, which control the fate of mRNAs, suggesting circadian coordination of mRNA metabolism and translation.
Full-length isoform quantification from RNA-Seq is a key goal in transcriptomics analyses and has been an area of active development since the beginning. The fundamental difficulty stems from the ...fact that RNA transcripts are long, while RNA-Seq reads are short. Here we use simulated benchmarking data that reflects many properties of real data, including polymorphisms, intron signal and non-uniform coverage, allowing for systematic comparative analyses of isoform quantification accuracy and its impact on differential expression analysis. Genome, transcriptome and pseudo alignment-based methods are included; and a simple approach is included as a baseline control. Salmon, kallisto, RSEM, and Cufflinks exhibit the highest accuracy on idealized data, while on more realistic data they do not perform dramatically better than the simple approach. We determine the structural parameters with the greatest impact on quantification accuracy to be length and sequence compression complexity and not so much the number of isoforms. The effect of incomplete annotation on performance is also investigated. Overall, the tested methods show sufficient divergence from the truth to suggest that full-length isoform quantification and isoform level DE should still be employed selectively.
Natural silks crafted by spiders comprise some of the most versatile materials known. Artificial silks-based on the sequences of their natural brethren-replicate some desirable biophysical properties ...and are increasingly utilized in commercial and medical applications today. To characterize the repertoire of protein sequences giving silks their biophysical properties and to determine the set of expressed genes across each unique silk gland contributing to the formation of natural silks, we report here draft genomic and transcriptomic assemblies of Darwin's bark spider, Caerostris darwini, an orb-weaving spider whose dragline is one of the toughest known biomaterials on Earth. We identify at least 31 putative spidroin genes, with expansion of multiple spidroin gene classes relative to the golden orb-weaver, Trichonephila clavipes. We observed substantial sharing of spidroin repetitive sequence motifs between species as well as new motifs unique to C. darwini. Comparative gene expression analyses across six silk gland isolates in females plus a composite isolate of all silk glands in males demonstrated gland and sex-specific expression of spidroins, facilitating putative assignment of novel spidroin genes to classes. Broad expression of spidroins across silk gland types suggests that silks emanating from a given gland represent composite materials to a greater extent than previously appreciated. We hypothesize that the extraordinary toughness of C. darwini major ampullate dragline silk may relate to the unique protein composition of major ampullate spidroins, combined with the relatively high expression of stretchy flagelliform spidroins whose union into a single fiber may be aided by novel motifs and cassettes that act as molecule-binding helices. Our assemblies extend the catalog of sequences and sets of expressed genes that confer the unique biophysical properties observed in natural silks.
A critical task in high-throughput sequencing is aligning millions of short reads to a reference genome. Alignment is especially complicated for RNA sequencing (RNA-Seq) because of RNA splicing. A ...number of RNA-Seq algorithms are available, and claim to align reads with high accuracy and efficiency while detecting splice junctions. RNA-Seq data are discrete in nature; therefore, with reasonable gene models and comparative metrics RNA-Seq data can be simulated to sufficient accuracy to enable meaningful benchmarking of alignment algorithms. The exercise to rigorously compare all viable published RNA-Seq algorithms has not been performed previously.
We developed an RNA-Seq simulator that models the main impediments to RNA alignment, including alternative splicing, insertions, deletions, substitutions, sequencing errors and intron signal. We used this simulator to measure the accuracy and robustness of available algorithms at the base and junction levels. Additionally, we used reverse transcription-polymerase chain reaction (RT-PCR) and Sanger sequencing to validate the ability of the algorithms to detect novel transcript features such as novel exons and alternative splicing in RNA-Seq data from mouse retina. A pipeline based on BLAT was developed to explore the performance of established tools for this problem, and to compare it to the recently developed methods. This pipeline, the RNA-Seq Unified Mapper (RUM), performs comparably to the best current aligners and provides an advantageous combination of accuracy, speed and usability.
The RUM pipeline is distributed via the Amazon Cloud and for computing clusters using the Sun Grid Engine (http://cbil.upenn.edu/RUM).
ggrant@pcbi.upenn.edu; epierce@mail.med.upenn.edu
The RNA-Seq sequence reads described in the article are deposited at GEO, accession GSE26248.
Many chronic disease symptomatologies involve desynchronized sleep-wake cycles, indicative of disrupted biorhythms. This can be interrogated using body temperature rhythms, which have circadian as ...well as sleep-wake behavior/environmental evoked components. Here, we investigated the association of wrist temperature amplitudes with a future onset of disease in the UK Biobank one year after actigraphy. Among 425 disease conditions (range n = 200-6728) compared to controls (range n = 62,107-91,134), a total of 73 (17%) disease phenotypes were significantly associated with decreased amplitudes of wrist temperature (Benjamini-Hochberg FDR q < 0.05) and 26 (6.1%) PheCODEs passed a more stringent significance level (Bonferroni-correction α < 0.05). A two-standard deviation (1.8° Celsius) lower wrist temperature amplitude corresponded to hazard ratios of 1.91 (1.58-2.31 95% CI) for NAFLD, 1.69 (1.53-1.88) for type 2 diabetes, 1.25 (1.14-1.37) for renal failure, 1.23 (1.17-1.3) for hypertension, and 1.22 (1.11-1.33) for pneumonia (phenome-wide atlas available at http://bioinf.itmat.upenn.edu/biorhythm_atlas/ ). This work suggests peripheral thermoregulation as a digital biomarker.