Many animal species use a chromosome-based mechanism of sex determination, which has led to the coordinate evolution of dosage-compensation systems. Dosage compensation not only corrects the ...imbalance in the number of X chromosomes between the sexes but also is hypothesized to correct dosage imbalance within cells that is due to monoallelic X-linked expression and biallelic autosomal expression, by upregulating X-linked genes twofold (termed 'Ohno's hypothesis'). Although this hypothesis is well supported by expression analyses of individual X-linked genes and by microarray-based transcriptome analyses, it was challenged by a recent study using RNA sequencing and proteomics. We obtained new, independent RNA-seq data, measured RNA polymerase distribution and reanalyzed published expression data in mammals, C. elegans and Drosophila. Our analyses, which take into account the skewed gene content of the X chromosome, support the hypothesis of upregulation of expressed X-linked genes to balance expression of the genome.
We generated detailed RNA-seq data for the nematode Caenorhabditis elegans with high temporal resolution in the embryo as well as representative samples from post-embryonic stages across the life ...cycle. The data reveal that early and late embryogenesis is accompanied by large numbers of genes changing expression, whereas fewer genes are changing in mid-embryogenesis. This lull in genes changing expression correlates with a period during which histone mRNAs produce almost 40% of the RNA-seq reads. We find evidence for many more splice junctions than are annotated in WormBase, with many of these suggesting alternative splice forms, often with differential usage over the life cycle. We annotated internal promoter usage in operons using SL1 and SL2 data. We also uncovered correlated transcriptional programs that span >80 kb. These data provide detailed annotation of the C. elegans transcriptome.
We performed a genome-wide analysis of gene expression in C. elegans to identify germline- and sex-regulated genes. Using mutants that cause defects in germ cell proliferation or gametogenesis, we ...identified sets of genes with germline-enriched expression in either hermaphrodites or males, or in both sexes. Additionally, we compared gene expression profiles between males and hermaphrodites lacking germline tissue to define genes with sex-biased expression in terminally differentiated somatic tissues. Cross-referencing hermaphrodite germline and somatic gene sets with in situ hybridization data demonstrates that the vast majority of these genes have appropriate spatial expression patterns. Additionally, we examined gene expression at multiple times during wild-type germline development to define temporal expression profiles for these genes. Sex- and germline-regulated genes have a non-random distribution in the genome, with especially strong biases for and against the X chromosome. Comparison with data from large-scale RNAi screens demonstrates that genes expressed in the oogenic germline display visible phenotypes more frequently than expected.
The PMK-1 p38 mitogen-activated protein kinase pathway and the DAF-2-DAF-16 insulin signaling pathway control Caenorhabditis elegans intestinal innate immunity. pmk-1 loss-of-function mutants have ...enhanced sensitivity to pathogens, while daf-2 loss-of-function mutants have enhanced resistance to pathogens that requires upregulation of the DAF-16 transcription factor. We used genetic analysis to show that the pathogen resistance of daf-2 mutants also requires PMK-1. However, genome-wide microarray analysis indicated that there was essentially no overlap between genes positively regulated by PMK-1 and DAF-16, suggesting that they form parallel pathways to promote immunity. We found that PMK-1 controls expression of candidate secreted antimicrobials, including C-type lectins, ShK toxins, and CUB-like genes. Microarray analysis demonstrated that 25% of PMK-1 positively regulated genes are induced by Pseudomonas aeruginosa infection. Using quantitative PCR, we showed that PMK-1 regulates both basal and infection-induced expression of pathogen response genes, while DAF-16 does not. Finally, we used genetic analysis to show that PMK-1 contributes to the enhanced longevity of daf-2 mutants. We propose that the PMK-1 pathway is a specific, indispensable immunity pathway that mediates expression of secreted immune response genes, while the DAF-2-DAF-16 pathway appears to regulate immunity as part of a more general stress response. The contribution of the PMK-1 pathway to the enhanced lifespan of daf-2 mutants suggests that innate immunity is an important determinant of longevity.
Understanding the in vivo dynamics of protein localization and their physical interactions is important for many problems in biology. To enable systematic protein function interrogation in a ...multicellular context, we built a genome-scale transgenic platform for in vivo expression of fluorescent- and affinity-tagged proteins in Caenorhabditis elegans under endogenous cis regulatory control. The platform combines computer-assisted transgene design, massively parallel DNA engineering, and next-generation sequencing to generate a resource of 14,637 genomic DNA transgenes, which covers 73% of the proteome. The multipurpose tag used allows any protein of interest to be localized in vivo or affinity purified using standard tag-based assays. We illustrate the utility of the resource by systematic chromatin immunopurification and automated 4D imaging, which produced detailed DNA binding and cell/tissue distribution maps for key transcription factor proteins.
Display omitted
► A genome-wide resource for in vivo expression of tagged proteins was engineered ► The tagged gene alleles provide native protein expression and localization patterns ► Tag-based ChIP provides genome-wide DNA binding site maps for key transcription factors ► Live 4D tracing reveals rapid transcription factor protein localization dynamics
An engineered resource for tagged proteins recapitulates native expression and localization, enabling dynamic measurements of protein binding and function in C. elegans.
The wide variety of specialized permissive and repressive mechanisms by which germ cells regulate developmental gene expression are not well understood genome-wide. Isolation of germ cells with high ...integrity and purity from living animals is necessary to address these open questions, but no straightforward methods are currently available.
Here we present an experimental paradigm that permits the isolation of nuclei from C. elegans germ cells at quantities sufficient for genomic analyses. We demonstrate that these nuclei represent a very pure population and are suitable for both transcriptome analysis (RNA-seq) and chromatin immunoprecipitation (ChIP-seq) of histone modifications. From these data, we find unexpected germline- and soma-specific patterns of gene regulation.
This new capacity removes a major barrier in the field to dissect gene expression mechanisms in the germ line of C. elegans. Consequent discoveries using this technology will be relevant to conserved regulatory mechanisms across species.
Using massively parallel sequencing by synthesis methods, we have surveyed the polyA+ transcripts from four stages of the nematode Caenorhabditis elegans to an unprecedented depth. Using novel ...statistical approaches, we evaluated the coverage of annotated features of the genome and of candidate processed transcripts, including splice junctions, trans-spliced leader sequences, and polyadenylation tracts. The data provide experimental support for >85% of the annotated protein-coding transcripts in WormBase (WS170) and confirm additional details of processing. For example, the total number of confirmed splice junctions was raised from 70,911 to over 98,000. The data also suggest thousands of modifications to WormBase annotations and identify new spliced junctions and genes not part of any WormBase annotation, including at least 80 putative genes not found in any of three predicted gene sets. The quantitative nature of the data also suggests that mRNA levels may be measured by this approach with unparalleled precision. Although most sequences align with protein-coding genes, a small fraction falls in introns and intergenic regions. One notable region on the X chromosome encodes a noncoding transcript of >10 kb localized to somatic nuclei.
Despite the large evolutionary distances between metazoan species, they can show remarkable commonalities in their biology, and this has helped to establish fly and worm as model organisms for human ...biology. Although studies of individual elements and factors have explored similarities in gene regulation, a large-scale comparative analysis of basic principles of transcriptional regulatory features is lacking. Here we map the genome-wide binding locations of 165 human, 93 worm and 52 fly transcription regulatory factors, generating a total of 1,019 data sets from diverse cell types, developmental stages, or conditions in the three species, of which 498 (48.9%) are presented here for the first time. We find that structural properties of regulatory networks are remarkably conserved and that orthologous regulatory factor families recognize similar binding motifs in vivo and show some similar co-associations. Our results suggest that gene-regulatory properties previously observed for individual factors are general principles of metazoan regulation that are remarkably well-preserved despite extensive functional divergence of individual network connections. The comparative maps of regulatory circuitry provided here will drive an improved understanding of the regulatory underpinnings of model organism biology and how these relate to human biology, development and disease.
Regulation of gene expression by sequence-specific transcription factors is central to developmental programs and depends on the binding of transcription factors with target sites in the genome. To ...date, most such analyses in Caenorhabditis elegans have focused on the interactions between a single transcription factor with one or a few select target genes. As part of the modENCODE Consortium, we have used chromatin immunoprecipitation coupled with high-throughput DNA sequencing (ChIP-seq) to determine the genome-wide binding sites of 22 transcription factors (ALR-1, BLMP-1, CEH-14, CEH-30, EGL-27, EGL-5, ELT-3, EOR-1, GEI-11, HLH-1, LIN-11, LIN-13, LIN-15B, LIN-39, MAB-5, MDL-1, MEP-1, PES-1, PHA-4, PQM-1, SKN-1, and UNC-130) at diverse developmental stages. For each factor we determined candidate gene targets, both coding and non-coding. The typical binding sites of almost all factors are within a few hundred nucleotides of the transcript start site. Most factors target a mixture of coding and non-coding target genes, although one factor preferentially binds to non-coding RNA genes. We built a regulatory network among the 22 factors to determine their functional relationships to each other and found that some factors appear to act preferentially as regulators and others as target genes. Examination of the binding targets of three related HOX factors--LIN-39, MAB-5, and EGL-5--indicates that these factors regulate genes involved in cellular migration, neuronal function, and vulval differentiation, consistent with their known roles in these developmental processes. Ultimately, the comprehensive mapping of transcription factor binding sites will identify features of transcriptional networks that regulate C. elegans developmental processes.
While Caenorhabditis elegans specifically responds to infection by the up-regulation of certain genes, distinct pathogens trigger the expression of a common set of genes. We applied new methods to ...conduct a comprehensive and comparative study of the transcriptional response of C. elegans to bacterial and fungal infection. Using tiling arrays and/or RNA-sequencing, we have characterized the genome-wide transcriptional changes that underlie the host's response to infection by three bacterial (Serratia marcescens, Enterococcus faecalis and otorhabdus luminescens) and two fungal pathogens (Drechmeria coniospora and Harposporium sp.). We developed a flexible tool, the WormBase Converter (available at http://wormbasemanager.sourceforge.net/), to allow cross-study comparisons. The new data sets provided more extensive lists of differentially regulated genes than previous studies. Annotation analysis confirmed that genes commonly up-regulated by bacterial infections are related to stress responses. We found substantial overlaps between the genes regulated upon intestinal infection by the bacterial pathogens and Harposporium, and between those regulated by Harposporium and D. coniospora, which infects the epidermis. Among the fungus-regulated genes, there was a significant bias towards genes that are evolving rapidly and potentially encode small proteins. The results obtained using new methods reveal that the response to infection in C. elegans is determined by the nature of the pathogen, the site of infection and the physiological imbalance provoked by infection. They form the basis for future functional dissection of innate immune signaling. Finally, we also propose alternative methods to identify differentially regulated genes that take into account the greater variability in lowly expressed genes.