A universal feature of metazoan sexual development is the generation of oocyte P granules that withhold certain mRNA species from translation to provide coding potential for proteins during early post-fertilization development. Stabilisation of translationally quiescent mRNA pools in female Plasmodium gametocytes depends on the RNA helicase DOZI, but the molecular machinery involved in the silencing of transcripts in these protozoans is unknown. Using affinity purification coupled with mass-spectrometric analysis we identify a messenger ribonucleoprotein (mRNP) from Plasmodium berghei gametocytes defined by DOZI and the Sm-like factor CITH (homolog of worm CAR-I and fly Trailer Hitch). This mRNP includes 16 major factors, including proteins with homologies to components of metazoan P granules and archaeal proteins. Containing translationally silent transcripts, this mRNP integrates eIF4E and poly(A)-binding protein but excludes P body RNA degradation factors and translation-initiation promoting eIF4G. Gene deletion mutants of two core components of this mRNP (DOZI and CITH) are fertilization-competent, but zygotes fail to develop into ookinetes in a female gametocyte-mutant fashion. Through RNA-immunoprecipitation and global expression profiling of CITH-KO mutants we highlight CITH as a crucial repressor of maternally supplied mRNAs. Our data define Plasmodium P granules as an ancient mRNP whose protein core has remained evolutionarily conserved from single-cell organisms to germ cells of multi-cellular animals and stores translationally silent mRNAs that are critical for early post-fertilization development during the initial stages of mosquito infection. Therefore, translational repression may offer avenues as a target for the generation of transmission blocking strategies and contribute to limiting the spread of malaria.
The three RAS genes - HRAS, NRAS and KRAS - are collectively mutated in one-third of human cancers, where they act as prototypic oncogenes. Interestingly, there are rather distinct patterns to RAS mutations; the isoform mutated as well as the position and type of substitution vary between different cancers. As RAS genes are among the earliest, if not the first, genes mutated in a variety of cancers, understanding how these mutation patterns arise could inform on not only how cancer begins but also the factors influencing this event, which has implications for cancer prevention. To this end, we suggest that there is a narrow window or 'sweet spot' by which oncogenic RAS signalling can promote tumour initiation in normal cells. As a consequence, RAS mutation patterns in each normal cell are a product of the specific RAS isoform mutated, as well as the position of the mutation and type of substitution to achieve an ideal level of signalling.
The phenotypic consequence of a given mutation can be influenced by the genetic background. For example, conditional gene essentiality occurs when the loss of function of a gene causes lethality in one genetic background but not another. Between two individual strains, S288c and Σ1278b, ∼1% of yeast genes were previously identified as "conditional essential." Here, in addition to confirming that some conditional essential genes are modified by a nonchromosomal element, we show that most cases involve a complex set of genomic modifiers. From tetrad analysis of S288c/Σ1278b hybrid strains and whole-genome sequencing of viable hybrid spore progeny, we identified complex sets of multiple genomic regions underlying conditional essentiality. For a smaller subset of genes, including
and
, each of which encodes components of the cysteine biosynthesis pathway, we observed a segregation pattern consistent with a single modifier associated with conditional essentiality. In natural yeast isolates, we found that the
/
conditional essentiality can be caused by variation in two independent modifiers,
and
, each with roles associated with cellular cysteine physiology. Interestingly, the
allelic variation appears to have arisen independently from separate lineages, with rare allele frequencies below 0.5%. Thus, while conditional gene essentiality is usually driven by genetic interactions associated with complex modifier architectures, our analysis also highlights the role of functionally related, genetically independent, and rare variants.
Tea plant (Camellia sinensis) is one of the world's most important beverage crops due to its numerous secondary metabolites conferring tea quality and health effects. However, only a small fraction of tea genes (especially metabolite-related genes) have been functionally characterized to date. A cohesive bioinformatics platform is thus urgently needed to aid in the functional determination of the remaining genes.
TeaCoN, a gene co-expression network database for tea plant, was established to provide genome-wide associations in gene co-expression to survey gene modules (i.e., co-expressed gene sets) for a function of interest. TeaCoN features a comprehensive collection of 261 high-quality RNA-Seq experiments that cover a wide range of tea tissues as well as various treatments for tea plant. In the current version of TeaCoN, 31,968 tea gene models (94% coverage of the genome) are documented. Users can retrieve detailed co-expression information for gene(s) of interest in four aspects: 1) co-expressed genes with the corresponding Pearson correlation coefficients (PCC-values) and statistical P-values, 2) gene information (gene ID, description, symbol, alias, chromosomal location, GO and KEGG annotation), 3) expression profile heatmap of co-expressed genes across seven main tea tissues (e.g., leaf, bud, stem, root), and 4) network visualization of co-expressed genes. We also implemented a gene co-expression analysis, BLAST search function, GO and KEGG enrichment analysis, and genome browser to facilitate use of the database.
The TeaCoN project can serve as a beneficial platform for candidate gene screening and functional exploration of important agronomical traits in tea plant. TeaCoN is freely available at http://teacon.wchoda.com.
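The Pearson-correlation ranking that underlies a co-expression lookup can be illustrated with a minimal sketch. This is not TeaCoN's implementation: the function name `coexpression_partners`, the toy expression matrix, and the gene IDs are all hypothetical, and only the PCC step is shown (P-values are omitted).

```python
import numpy as np

def coexpression_partners(expr, gene_ids, query, top_n=3):
    """Rank genes by Pearson correlation with `query`.

    expr is a genes x samples matrix; each row is one gene's
    expression profile across RNA-Seq samples (hypothetical data).
    """
    pcc = np.corrcoef(expr)            # rows are treated as variables
    q = gene_ids.index(query)
    order = np.argsort(-pcc[q])        # highest correlation first
    ranked = [(gene_ids[i], float(pcc[q, i])) for i in order if i != q]
    return ranked[:top_n]

# Toy matrix: geneB tracks geneA exactly, geneC mirrors it, geneD is noise.
rng = np.random.default_rng(0)
base = rng.normal(size=20)
expr = np.vstack([
    base,
    2.0 * base + 1.0,
    -base,
    rng.normal(size=20),
])
gene_ids = ["geneA", "geneB", "geneC", "geneD"]
partners = coexpression_partners(expr, gene_ids, "geneA")
```

In this toy case the first entry of `partners` is the perfectly correlated gene and the last is the anti-correlated one; a real network would typically apply a PCC or P-value cutoff before drawing edges.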
Transcriptional heterogeneity among malignant cells of a tumor has been studied in individual cancer types and shown to be organized into cancer cell states; however, it remains unclear to what extent these states span tumor types, constituting general features of cancer. Here, we perform a pan-cancer single-cell RNA-sequencing analysis across 15 cancer types and identify a catalog of gene modules whose expression defines recurrent cancer cell states including 'stress', 'interferon response', 'epithelial-mesenchymal transition', 'metal response', 'basal' and 'ciliated'. Spatial transcriptomic analysis linked the interferon response in cancer cells to T cells and macrophages in the tumor microenvironment. Using mouse models, we further found that induction of the interferon response module varies by tumor location and is diminished upon elimination of lymphocytes. Our work provides a framework for studying how cancer cell states interact with the tumor microenvironment to form organized systems capable of immune evasion, drug resistance and metastasis.
SUMMARY
Sorghum anthracnose caused by the fungus Colletotrichum sublineola (Cs) is a damaging disease of the crop. Here, we describe the identification of ANTHRACNOSE RESISTANCE GENES (ARG4 and ARG5) encoding canonical nucleotide‐binding leucine‐rich repeat (NLR) receptors. ARG4 and ARG5 are dominant resistance genes identified in the sorghum lines SAP135 and P9830, respectively, that show broad‐spectrum resistance to Cs. Independent genetic studies using populations generated by crossing SAP135 and P9830 with TAM428, fine mapping using molecular markers, comparative genomics and gene expression studies determined that ARG4 and ARG5 are resistance genes against Cs strains. Interestingly, ARG4 and ARG5 are both located within clusters of duplicate NLR genes at linked loci separated by a ~1 Mb genomic region. SAP135 and P9830 each carry only one of the ARG genes while having the recessive allele at the second locus. Only two copies of the ARG5 candidate genes were present in the resistant P9830 line while five non‐functional copies were identified in the susceptible line. The resistant parents and their recombinant inbred lines carrying either ARG4 or ARG5 are resistant to strains Csgl1 and Csgrg, suggesting that these genes have overlapping specificities. The role of ARG4 and ARG5 in resistance was validated through sorghum lines carrying independent recessive alleles that show increased susceptibility. ARG4 and ARG5 are located within complex loci displaying interesting haplotype structures and copy number variation that may have resulted from duplication. Overall, the identification of anthracnose resistance genes with unique haplotype structure provides a foundation for genetic studies and resistance breeding.
Significance Statement
We identified two tightly linked NLR genes conferring resistance to the sorghum anthracnose disease that is a major challenge to production of the crop globally. The two genes were identified among clusters of highly similar duplicate genes with resistant and susceptible lines having unique haplotypes, and amino acid substitutions in key positions, leading to changes in tertiary protein structure and disease resistance function. The results provide new opportunities for resistance breeding and genetic and molecular studies.
MicroRNAs (miRNAs) function in post-transcriptional regulation of gene expression by binding to target messenger RNAs (mRNAs). Because of the key part that miRNAs play, understanding the correct regulatory role of miRNAs in diverse patho-physiological conditions is of great interest. Although it is known that miRNAs act combinatorially to regulate genes, precise identification of miRNA-gene interactions and their specific functional roles in regulatory comodules remains a challenge. We developed THEIA, an effective method for simultaneously predicting miRNA-gene interactions and regulatory comodules, which group functionally related miRNAs and genes via non-negative matrix factorization (NMF).
We apply THEIA to RNA sequencing data from breast invasive carcinoma samples and demonstrate its effectiveness in discovering biologically significant regulatory comodules that are significantly enriched in spatial miRNA clusters, biological pathways, and various cancers.
THEIA is a theoretically rigorous optimization algorithm that simultaneously predicts the strength and direction (i.e., up-regulation or down-regulation) of the effect of modules of miRNAs on a gene. We posit that if THEIA is capable of recovering known clusters of genes and miRNAs, then the clusters found by our method not previously identified by literature are also likely to have biological significance. We believe that these novel regulatory comodules found by our method will be a springboard for further research into the specific functional roles of these new functional ensembles of miRNAs and genes, especially those related to diseases like breast cancer.
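For readers unfamiliar with NMF, the factorization step can be sketched with the standard multiplicative-update rules for the squared-error objective. This is a generic illustration, not the THEIA algorithm: THEIA's objective, constraints, and handling of regulatory direction are not described here, and the function `nmf`, the toy block matrix, and the module-assignment step are assumptions made for the example.

```python
import numpy as np

def nmf(X, rank, n_iter=500, eps=1e-9, seed=0):
    """Factor a non-negative matrix X (n x m) as W @ H.

    W (n x rank) and H (rank x m) stay non-negative because each
    multiplicative update rescales entries by a non-negative ratio.
    """
    rng = np.random.default_rng(seed)
    n, m = X.shape
    W = rng.random((n, rank)) + eps
    H = rng.random((rank, m)) + eps
    for _ in range(n_iter):
        H *= (W.T @ X) / (W.T @ W @ H + eps)
        W *= (X @ H.T) / (W @ H @ H.T + eps)
    return W, H

# Hypothetical association matrix: rows could stand for miRNAs,
# columns for genes, with two planted blocks of associations.
X = np.zeros((6, 8))
X[:3, :4] = 1.0
X[3:, 4:] = 1.0

W, H = nmf(X, rank=2)
row_module = W.argmax(axis=1)   # dominant factor for each row
col_module = H.argmax(axis=0)   # dominant factor for each column
error = np.linalg.norm(X - W @ H)
```

Rows and columns that share a dominant factor can be read as one comodule. Plain NMF of this kind yields only non-negative loadings, so it does not by itself distinguish up-regulation from down-regulation.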
The timing of flowering during the year is an important adaptive character affecting reproductive success in plants and is critical to crop yield. Flowering time has been extensively manipulated in crops such as wheat (Triticum aestivum L.) during domestication, and this enables them to grow productively in a wide range of environments. Several major genes controlling flowering time have been identified in wheat with mutant alleles having sequence changes such as insertions, deletions or point mutations. We investigated genetic variants in commercial varieties of wheat that regulate flowering by altering photoperiod response (Ppd-B1 alleles) or vernalization requirement (Vrn-A1 alleles) and for which no candidate mutation was found within the gene sequence. Genetic and genomic approaches showed that in both cases alleles conferring altered flowering time had an increased copy number of the gene and altered gene expression. Alleles with an increased copy number of Ppd-B1 confer an early-flowering, day-neutral phenotype and have arisen independently at least twice. Plants with an increased copy number of Vrn-A1 have an increased requirement for vernalization so that longer periods of cold are required to potentiate flowering. The results suggest that copy number variation (CNV) plays a significant role in wheat adaptation.
Earlier research has revealed that the ndh loci have been pseudogenized, truncated, or deleted from most orchid plastomes sequenced to date, including in all available plastomes of the two most species-rich subfamilies, Orchidoideae and Epidendroideae. This study sought to resolve deeper-level phylogenetic relationships among major orchid groups and to refine the history of gene loss in the ndh loci across orchids. The complete plastomes of seven orchids, Oncidium sphacelatum (Epidendroideae), Masdevallia coccinea (Epidendroideae), Sobralia callosa (Epidendroideae), Sobralia aff. bouchei (Epidendroideae), Elleanthus sodiroi (Epidendroideae), Paphiopedilum armeniacum (Cypripedioideae), and Phragmipedium longifolium (Cypripedioideae) were sequenced and analyzed in conjunction with all other available orchid and monocot plastomes. Most ndh loci were found to be pseudogenized or lost in Oncidium, Paphiopedilum and Phragmipedium, but surprisingly, all ndh loci were found to retain full, intact reading frames in Sobralia, Elleanthus and Masdevallia. Character mapping suggests that the ndh genes were present in the common ancestor of orchids but have experienced independent, significant losses at least eight times across four subfamilies. In addition, ndhF gene loss was correlated with shifts in the position of the junction of the inverted repeat (IR) and small single-copy (SSC) regions. The Orchidaceae have unprecedented levels of homoplasy in ndh gene presence/absence, which may be correlated in part with the unusual life history of orchids. These results also suggest that ndhF plays a role in IR/SSC junction stability.
Many methods have been used to determine differential gene expression from single-cell RNA (scRNA)-seq data. We evaluated 36 approaches using experimental and synthetic data and found considerable differences in the number and characteristics of the genes that are called differentially expressed. Prefiltering of lowly expressed genes has important effects, particularly for some of the methods developed for bulk RNA-seq data analysis. However, we found that bulk RNA-seq analysis methods do not generally perform worse than those developed specifically for scRNA-seq. We also present conquer, a repository of consistently processed, analysis-ready public scRNA-seq data sets that is aimed at simplifying method evaluation and reanalysis of published results. Each data set provides abundance estimates for both genes and transcripts, as well as quality control and exploratory analysis reports.
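One common form of the prefiltering step mentioned above can be sketched as follows. The function `prefilter_genes`, the thresholds, and the toy count matrix are illustrative assumptions and are not the settings used in the evaluation.

```python
import numpy as np

def prefilter_genes(counts, min_count=1, min_fraction=0.25):
    """Keep genes detected in a minimum fraction of cells.

    counts is a genes x cells matrix of read counts. A gene is
    'detected' in a cell when its count is at least min_count.
    Returns the filtered matrix and the boolean keep-mask.
    """
    detected_fraction = (counts >= min_count).mean(axis=1)
    keep = detected_fraction >= min_fraction
    return counts[keep], keep

# Toy data: four genes measured in four cells (hypothetical values).
counts = np.array([
    [0, 0, 0, 1],   # detected in 1 of 4 cells
    [5, 3, 0, 8],   # detected in 3 of 4 cells
    [0, 0, 0, 0],   # never detected
    [2, 2, 2, 2],   # detected in every cell
])
filtered, keep = prefilter_genes(counts, min_count=1, min_fraction=0.5)
```

With a 50% detection threshold, only the second and fourth genes are retained here. Because the filter changes which genes enter the test, it also changes the multiple-testing burden, which is one way prefiltering can affect downstream calls.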