We undertook a genome-wide search for novel noncoding RNAs (ncRNA) in the malaria parasite Plasmodium falciparum. We used the RNAz program to predict structures in the noncoding regions of the P. ...falciparum 3D7 genome that were conserved with at least one of seven other Plasmodium spp. genome sequences. By using Northern blot analysis for 76 high-scoring predictions and microarray analysis for the majority of candidates, we have verified the expression of 33 novel ncRNA transcripts including four members of a ncRNA family in the asexual blood stage. These transcripts represent novel structured ncRNAs in P. falciparum and are not represented in any RNA databases. We provide supporting evidence for purifying selection acting on the experimentally verified ncRNAs by comparing the nucleotide substitutions in the predicted ncRNA candidate structures in P. falciparum with the closely related chimp malaria parasite P. reichenowi. The high confirmation rate within a single parasite life cycle stage suggests that many more of the predictions may be expressed in other stages of the organism's life cycle.
Relics from the RNA world Jeffares, D C; Poole, A M; Penny, D
Journal of molecular evolution
46, Številka:
1
Journal Article
Recenzirano
An RNA world is widely accepted as a probable stage in the early evolution of life. Two implications are that proteins have gradually replaced RNA as the main biological catalysts and that RNA has ...not taken on any major de novo catalytic function after the evolution of protein synthesis, that is, there is an essentially irreversible series of steps RNA --> RNP --> protein. This transition, as expected from a consideration of catalytic perfection, is essentially complete for reactions when the substrates are small molecules. Based on these principles we derive criteria for identifying RNAs in modern organisms that are relics from the RNA world and then examine the function and phylogenetic distribution of RNA for such remnants of the RNA world. This allows an estimate of the minimum complexity of the last ribo-organism-the stage just preceding the advent of genetically encoded protein synthesis. Despite the constraints placed on its size by a low fidelity of replication (the Eigen limit), we conclude that the genome of this organism reached a considerable level of complexity that included several RNA-processing steps. It would include a large protoribosome with many smaller RNAs involved in its assembly, pre-tRNAs and tRNA processing, an ability for recombination of RNA, some RNA editing, an ability to copy to the end of each RNA strand, and some transport functions. It is harder to recognize specific metabolic reactions that must have existed but synthetic and bio-energetic functions would be necessary. Overall, this requires that such an organism maintained a multiple copy, double-stranded linear RNA genome capable of recombination and splicing. The genome was most likely fragmented, allowing each "chromosome" to be replicated with minimum error, that is, within the Eigen limit. The model as developed serves as an outgroup to root the tree of life and is an alternative to using sequence data for inferring properties of the earliest cells.
The path from the RNA world Poole, A M; Jeffares, D C; Penny, D
Journal of molecular evolution
46, Številka:
1
Journal Article
Recenzirano
We describe a sequential (step by step) Darwinian model for the evolution of life from the late stages of the RNA world through to the emergence of eukaryotes and prokaryotes. The starting point is ...our model, derived from current RNA activity, of the RNA world just prior to the advent of genetically-encoded protein synthesis. By focusing on the function of the protoribosome we develop a plausible model for the evolution of a protein-synthesizing ribosome from a high-fidelity RNA polymerase that incorporated triplets of oligonucleotides. With the standard assumption that during the evolution of enzymatic activity, catalysis is transferred from RNA --> RNP --> protein, the first proteins in the "breakthrough organism" (the first to have encoded protein synthesis) would be nonspecific chaperone-like proteins rather than catalytic. Moreover, because some RNA molecules that pre-date protein synthesis under this model now occur as introns in some of the very earliest proteins, the model predicts these particular introns are older than the exons surrounding them, the "introns-first" theory. Many features of the model for the genome organization in the final RNA world ribo-organism are more prevalent in the eukaryotic genome and we suggest that the prokaryotic genome organization (a single, circular genome with one center of replication) was derived from a "eukaryotic-like" genome organization (a fragmented linear genome with multiple centers of replication). The steps from the proposed ribo-organism RNA genome --> eukaryotic-like DNA genome --> prokaryotic-like DNA genome are all relatively straightforward, whereas the transition prokaryotic-like genome --> eukaryotic-like genome appears impossible under a Darwinian mechanism of evolution, given the assumption of the transition RNA --> RNP --> protein. A likely molecular mechanism, "plasmid transfer," is available for the origin of prokaryotic-type genomes from an eukaryotic-like architecture. Under this model prokaryotes are considered specialized and derived with reduced dependence on ssRNA biochemistry. A functional explanation is that prokaryote ancestors underwent selection for thermophily (high temperature) and/or for rapid reproduction (r selection) at least once in their history.
The Nepetoideae, a subfamily of Lamiaceae (mint family), is rich in aromatic plants, many of which are sought after for their use as flavours and fragrances or for their medicinal properties. Here we ...present genome assemblies for two species in Nepetiodeae: Drepanocaruym sewerzowii and Marmoritis complanata. Both assemblies were generated using Oxford Nanopore Q20+ reads with contigs anchored to nine pseudomolecules that resulted in 335 Mb and 305 Mb assemblies, respectively, and BUSCO scores above 95% for both the assembly and annotation. We furthermore provide a species tree for the Lamiaceae using only genome derived gene models, complementing existing transcriptome and marker-based phylogenies.The Nepetoideae, a subfamily of Lamiaceae (mint family), is rich in aromatic plants, many of which are sought after for their use as flavours and fragrances or for their medicinal properties. Here we present genome assemblies for two species in Nepetiodeae: Drepanocaruym sewerzowii and Marmoritis complanata. Both assemblies were generated using Oxford Nanopore Q20+ reads with contigs anchored to nine pseudomolecules that resulted in 335 Mb and 305 Mb assemblies, respectively, and BUSCO scores above 95% for both the assembly and annotation. We furthermore provide a species tree for the Lamiaceae using only genome derived gene models, complementing existing transcriptome and marker-based phylogenies.
Small nucleolar RNAs (snoRNAs) and microRNAs (miRNAs) are integral to a range of processes, including ribosome biogenesis and gene regulation. Some are intron encoded, and this organization may ...facilitate coordinated coexpression of host gene and RNA. However, snoRNAs and miRNAs are known to be mobile, so intron-RNA associations may not be evolutionarily stable. We have used genome alignments across 11 mammals plus chicken to examine positional orthology of snoRNAs and miRNAs and report that 21% of annotated snoRNAs and 11% of miRNAs are positionally conserved across mammals. Among RNAs traceable to the bird-mammal common ancestor, 98% of snoRNAs and 76% of miRNAs are intronic. Comparison of the most evolutionarily stable mammalian intronic snoRNAs with those positionally conserved among primates reveals that the former are more overrepresented among host genes involved in translation or ribosome biogenesis and are more broadly and highly expressed. This stability is likely attributable to a requirement for overlap between host gene and intronic snoRNA expression profiles, consistent with an ancestral role in ribosome biogenesis. In contrast, whereas miRNA positional conservation is comparable to that observed for snoRNAs, intronic miRNAs show no obvious association with host genes of a particular functional category, and no statistically significant differences in host gene expression are found between those traceable to mammalian or primate ancestors. Our results indicate evolutionarily stable associations of numerous intronic snoRNAs and miRNAs and their host genes, with probable continued diversification of snoRNA function from an ancestral role in ribosome biogenesis.
The Schizosaccharomyces pombe Mei2 gene encodes an RNA recognition motif (RRM) protein that stimulates meiosis upon binding a specific non-coding RNA and subsequent accumulation in a "mei2-dot" in ...the nucleus. We present here the first systematic characterization of the family of proteins with characteristic Mei2-like amino acid sequences. Mei2-like proteins are an ancient eukaryotic protein family with three identifiable RRMs. The C-terminal RRM (RRM3) is unique to Mei2-like proteins and is the most highly conserved of the three RRMs. RRM3 also contains conserved sequence elements at its C-terminus not found in other RRM domains. Single copy Mei2-like genes are present in some fungi, in alveolates such as Paramecium and in the early branching eukaryote Entamoeba histolytica, while plants contain small families of Mei2-like genes. While the C-terminal RRM is highly conserved between plants and fungi, indicating conservation of molecular mechanisms, plant Mei2-like genes have changed biological context to regulate various aspects of developmental pattern formation.
An exhaustive study of the Sam/Frodo family of non-LTR retrotransposons in the
Caenorhabditis elegans and
Caenorhabditis briggsae genomes demonstrated that
C. briggsae contains 60 Sam/Frodo elements ...including a new subfamily designated Merry, while at least 1000 elements are present in
C. elegans. In contrast to
C. elegans,
C. briggsae does not contain any other non-LTR retrotransposons. The Sam/Frodo/Merry sequences in
C. briggsae are shorter and less complete than the Sam/Frodo sequences in
C. elegans probably because they all lack a functional first open reading frame (
ORF1) and because the genome only encodes one functional reverse transcriptase gene of a non-LTR retrotransposon. Evidence of purifying selection for a functional reverse transcriptase sequence in master/leader elements was found in both nematodes in spite of low copy numbers in
C. briggsae. Sam elements in
C. elegans are the most abundant Sam/Frodo/Merry family members. They contain the only functional
ORF1 copies and, unlike Frodo and Merry members, have a higher GC content than the genomic regions in which they reside. This may indicate a higher transcription rate within this subfamily.
The nuclear pore complex (NPC) facilitates transport between nucleus and cytoplasm. The protein constituents of the NPC, termed nucleoporins (Nups), are conserved across a wide diversity of ...eukaryotes. In apparent exception to this, no nucleoporin genes have been identified in nucleomorph genomes. Nucleomorphs, nuclear remnants of once free-living eukaryotes, took up residence as secondary endosymbionts in cryptomonad and chlorarachniophyte algae. As these genomes are highly reduced, Nup genes may have been lost, or relocated to the host nucleus. However, Nup genes are often poorly conserved between species, so absence may be an artifact of low sequence similarity. We therefore constructed an evolutionary bioinformatic screen to establish whether the apparent absence of Nup genes in nucleomorph genomes is due to genuine absence or the inability of current methods to detect homologues. We searched green plant (Arabidopsis and rice), green alga (Chlamydomonas reinhardtii) and red alga (Cyanidioschyzon merolae) genomes, plus two nucleomorph genomes (Bigelowiella natans and Guillardia theta) with profile hidden Markov models (HMMs) from curated alignments of known vertebrate/yeast Nups. Since the plant, algal and nucleomorph genomes all belong to the kingdom Plantae, and are evolutionarily distant from the outgroup (vertebrate/yeast) training set, we use the plant and algal genomes as internal positive controls for the sensitivity of the searches in nucleomorph genomes. We find numerous Nup homologues in all plant and free-living algal species, but none in either nucleomorph genome. BLAST searches using identified plant and algal Nups also failed to detect nucleomorph homologues. We conclude that nucleomorph Nup genes have either been lost, being replaced by host Nup genes, or, that nucleomorph Nup genes have been transferred to the host nucleus twice independently; once in the evolution of the red algal nucleomorph and once in the green algal nucleomorph.
Background: Miltefosine has been used successfully to treat visceral leishmaniasis (VL) in India, but it was unsuccessful for VL in a clinical trial in Brazil. Methods: To identify molecular markers ...that predict VL treatment failure whole genome sequencing of 26 L. infantum isolates, from cured and relapsed patients allowed a GWAS analysis of SNPs, gene and chromosome copy number variations. Findings: A strong association was identified (p = 0·0005) between the presence of a genetically stable L. infantum Miltefosine Sensitivity Locus (MSL), and a positive response to miltefosine treatment. The risk of treatment failure increased 9·4-fold (95% CI 2·11–53·54) when an isolate did not have the MSL. The complete absence of the MSL predicted miltefosine failure with 0·92 (95% CI 0·65–0·996) sensitivity and 0·78 (95% CI 0·52–0·92) specificity. A genotyping survey of L. infantum (n = 157) showed that the frequency of MSL varies in a cline from 95% in North East Brazil to <5% in the South East. The MSL was found in the genomes of all L. infantum and L. donovani sequenced isolates from the Old World (n = 671), where miltefosine can have a cure rate higher than 93%. Interpretation: Knowledge on the presence or absence of the MSL in L. infantum will allow stratification of patients prior to treatment, helping to establish better therapeutic strategies for VL treatment. Fund: CNPq, FAPES, GCRF MRC and Wellcome Trust. Keywords: Visceral leishmaniasis, Miltefosine treatment failure, Whole-genome sequencing, Miltefosine Susceptibility Locus, Prognostic marker