The ability to simultaneously assess every gene in a genome for a role in a particular process has obvious appeal. This protocol describes how to perform genome-scale RNAi library screens in ...bloodstream-form African trypanosomes, a family of parasites that causes lethal human and animal diseases and also serves as a model for studies on basic aspects of eukaryotic biology and evolution. We discuss strain assembly, screen design and implementation, the RNAi target sequencing approach and hit validation, and we provide a step-by-step protocol. A screen can yield from one to thousands of 'hits' associated with the phenotype of interest. The screening protocol itself takes 2 weeks or less to be completed, and high-throughput sequencing may also be completed within weeks. Pre- and post-screen strain assembly, validation and follow-up can take several months, depending on the type of screen and the number of hits analyzed.
Early reports indicate that long non-coding RNAs (lncRNAs) are novel regulators of biological responses. However, their role in the human innate immune response, which provides the initial defence ...against infection, is largely unexplored. To address this issue, here we characterize the long non-coding RNA transcriptome in primary human monocytes using RNA sequencing. We identify 76 enhancer RNAs (eRNAs), 40 canonical lncRNAs, 65 antisense lncRNAs and 35 regions of bidirectional transcription (RBT) that are differentially expressed in response to bacterial lipopolysaccharide (LPS). Crucially, we demonstrate that knockdown of nuclear-localized, NF-κB-regulated, eRNAs (IL1β-eRNA) and RBT (IL1β-RBT46) surrounding the IL1β locus, attenuates LPS-induced messenger RNA transcription and release of the proinflammatory mediators, IL1β and CXCL8. We predict that lncRNAs can be important regulators of the human innate immune response.
Currently available sequencing technologies enable quick and economical sequencing of many new eukaryotic parasite (apicomplexan or kinetoplastid) species or strains. Compared to SNP calling ...approaches, de novo assembly of these genomes enables researchers to additionally determine insertion, deletion and recombination events as well as to detect complex sequence diversity, such as that seen in variable multigene families. However, there currently are no automated eukaryotic annotation pipelines offering the required range of results to facilitate such analyses. A suitable pipeline needs to perform evidence-supported gene finding as well as functional annotation and pseudogene detection up to the generation of output ready to be submitted to a public database. Moreover, no current tool includes quick yet informative comparative analyses and a first pass visualization of both annotation and analysis results. To overcome those needs we have developed the Companion web server (http://companion.sanger.ac.uk) providing parasite genome annotation as a service using a reference-based approach. We demonstrate the use and performance of Companion by annotating two Leishmania and Plasmodium genomes as typical parasite cases and evaluate the results compared to manually annotated references.
African trypanosomes are major pathogens of humans and livestock and represent a model for studies of unusual protozoal biology. We describe a high-throughput phenotyping approach termed RNA ...interference (RNAi) target sequencing, or RIT-seq that, using Illumina sequencing, maps fitness-costs associated with RNAi. We scored the abundance of >90,000 integrated RNAi targets recovered from trypanosome libraries before and after induction of RNAi. Data are presented for 7435 protein coding sequences, >99% of a non-redundant set in the Trypanosoma brucei genome. Analysis of bloodstream and insect life-cycle stages and differentiated libraries revealed genome-scale knockdown profiles of growth and development, linking thousands of previously uncharacterized and "hypothetical" genes to essential functions. Genes underlying prominent features of trypanosome biology are highlighted, including the constitutive emphasis on post-transcriptional gene expression control, the importance of flagellar motility and glycolysis in the bloodstream, and of carboxylic acid metabolism and phosphorylation during differentiation from the bloodstream to the insect stage. The current data set also provides much needed genetic validation to identify new drug targets. RIT-seq represents a versatile new tool for genome-scale functional analyses and for the exploitation of genome sequence data.
The Eukaryotic Pathogen Genomics Database Resource (EuPathDB, http://eupathdb.org) is a collection of databases covering 170+ eukaryotic pathogens (protists & fungi), along with relevant free-living ...and non-pathogenic species, and select pathogen hosts. To facilitate the discovery of meaningful biological relationships, the databases couple preconfigured searches with visualization and analysis tools for comprehensive data mining via intuitive graphical interfaces and APIs. All data are analyzed with the same workflows, including creation of gene orthology profiles, so data are easily compared across data sets, data types and organisms. EuPathDB is updated with numerous new analysis tools, features, data sets and data types. New tools include GO, metabolic pathway and word enrichment analyses plus an online workspace for analysis of personal, non-public, large-scale data. Expanded data content is mostly genomic and functional genomic data while new data types include protein microarray, metabolic pathways, compounds, quantitative proteomics, copy number variation, and polysomal transcriptomics. New features include consistent categorization of searches, data sets and genome browser tracks; redesigned gene pages; effective integration of alternative transcripts; and a EuPathDB Galaxy instance for private analyses of a user's data. Forthcoming upgrades include user workspaces for private integration of data with existing EuPathDB data and improved integration and presentation of host-pathogen interactions.
Leishmania parasites cause a spectrum of clinical pathology in humans ranging from disfiguring cutaneous lesions to fatal visceral leishmaniasis. We have generated a reference genome for Leishmania ...mexicana and refined the reference genomes for Leishmania major, Leishmania infantum, and Leishmania braziliensis. This has allowed the identification of a remarkably low number of genes or paralog groups (2, 14, 19, and 67, respectively) unique to one species. These were found to be conserved in additional isolates of the same species. We have predicted allelic variation and find that in these isolates, L. major and L. infantum have a surprisingly low number of predicted heterozygous SNPs compared with L. braziliensis and L. mexicana. We used short read coverage to infer ploidy and gene copy numbers, identifying large copy number variations between species, with 200 tandem gene arrays in L. major and 132 in L. mexicana. Chromosome copy number also varied significantly between species, with nine supernumerary chromosomes in L. infantum, four in L. mexicana, two in L. braziliensis, and one in L. major. A significant bias against gene arrays on supernumerary chromosomes was shown to exist, indicating that duplication events occur more frequently on disomic chromosomes. Taken together, our data demonstrate that there is little variation in unique gene content across Leishmania species, but large-scale genetic heterogeneity can result through gene amplification on disomic chromosomes and variation in chromosome number. Increased gene copy number due to chromosome amplification may contribute to alterations in gene expression in response to environmental conditions in the host, providing a genetic basis for disease tropism.
FungiDB (fungidb.org) is a free online resource for data mining and functional genomics analysis for fungal and oomycete species. FungiDB is part of the Eukaryotic Pathogen Genomics Database Resource ...(EuPathDB, eupathdb.org) platform that integrates genomic, transcriptomic, proteomic, and phenotypic datasets, and other types of data for pathogenic and nonpathogenic, free-living and parasitic organisms. FungiDB is one of the largest EuPathDB databases containing nearly 100 genomes obtained from GenBank,
Genome Database (AspGD), The Broad Institute, Joint Genome Institute (JGI), Ensembl, and other sources. FungiDB offers a user-friendly web interface with embedded bioinformatics tools that support custom in silico experiments that leverage FungiDB-integrated data. In addition, a Galaxy-based workspace enables users to generate custom pipelines for large-scale data analysis (e.g., RNA-Seq, variant calling, etc.). This review provides an introduction to the FungiDB resources and focuses on available features, tools, and queries and how they can be used to mine data across a diverse range of integrated FungiDB datasets and records.
Two key biological features distinguish Trypanosoma evansi from the T. brucei group: independence from the tsetse fly as obligatory vector, and independence from the need for functional mitochondrial ...DNA (kinetoplast or kDNA). In an effort to better understand the molecular causes and consequences of these differences, we sequenced the genome of an akinetoplastic T. evansi strain from China and compared it to the T. b. brucei reference strain. The annotated T. evansi genome shows extensive similarity to the reference, with 94.9% of the predicted T. b. brucei coding sequences (CDS) having an ortholog in T. evansi, and 94.6% of the non-repetitive orthologs having a nucleotide identity of 95% or greater. Interestingly, several procyclin-associated genes (PAGs) were disrupted or not found in this T. evansi strain, suggesting a selective loss of function in the absence of the insect life-cycle stage. Surprisingly, orthologous sequences were found in T. evansi for all 978 nuclear CDS predicted to represent the mitochondrial proteome in T. brucei, although a small number of these may have lost functionality. Consistent with previous results, the F1FO-ATP synthase γ subunit was found to have an A281 deletion, which is involved in generation of a mitochondrial membrane potential in the absence of kDNA. Candidates for CDS that are absent from the reference genome were identified in supplementary de novo assemblies of T. evansi reads. Phylogenetic analyses show that the sequenced strain belongs to a dominant group of clonal T. evansi strains with worldwide distribution that also includes isolates classified as T. equiperdum. At least three other types of T. evansi or T. equiperdum have emerged independently. Overall, the elucidation of the T. evansi genome sequence reveals extensive similarity of T. brucei and supports the contention that T. evansi should be classified as a subspecies of T. brucei.
GeneDB (http://www.genedb.org/) is a genome database for prokaryotic and eukaryotic organisms. The resource provides a portal through which data generated by the Pathogen Sequencing Unit at the ...Wellcome Trust Sanger Institute and other collaborating sequencing centres can be made publicly available. It combines data from finished and ongoing genome and expressed sequence tag (EST) projects with curated annotation, that can be searched, sorted and downloaded, using a single web based resource. The current release stores 11 datasets of which six are curated and maintained by biologists, who review and incorporate information from the scientific literature, public databases and the respective research communities.
The cell surface of Trypanosoma brucei, like many protistan blood parasites, is crucial for mediating host-parasite interactions and is instrumental to the initiation, maintenance and severity of ...infection. Previous comparisons with the related trypanosomatid parasites T. cruzi and Leishmania major suggest that the cell-surface proteome of T. brucei is largely taxon-specific. Here we compare genes predicted to encode cell surface proteins of T. brucei with those from two related African trypanosomes, T. congolense and T. vivax. We created a cell surface phylome (CSP) by estimating phylogenies for 79 gene families with putative surface functions to understand the more recent evolution of African trypanosome surface architecture. Our findings demonstrate that the transferrin receptor genes essential for bloodstream survival in T. brucei are conserved in T. congolense but absent from T. vivax and include an expanded gene family of insect stage-specific surface glycoproteins that includes many currently uncharacterized genes. We also identify species-specific features and innovations and confirm that these include most expression site-associated genes (ESAGs) in T. brucei, which are absent from T. congolense and T. vivax. The CSP presents the first global picture of the origins and dynamics of cell surface architecture in African trypanosomes, representing the principal differences in genomic repertoire between African trypanosome species and provides a basis from which to explore the developmental and pathological differences in surface architectures. All data can be accessed at: http://www.genedb.org/Page/trypanosoma_surface_phylome.