Genome annotation is the process of identifying the location and function of a genome's encoded features. Improving the biological accuracy of annotation is a complex and iterative process requiring researchers to review and incorporate multiple sources of information such as transcriptome alignments, predictive models based on sequence profiles, and comparisons to features found in related organisms. Because rapidly decreasing costs are enabling an ever-growing number of scientists to incorporate sequencing as a routine laboratory technique, there is widespread demand for tools that can assist in the deliberative analytical review of genomic information. To this end, we present Apollo, an open source software package that enables researchers to efficiently inspect and refine the precise structure and role of genomic features in a graphical browser-based platform. Some of Apollo's newer user interface features include support for real-time collaboration, allowing distributed users to simultaneously edit the same encoded features while also instantly seeing the updates made by other researchers on the same region in a manner similar to Google Docs. Its technical architecture enables Apollo to be integrated into multiple existing genomic analysis pipelines and heterogeneous laboratory workflow platforms. Finally, we consider the implications that Apollo and related applications may have on how the results of genome research are published and made accessible.
Abstract
Since its 2015 update, MaizeGDB, the Maize Genetics and Genomics database, has expanded to support the sequenced genomes of many maize inbred lines in addition to the B73 reference genome assembly. Curation and development efforts have targeted high-quality datasets and tools to support maize trait analysis, germplasm analysis, genetic studies, and breeding. MaizeGDB hosts a wide range of data and has recently added support for new data types, including genome metadata, RNA-seq, proteomics, synteny, and large-scale diversity. To improve access to and visualization of these data types, several new tools have been implemented to access large-scale maize diversity data (SNPversity), download and compare gene expression data (qTeller), visualize pedigree data (Pedigree Viewer), link genes with phenotype images (MaizeDIG), and enable flexible user-specified queries to the MaizeGDB database (MaizeMine). MaizeGDB also continues to be the community hub for maize research, coordinating activities and providing technical support to the maize research community. Here we report the changes MaizeGDB has made within the last three years to keep pace with recent software and research advances, as well as the pan-genomic landscape that cheaper and better sequencing technologies have made possible. MaizeGDB is accessible online at https://www.maizegdb.org.
Abstract
We report an update of the Hymenoptera Genome Database (HGD; http://HymenopteraGenome.org), a genomic database of hymenopteran insect species. The number of species represented in HGD has nearly tripled, with fifty-eight hymenopteran species, including twenty bees, twenty-three ants, eleven wasps and four sawflies. With a reorganized website, HGD continues to provide the HymenopteraMine genomic data mining warehouse and JBrowse/Apollo genome browsers integrated with BLAST. We have computed Gene Ontology (GO) annotations for all species, greatly enhancing the GO annotation data gathered from UniProt with more than a ten-fold increase in the number of GO-annotated genes. We have also generated orthology datasets that encompass all HGD species and provide orthologue clusters for fourteen taxonomic groups. The new GO annotation and orthology data are available for searching in HymenopteraMine, and as bulk file downloads.
JBrowse is a fast and full-featured genome browser built with JavaScript and HTML5. It is easily embedded into websites or apps but can also be served as a standalone web page.
Overall improvements to speed and scalability are accompanied by specific enhancements that support complex interactive queries on large track sets. Analysis functions can readily be added using the plugin framework; most visual aspects of tracks can also be customized, along with clicks, mouseovers, menus, and popup boxes. JBrowse can also be used to browse local annotation files offline and to generate high-resolution figures for publication.
JBrowse is a mature web application suitable for genome visualization and analysis.
Many temperate insects survive the harsh conditions of winter by undergoing photoperiodic diapause, a pre-programmed developmental arrest initiated by short day lengths. Despite the well-established ecological significance of photoperiodic diapause, the molecular basis of this crucial adaptation remains largely unresolved. The Asian tiger mosquito, Aedes albopictus (Skuse), represents an outstanding emerging model to investigate the molecular basis of photoperiodic diapause in a well-defined ecological and evolutionary context. Ae. albopictus is a medically significant vector and is currently considered the most invasive mosquito in the world. Traits related to diapause appear to be important factors contributing to the rapid spread of this mosquito. To generate novel sequence information for this species, as well as to discover transcripts involved in diapause preparation, we sequenced the transcriptome of Ae. albopictus oocytes destined to become diapausing or non-diapausing pharate larvae.
454 GS-FLX transcriptome sequencing yielded >1.1 million quality-filtered reads, which we assembled into 69,474 contigs (N50 = 1,009 bp). Our contig filtering approach, where we took advantage of strong sequence similarity to the fully sequenced genome of Aedes aegypti, as well as other reference organisms, resulted in 11,561 high-quality, conservative ESTs. Differential expression estimates based on normalized read counts revealed 57 genes with higher expression, and 257 with lower expression under diapause-inducing conditions. Analysis of expression by qPCR for 47 of these genes indicated a high correlation of expression levels between 454 sequence data and qPCR, but congruence of statistically significant differential expression was low. Seven genes identified as differentially expressed based on qPCR have putative functions that are consistent with the insect diapause syndrome; three genes have unknown function and represent novel candidates for the transcriptional basis of diapause.
Our transcriptome database provides a rich resource for the comparative genomics and functional genetics of Ae. albopictus, an invasive and medically important mosquito. Additionally, the identification of differentially expressed transcripts related to diapause enriches the limited knowledge base for the molecular basis of insect diapause, in particular for the preparatory stage. Finally, our analysis illustrates a useful approach that draws from a closely related reference genome to generate high-confidence ESTs in a non-model organism.
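The abstract above reports differential expression estimates "based on normalized read counts" without naming the normalization. As an illustration only, the sketch below assumes simple reads-per-million scaling and a log2 ratio with a pseudocount; the gene names, counts, and function names are hypothetical, and no statistical test is performed, so it should not be read as the authors' method.

```python
import math
from typing import Dict


def reads_per_million(counts: Dict[str, int]) -> Dict[str, float]:
    """Scale raw per-gene read counts to reads per million mapped reads."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("library contains no reads")
    return {gene: count * 1_000_000 / total for gene, count in counts.items()}


def log2_fold_change(treated: float, control: float, pseudocount: float = 1.0) -> float:
    """Log2 ratio of two normalized values; the pseudocount avoids division by zero."""
    return math.log2((treated + pseudocount) / (control + pseudocount))


# Hypothetical toy libraries (not data from the study).
diapause_counts = {"geneA": 300, "geneB": 100, "geneC": 600}
non_diapause_counts = {"geneA": 100, "geneB": 400, "geneC": 1500}

diapause_rpm = reads_per_million(diapause_counts)
non_diapause_rpm = reads_per_million(non_diapause_counts)

# Positive values mean higher expression under diapause-inducing conditions.
fold_changes = {
    gene: log2_fold_change(diapause_rpm[gene], non_diapause_rpm[gene])
    for gene in diapause_counts
}

for gene, value in sorted(fold_changes.items()):
    print(f"{gene}\t{value:+.2f}")
```

Scaling by library size before taking the ratio matters here because the two toy libraries differ in depth: geneC has more raw reads in the non-diapause library, yet its share of that library is only modestly higher.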
Embryos generated with the use of assisted reproductive technologies (ART) can develop overgrowth syndromes. In ruminants, the condition is referred to as large offspring syndrome (LOS) and exhibits variable phenotypic abnormalities including overgrowth, enlarged tongue, and abdominal wall defects. These characteristics recapitulate those observed in the human loss-of-imprinting (LOI) overgrowth syndrome Beckwith–Wiedemann (BWS). We have recently shown LOI at the KCNQ1 locus in LOS, the most common epimutation in BWS. Although the first case of ART-induced LOS was reported in 1995, studies have not yet determined the extent of LOI in this condition. Here, we determined allele-specific expression of imprinted genes previously identified in human and/or mouse in day ∼105 Bos taurus indicus × Bos taurus taurus F1 hybrid control and LOS fetuses using RNAseq. Our analysis allowed us to determine the monoallelic expression of 20 genes in tissues of control fetuses. LOS fetuses displayed variable LOI compared with controls. Biallelic expression of imprinted genes in LOS was associated with tissue-specific hypomethylation of the normally methylated parental allele. In addition, a positive correlation was observed between body weight and the number of biallelically expressed imprinted genes in LOS fetuses. Furthermore, not only was there loss of allele-specific expression of imprinted genes in LOS, but also differential transcript amounts of these genes between control and overgrown fetuses. In summary, we characterized previously unidentified imprinted genes in bovines and identified misregulation of imprinting at multiple loci in LOS. We concluded that LOS is a multilocus LOI syndrome, as is BWS.
Significance
Large offspring syndrome (LOS) is a fetal overgrowth condition that mimics the human syndrome Beckwith–Wiedemann. These conditions have been observed with higher incidence in offspring conceived with the use of assisted reproductive technologies and are believed to be the result of misregulation of a set of genes that are expressed only from the maternally or paternally inherited chromosomes. These genes are known as imprinted genes. In our study, we demonstrate that the kidney, brain, muscle, and liver of LOS fetuses show misregulation of multiple imprinted genes when compared with controls. Furthermore, we show that the magnitude of overgrowth in LOS fetuses correlates with the number of misregulated imprinted genes. Our results may help create diagnostics for these fetal syndromes.
Although eusociality evolved independently within several orders of insects, research into the molecular underpinnings of the transition towards social complexity has been confined primarily to Hymenoptera (for example, ants and bees). Here we sequence the genome and stage-specific transcriptomes of the dampwood termite Zootermopsis nevadensis (Blattodea) and compare them with similar data for eusocial Hymenoptera, to better identify commonalities and differences in achieving this significant transition. We show an expansion of genes related to male fertility, with upregulated gene expression in male reproductive individuals reflecting the profound differences in mating biology relative to the Hymenoptera. For several chemoreceptor families, we show divergent numbers of genes, which may correspond to the more claustral lifestyle of these termites. We also show similarities in the number and expression of genes related to caste determination mechanisms. Finally, patterns of DNA methylation and alternative splicing support a hypothesized epigenetic regulation of caste differentiation.
Abstract
Background
Major advances in selection progress for cattle have been made following the introduction of genomic tools over the past 10–12 years. These tools depend upon the Bos taurus reference genome (UMD3.1.1), which was created using now-outdated technologies and is hindered by a variety of deficiencies and inaccuracies.
Results
We present the new reference genome for cattle, ARS-UCD1.2, based on the same animal as the original to facilitate transfer and interpretation of results obtained from the earlier version, but applying a combination of modern technologies in a de novo assembly to increase continuity, accuracy, and completeness. The assembly includes 2.7 Gb and is >250× more continuous than the original assembly, with contig N50 >25 Mb and L50 of 32. We also greatly expanded supporting RNA-based data for annotation that identifies 30,396 total genes (21,039 protein coding). The new reference assembly is accessible in annotated form for public use.
Conclusions
We demonstrate that improved continuity of assembled sequence warrants the adoption of ARS-UCD1.2 as the new cattle reference genome and that increased assembly accuracy will benefit future research on this species.
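The contig N50 and L50 figures quoted in the Results above are standard continuity statistics. As a minimal sketch of how they are conventionally computed, assuming only a list of contig lengths (the function name and the toy lengths are hypothetical and are not taken from ARS-UCD1.2):

```python
from typing import Iterable, Tuple


def n50_l50(lengths: Iterable[int]) -> Tuple[int, int]:
    """Return (N50, L50) for a collection of contig lengths.

    N50 is the length of the contig at which the running total of
    lengths, taken from longest to shortest, first reaches half of the
    assembly size. L50 is the number of contigs needed to get there.
    """
    ordered = sorted((n for n in lengths if n > 0), reverse=True)
    if not ordered:
        raise ValueError("at least one positive contig length is required")
    half_total = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half_total:
            return length, count
    raise AssertionError("unreachable: running total always reaches half")


# Hypothetical toy assembly: total length 300, so the halfway point is 150.
toy_contigs = [80, 70, 50, 40, 30, 20, 10]
n50, l50 = n50_l50(toy_contigs)
print(f"N50 = {n50}, L50 = {l50}")
```

A higher N50 together with a lower L50 indicates that fewer, longer contigs cover half of the assembly, which is the sense in which one assembly is described as more continuous than another.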
We report an update of the Hymenoptera Genome Database (HGD) (http://HymenopteraGenome.org), a model organism database for insect species of the order Hymenoptera (ants, bees and wasps). HGD maintains genomic data for 9 bee species, 10 ant species and 1 wasp, including the versions of genome and annotation data sets published by the genome sequencing consortiums and those provided by NCBI. A new data-mining warehouse, HymenopteraMine, based on the InterMine data warehousing system, integrates the genome data with data from external sources and facilitates cross-species analyses based on orthology. New genome browsers and annotation tools based on JBrowse/WebApollo provide easy genome navigation and viewing of high-throughput sequence data sets, and can be used for collaborative genome annotation. All of the genomes and annotation data sets are combined into a single BLAST server that allows users to select and combine sequence data sets to search.
Assisted reproductive therapies (ART) have become increasingly common worldwide and numerous retrospective studies have indicated that ART-conceived children are more likely to develop the overgrowth syndrome Beckwith-Wiedemann (BWS). In bovines, the use of ART can induce a similar overgrowth condition, which is referred to as large offspring syndrome (LOS). Both BWS and LOS involve misregulation of imprinted genes. However, it remains unknown whether molecular alterations at non-imprinted loci contribute to these syndromes. Here we examined the transcriptome of skeletal muscle, liver, kidney, and brain of control and LOS bovine fetuses and found that different tissues within LOS fetuses have perturbations of distinct gene pathways. Notably, in skeletal muscle, multiple pathways involved in myoblast proliferation and fusion into myotubes are misregulated in LOS fetuses. Further, characterization of the DNA methylome of skeletal muscle demonstrates numerous local methylation differences between LOS and controls; however, only a small percentage of differentially expressed genes (DEGs), including the imprinted gene IGF2R, could be associated with the neighboring differentially methylated regions. In summary, we not only show that misregulation of non-imprinted genes and loss-of-imprinting characterize the ART-induced overgrowth syndrome but also demonstrate that most of the DEGs are not directly associated with DNA methylome epimutations.