Analysis of metabolomic profiling data from gas chromatography−mass spectrometry (GC/MS) measurements usually relies upon reference libraries of metabolite mass spectra to structurally identify and ...track metabolites. In general, techniques to enumerate and track unidentified metabolites are nonsystematic and require manual curation. We present a method and software implementation, freely available at http://spectconnect.mit.edu, that can systematically detect components that are conserved across samples without the need for a reference library or manual curation. We validate this approach by correctly identifying the components in a known mixture and the discriminating components in a spiked mixture. Finally, we demonstrate an application of this approach with a brief analysis of the Escherichia coli metabolome. By systematically cataloguing conserved metabolite peaks prior to data analysis methods, our approach broadens the scope of metabolomics and facilitates biomarker discovery.
We have developed periscope, a tool for the detection and quantification of subgenomic RNA (sgRNA) in SARS-CoV-2 genomic sequence data. The translation of the SARS-CoV-2 RNA genome for most open ...reading frames (ORFs) occurs via RNA intermediates termed "subgenomic RNAs." sgRNAs are produced through discontinuous transcription, which relies on homology between transcription regulatory sequences (TRS-B) upstream of the ORF start codons and that of the TRS-L, which is located in the 5' UTR. TRS-L is immediately preceded by a leader sequence. This leader sequence is therefore found at the 5' end of all sgRNA. We applied periscope to 1155 SARS-CoV-2 genomes from Sheffield, United Kingdom, and validated our findings using orthogonal data sets and in vitro cell systems. By using a simple local alignment to detect reads that contain the leader sequence, we were able to identify and quantify reads arising from canonical and noncanonical sgRNA. We were able to detect all canonical sgRNAs at the expected abundances, with the exception of ORF10. A number of recurrent noncanonical sgRNAs are detected. We show that the results are reproducible using technical replicates and determine the optimum number of reads for sgRNA analysis. In VeroE6
+/- cell lines, periscope can detect the changes in the kinetics of sgRNA in orthogonal sequencing data sets. Finally, variants found in genomic RNA are transmitted to sgRNAs with high fidelity in most cases. This tool can be applied to all sequenced COVID-19 samples worldwide to provide comprehensive analysis of SARS-CoV-2 sgRNA.
Wolbachia are widespread maternally-transmitted bacteria of arthropods that often spread by manipulating their host's reproduction through cytoplasmic incompatibility (CI). Their invasive potential ...is currently being harnessed in field trials aiming to control mosquito-borne diseases. Wolbachia genomes commonly harbour prophage regions encoding the cif genes which confer their ability to induce CI. Recently, a plasmid-like element was discovered in wPip, a Wolbachia strain infecting Culex mosquitoes; however, it is unclear how common such extra-chromosomal elements are in Wolbachia. Here we sequenced the complete genome of wAlbA, a strain of the symbiont found in Aedes albopictus, after eliminating the co-infecting and higher density wAlbB strain that previously made sequencing of wAlbA challenging. We show that wAlbA is associated with two new plasmids and identified additional Wolbachia plasmids and related chromosomal islands in over 20% of publicly available Wolbachia genome datasets. These plasmids encode a variety of accessory genes, including several phage-like DNA packaging genes as well as genes potentially contributing to host-symbiont interactions. In particular, we recovered divergent homologues of the cif genes in both Wolbachia- and Rickettsia-associated plasmids. Our results indicate that plasmids are common in Wolbachia and raise fundamental questions around their role in symbiosis. In addition, our comparative analysis provides useful information for the future development of genetic tools to manipulate and study Wolbachia symbionts.
HIV-1 transmission via sexual exposure is an inefficient process. When transmission does occur, newly infected individuals are colonized by the descendants of either a single virion or a very small ...number of establishing virions. These transmitted founder (TF) viruses are more interferon (IFN)-resistant than chronic control (CC) viruses present 6 months after transmission. To identify the specific molecular defences that make CC viruses more susceptible to the IFN-induced 'antiviral state', we established a single pair of fluorescent TF and CC viruses and used arrayed interferon-stimulated gene (ISG) expression screening to identify candidate antiviral effectors. However, we observed a relatively uniform ISG resistance of transmitted HIV-1, and this directed us to investigate possible underlying mechanisms. Simple simulations, where we varied a single parameter, illustrated that reduced growth rate could possibly underly apparent interferon sensitivity. To examine this possibility, we closely monitored in vitro propagation of a model TF/CC pair (closely matched in replicative fitness) over a targeted range of IFN concentrations. Fitting standard four-parameter logistic growth models, in which experimental variables were regressed against growth rate and carrying capacity, to our in vitro growth curves, further highlighted that small differences in replicative growth rates could recapitulate our in vitro observations. We reasoned that if growth rate underlies apparent interferon resistance, transmitted HIV-1 would be similarly resistant to any growth rate inhibitor. Accordingly, we show that two transmitted founder HIV-1 viruses are relatively resistant to antiretroviral drugs, while their matched chronic control viruses were more sensitive. We propose that, when present, the apparent IFN resistance of transmitted HIV-1 could possibly be explained by enhanced replicative fitness, as opposed to specific resistance to individual IFN-induced defences. However, further work is required to establish how generalisable this mechanism of relative IFN resistance might be.
The mechanisms and consequences of genome evolution on viral fitness following host shifts are poorly understood. In addition, viral fitness -the ability of an organism to reproduce and survive- is ...multifactorial and thus difficult to quantify. Influenza A viruses (IAVs) circulate broadly among wild birds and have jumped into and become endemic in multiple mammalian hosts, including humans, pigs, dogs, seals, and horses. H3N8 equine influenza virus (EIV) is an endemic virus of horses that originated in birds and has been circulating uninterruptedly in equine populations since the early 1960s. Here, we used EIV to quantify changes in infection phenotype associated to viral fitness due to genome-wide changes acquired during long-term adaptation. We performed experimental infections of two mammalian cell lines and equine tracheal explants using the earliest H3N8 EIV isolated (A/equine/Uruguay/63 EIV/63), and A/equine/Ohio/2003 (EIV/2003), a monophyletic descendant of EIV/63 isolated 40 years after the emergence of H3N8 EIV. We show that EIV/2003 exhibits increased resistance to interferon, enhanced viral replication, and a more efficient cell-to-cell spread in cells and tissues. Transcriptomics analyses revealed virus-specific responses to each virus, mainly affecting host immunity and inflammation. Image analyses of infected equine respiratory explants showed that despite replicating at higher levels and spreading over larger areas of the respiratory epithelium, EIV/2003 induced milder lesions compared to EIV/63, suggesting that adaptation led to reduced tissue pathogenicity. Our results reveal previously unknown links between virus genotype and the host response to infection, providing new insights on the relationship between virus evolution and fitness.
Le Dantec virus (LDV), assigned to the species Ledantevirus ledantec, genus Ledantevirus, family Rhabdoviridae has been associated with human disease but has gone undetected since the 1970s. We ...describe the detection of LDV in a human case of undifferentiated fever in Uganda by metagenomic sequencing and demonstrate a serological response using ELISA and pseudotype neutralisation. By screening 997 individuals sampled in 2016, we show frequent exposure to ledanteviruses with 76% of individuals seropositive in Western Uganda, but lower seroprevalence in other areas. Serological cross-reactivity as measured by pseudotype-based neutralisation was confined to ledanteviruses, indicating population seropositivity may represent either exposure to LDV or related ledanteviruses. We also describe the discovery of a closely related ledantevirus in blood from the synanthropic rodent Mastomys erythroleucus. Ledantevirus infection is common in Uganda but is geographically heterogenous. Further surveys of patients presenting with acute fever are required to determine the contribution of these emerging viruses to febrile illness in Uganda.
Genome sequencing dramatically increased our ability to understand cellular response to perturbation. Integrating system-wide measurements such as gene expression with networks of protein-protein ...interactions and transcription factor binding revealed critical insights into cellular behavior. However, the potential of systems biology approaches is limited by difficulties in integrating metabolic measurements across the functional levels of the cell despite their being most closely linked to cellular phenotype. To address this limitation, we developed a model-based approach to correlate mRNA and metabolic flux data that combines information from both interaction network models and flux determination models. We started by quantifying 5,764 mRNAs, 54 metabolites, and 83 experimental ¹³C-based reaction fluxes in continuous cultures of yeast under stress in the absence or presence of global regulator Gcn4p. Although mRNA expression alone did not directly predict metabolic response, this correlation improved through incorporating a network-based model of amino acid biosynthesis (from r = 0.07 to 0.80 for mRNA-flux agreement). The model provides evidence of general biological principles: rewiring of metabolic flux (i.e., use of different reaction pathways) by transcriptional regulation and metabolite interaction density (i.e., level of pairwise metabolite-protein interactions) as a key biosynthetic control determinant. Furthermore, this model predicted flux rewiring in studies of follow-on transcriptional regulators that were experimentally validated with additional ¹³C-based flux measurements. As a first step in linking metabolic control and genetic regulatory networks, this model underscores the importance of integrating diverse data types in large-scale cellular models. We anticipate that an integrated approach focusing on metabolic measurements will facilitate construction of more realistic models of cellular regulation for understanding diseases and constructing strains for industrial applications.
The viral ubiquitin ligase ICP0 stimulates the onset of HSV-1 lytic infection and productive reactivation of viral genomes from latency. In order to mediate these processes, it requires its C3HC4 ...RING finger domain, a tertiary structural fold that is coordinated by the binding of two zinc (Zn2+) atoms. Here we formally demonstrate that Zn2+ binding and intracellular Zn2+ levels are critical for ICP0's biochemical activity and that depletion of intracellular Zn2+ severely attenuates HSV-1 replication.
Biting midges (
species) are vectors of arboviruses and were responsible for the emergence and spread of
(SBV) in Europe in 2011 and are likely to be involved in the emergence of other arboviruses in ...Europe. Improved surveillance and better understanding of risks require a better understanding of the circulating viral diversity in these biting insects. In this study, we expand the sequence space of RNA viruses by identifying a number of novel RNA viruses from
(biting midge) using a meta-transcriptomic approach. A novel metaviromic pipeline called MetaViC was developed specifically to identify novel virus sequence signatures from high throughput sequencing (HTS) datasets in the absence of a known host genome. MetaViC is a protein centric pipeline that looks for specific protein signatures in the reads and contigs generated as part of the pipeline. Several novel viruses, including an alphanodavirus with both segments, a novel relative of the Hubei sobemo-like virus 49, two rhabdo-like viruses and a chuvirus, were identified in the Scottish midge samples. The newly identified viruses were found to be phylogenetically distinct to those previous known. These findings expand our current knowledge of viral diversity in arthropods and especially in these understudied disease vectors.