Bacterial plasmids play a large role in allowing bacteria to adapt to changing environments and can pose a significant risk to human health if they confer virulence and antimicrobial resistance ...(AMR). Plasmids differ significantly in the taxonomic breadth of host bacteria in which they can successfully replicate, this is commonly referred to as 'host range' and is usually described in qualitative terms of 'narrow' or 'broad'. Understanding the host range potential of plasmids is of great interest due to their ability to disseminate traits such as AMR through bacterial populations and into human pathogens. We developed the MOB-suite to facilitate characterization of plasmids and introduced a whole-sequence-based classification system based on clustering complete plasmid sequences using Mash distances (https://github.com/phac-nml/mob-suite). We updated the MOB-suite database from 12 091 to 23 671 complete sequences, representing 17 779 unique plasmids. With advances in new algorithms for rapidly calculating average nucleotide identity (ANI), we compared clustering characteristics using two different distance measures - Mash and ANI - and three clustering algorithms on the unique set of plasmids. The plasmid nomenclature is designed to group highly similar plasmids together that are unlikely to have multiple representatives within a single cell. Based on our results, we determined that clusters generated using Mash and complete-linkage clustering at a Mash distance of 0.06 resulted in highly homogeneous clusters while maintaining cluster size. The taxonomic distribution of plasmid biomarker sequences for replication and relaxase typing, in combination with MOB-suite whole-sequence-based clusters have been examined in detail for all high-quality publicly available plasmid sequences. We have incorporated prediction of plasmid replication host range into the MOB-suite based on observed distributions of these sequence features in combination with known plasmid hosts from the literature. Host range is reported as the highest taxonomic rank that covers all of the plasmids which share replicon or relaxase biomarkers or belong to the same MOB-suite cluster code. Reporting host range based on these criteria allows for comparisons of host range between studies and provides information for plasmid surveillance.
Environmental DNA (eDNA) is an effective approach for detecting vertebrates and plants, especially in aquatic ecosystems, but prior studies have largely examined eDNA in cool temperate settings. By ...contrast, this study employs eDNA to survey the fish fauna in tropical Lake Bacalar (Mexico) with the additional goal of assessing the possible presence of invasive fishes, such as Amazon sailfin catfish and tilapia. Sediment and water samples were collected from eight stations in Lake Bacalar on three occasions over a 4-month interval. Each sample was stored in the presence or absence of lysis buffer to compare eDNA recovery. Short fragments (184-187 bp) of the cytochrome c oxidase I (COI) gene were amplified using fusion primers and then sequenced on Ion Torrent PGM or S5 before their source species were determined using a custom reference sequence database constructed on BOLD. In total, eDNA sequences were recovered from 75 species of vertebrates including 47 fishes, 15 birds, 7 mammals, 5 reptiles, and 1 amphibian. Although all species are known from this region, six fish species represent new records for the study area, while two require verification. Sequences for five species (2 birds, 2 mammals, 1 reptile) were only detected from sediments, while sequences from 52 species were only recovered from water. Because DNA from the Amazon sailfin catfish was not detected, we used a mock eDNA experiment to confirm our methods would enable its detection. In summary, we developed protocols that recovered eDNA from tropical oligotrophic aquatic ecosystems and confirmed their effectiveness in detecting fishes and diverse species of vertebrates.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
is a priority foodborne pathogen of public health concern and phenotypic serotyping provides critical information for surveillance and outbreak detection activities. Public health and food safety ...laboratories are increasingly adopting whole-genome sequencing (WGS) for characterizing pathogens, but it is imperative to maintain serotype designations in order to minimize disruptions to existing public health workflows. Multiple
tools have been developed for predicting serotypes from WGS data, including SRST2, SerotypeFinder and EToKi EBEis, but these tools were not designed with the specific requirements of diagnostic laboratories, which include: speciation, input data flexibility (fasta/fastq), quality control information and easily interpretable results. To address these specific requirements, we developed ECTyper (https://github.com/phac-nml/ecoli_serotyping) for performing both speciation within
and
, and
serotype prediction. We compared the serotype prediction performance of each tool on a newly sequenced panel of 185 isolates with confirmed phenotypic serotype information. We found that all tools were highly concordant, with 92-97 % for O-antigens and 98-100 % for H-antigens, and ECTyper having the highest rate of concordance. We extended the benchmarking to a large panel of 6954 publicly available
genomes to assess the performance of the tools on a more diverse dataset. On the public data, there was a considerable drop in concordance, with 75-91 % for O-antigens and 62-90 % for H-antigens, and ECTyper and SerotypeFinder being the most concordant. This study highlights that
predictions show high concordance with phenotypic serotyping results, but there are notable differences in tool performance. ECTyper provides highly accurate and sensitive
serotype predictions, in addition to speciation, and is designed to be easily incorporated into bioinformatic workflows.
Cryptosporidium is a protozoan parasite that is transmitted to both humans and animals through zoonotic or anthroponotic means. When a host is infected with this parasite, it causes a ...gastrointestinal disease known as cryptosporidiosis. To understand the transmission dynamics of Cryptosporidium, the small subunit (SSU or 18S) rRNA and gp60 genes are commonly studied through PCR analysis and conventional Sanger sequencing. However, analyzing sequence chromatograms manually is both time consuming and prone to human error, especially in the presence of poorly resolved, heterozygous peaks and the absence of a validated database. For this study, we developed a Cryptosporidium genotyping tool, called CryptoGenotyper, which has the capability to read raw Sanger sequencing data for the two common Cryptosporidium gene targets (SSU rRNA and gp60) and classify the sequence data into standard nomenclature. The CryptoGenotyper has the capacity to perform quality control and properly classify sequences using a high quality, manually curated reference database, saving users' time and removing bias during data analysis. The incorporated heterozygous base calling algorithms for the SSU rRNA gene target resolves double peaks, therefore recovering data previously classified as inconclusive. The CryptoGenotyper successfully genotyped 99.3% (428/431) and 95.1% (154/162) of SSU rRNA chromatograms containing single and mixed sequences, respectively, and correctly subtyped 95.6% (947/991) of gp60 chromatograms without manual intervention. This new, user-friendly tool can provide both fast and reproducible analyses of Sanger sequencing data for the two most common Cryptosporidium gene targets.
Display omitted
•Developed CryptoGenotyper to analyze Cryptosporidium sequencing chromatograms.•Uses a high quality, manually curated database to report results accurately.•Provides timely results in standard notation for ease of reporting.•Correctly identified 99.3% SSU rRNA and 95.1% mixed SSU rRNA.•Subtyped 95.6% of gp60 samples in concordance with manual analysis.
The classic isoforms of myelin basic protein (MBP) are essential for the formation and maintenance of myelin in the central nervous system of higher vertebrates. The protein is involved in all facets ...of the development, compaction, and stabilization of the multilamellar myelin sheath, and also interacts with cytoskeletal and signaling proteins. The predominant 18.5-kDa isoform of MBP is an intrinsically-disordered protein that is a candidate auto-antigen in the human demyelinating disease multiple sclerosis. A highly-conserved central segment within classic MBP consists of a proline-rich region (murine 18.5-kDa sequence -T92-P93-R94-T95-P96-P97-P98-S99-) containing a putative SH3-ligand, adjacent to a region that forms an amphipathic α-helix (P82-I90) upon interaction with membranes, or under membrane-mimetic conditions. The T92 and T95 residues within the proline-rich region can be post-translationally modified through phosphorylation by mitogen-activated protein (MAP) kinases. Here, we have investigated the structure of the α-helical and proline-rich regions in dilute aqueous buffer, and have evaluated the effects of phosphorylation at T92 and T95 on the stability and dynamics of the α-helical region, by utilizing four 36-residue peptides (S72-S107) with differing phosphorylation status. Nuclear magnetic resonance spectroscopy reveals that both the α-helical as well as the proline-rich regions are disordered in aqueous buffer, whereas they are both structured in a lipid environment (cf., Ahmed et al., Biochemistry 51, 7475-9487, 2012). Thermodynamic analysis of trifluoroethanol-titration curves monitored by circular dichroism spectroscopy reveals that phosphorylation, especially at residue T92, impedes formation of the amphipathic α-helix. This conclusion is supported by molecular dynamics simulations, which further illustrate that phosphorylation reduces the folding reversibility of the α-helix upon temperature perturbation and affect the global structure of the peptides through altered electrostatic interactions. The results support the hypothesis that the central conserved segment of MBP constitutes a molecular switch in which the conformation and/or intermolecular interactions are mediated by phosphorylation/dephosphorylation at T92 and T95.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
A mixed phospholipid-cholestrol bilayer, with cholera toxin B (CTB) units attached to the monosialotetrahexosylganglioside (GM1) binding sites in the distal leaflet, was deposited on a Au(111) ...electrode surface. Polarization modulation infrared reflection absorption spectroscopy (PM-IRRAS) measurements were used to characterize structural and orientational changes in this model biological membrane upon binding CTB and the application of the electrode potential. The data presented in this article show that binding cholera toxin to the membrane leads to an overall increase in the tilt angle of the fatty acid chains; however, the conformation of the bilayer remains relatively constant as indicated by the small decrease in the total number of gauche conformers of acyl tails. In addition, the bound toxin caused a significant decrease in the hydration of the ester group contained within the lipid bilayer. Furthermore, changes in the applied potential had a minimal effect on the overall structure of the membrane. In contrast, our results showed significant voltage-dependent changes in the average orientation of the protein α-helices that may correspond to the voltage-gated opening and closing of the central pore that resides within the B subunit of cholera toxin.
Ingestion of food- or waterborne antibiotic-resistant bacteria may lead to the dissemination of antibiotic-resistance genes in the gut microbiota and the development of antibiotic-resistant bacterial ...infection, a significant threat to animal and public health. Food or water may be contaminated with multiple resistant bacteria, but animal models on gene transfer were mainly based on single-strain infections. In this study, we investigated the mobility of β-lactam resistance following infection with single- versus multi-strain of resistant bacteria under ampicillin treatment. We characterized three bacterial strains isolated from food-animal production systems,
O80:H26 and
serovars Bredeney and Heidelberg. Each strain carries at least one conjugative plasmid that encodes a β-lactamase. We orally infected mice with each or all three bacterial strain(s) in the presence or absence of ampicillin treatment. We assessed plasmid transfer from the three donor bacteria to an introduced
CV601gfp recipient in the mouse gut, and evaluated the impacts of the bacterial infection on gut microbiota and gut health. In the absence of ampicillin treatment, none of the donor or recipient bacteria established in the normal gut microbiota and plasmid transfer was not detected. In contrast, the ampicillin treatment disrupted the gut microbiota and enabled
. Bredeney and Heidelberg to colonize and transfer their plasmids to the
CV601gfp recipient.
O80:H26 on its own failed to colonize the mouse gut. However, during co-infection with the two
strains,
O80:H26 colonized and transferred its plasmid to the
CV601gfp recipient and a residential
O2:H6 strain. The co-infection significantly increased plasmid transfer frequency, enhanced Proteobacteria expansion and resulted in inflammation in the mouse gut. Our findings suggest that single-strain infection models for evaluating
gene transfer may underrepresent the consequences of multi-strain infections following the consumption of heavily contaminated food or water.
Analyzing time-course expression data captured in microarray datasets is a complex undertaking as the vast and complex data space is represented by a relatively low number of samples as compared to ...thousands of available genes. Here, we developed the Interdependent Correlation Clustering (ICC) method to analyze relationships that exist among genes conditioned on the expression of a specific target gene in microarray data. Based on Correlation Clustering, the ICC method analyzes a large set of correlation values related to gene expression profiles extracted from given microarray datasets. ICC can be applied to any microarray dataset and any target gene. We applied this method to microarray data generated from wine fermentations and selected NSF1, which encodes a C2H2 zinc finger-type transcription factor, as the target gene. The validity of the method was verified by accurate identifications of the previously known functional roles of NSF1. In addition, we identified and verified potential new functions for this gene; specifically, NSF1 is a negative regulator for the expression of sulfur metabolism genes, the nuclear localization of Nsf1 protein (Nsf1p) is controlled in a sulfur-dependent manner, and the transcription of NSF1 is regulated by Met4p, an important transcriptional activator of sulfur metabolism genes. The inter-disciplinary approach adopted here highlighted the accuracy and relevancy of the ICC method in mining for novel gene functions using complex microarray datasets with a limited number of samples.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Myelin basic protein (MBP), specifically the 18.5 kDa isoform, is a peripheral membrane protein and a major component of mammalian central nervous system myelin. It is an intrinsically disordered and ...multifunctional protein that binds cytoskeletal and other cytosolic proteins to a membrane surface and thereby acquires ordered structure. These associations are modulated by post-translational modifications of MBP, as well as by interactions of MBP with Ca2+-calmodulin (CaM). Enzymatic deimination of usually six arginine residues to citrulline results in a decrease in the net positive charge of the protein from 19 to ≤13. This deiminated form is found in greater amounts in normal children and in adult patients with the demyelinating disease multiple sclerosis. In this paper, we examine the secondary structure of a calmodulin-binding domain, residues A141−L154, when associated with a lipid bilayer in recombinant murine 18.5 kDa forms rmC1 (unmodified) and rmC8 (pseudodeiminated). We demonstrate here by site-directed spin labeling and electron paramagnetic resonance (EPR) spectroscopy that the Y142−L154 segment in membrane-associated rmC1 forms an amphipathic α-helix, with high accessibility to O2 and low accessibility to NiEDDA. In membrane-associated rmC8, this segment assumed a structure distorted from an α-helix. Spin-labeled residues in rmC1 in solution were more immobilized on binding Ca2+-CaM than those in rmC8. Furthermore, rmC8 was dissociated more readily from a lipid bilayer by Ca2+-CaM than was rmC1. These results confirm both a predicted induced ordering upon membrane association in a specific segment of 18.5 kDa MBP, and that this segment is a CaM-binding site, with both interactions weakened by deimination of residues outside of this segment. The deiminated form would be more susceptible to regulation of its membrane binding functions by Ca2+-CaM than the unmodified form.
ABSTRACT
Gene regulatory network (GRN) inference is an active area of research that facilitates understanding the complex interplays between biological molecules. We propose a novel framework to ...create such GRNs, based on Conditional Inference Forests (CIFs) as proposed by Strobl et al. Our framework consists of using ensembles of Conditional Inference Trees (CITs) and selecting an appropriate aggregation scheme for variant selection prior to network construction. We show on synthetic microarray data that taking the original implementation of CIFs with conditional permutation scheme (CIFcond) may lead to improved performance compared to Breiman's implementation of Random Forests (RF). Among all newly introduced CIF‐based methods and five network scenarios obtained from the DREAM4 challenge, CIFcond performed best. Networks derived from well‐tuned CIFs, obtained by simply averaging P‐values over tree ensembles (CIFmean) are particularly attractive, because they combine adequate performance with computational efficiency. Moreover, thresholds for variable selection are based on significance levels for P‐values and, hence, do not need to be tuned. From a practical point of view, our extensive simulations show the potential advantages of CIFmean‐based methods. Although more work is needed to improve on speed, especially when fully exploiting the advantages of CITs in the context of heterogeneous and correlated data, we have shown that CIF methodology can be flexibly inserted in a framework to infer biological interactions. Notably, we confirmed biologically relevant interaction between IL2RA and FOXP1, linked to the IL‐2 signaling pathway and to type 1 diabetes.