COVID-19 is an infectious disease characterized by multiple respiratory and extrapulmonary manifestations, including gastrointestinal symptoms. Although recent studies have linked gut microbiota to ...infectious diseases such as influenza, little is known about the role of the gut microbiota in COVID-19 pathophysiology.
To better understand the host-gut microbiota interactions in COVID-19, we characterized the gut microbial community and gut barrier function using metagenomic and metaproteomic approaches in 63 COVID-19 patients and 8 non-infected controls. Both immunohematological parameters and transcriptional profiles were measured to reflect the immune response in COVID-19 patients.
Altered gut microbial composition was observed in COVID-19 patients, which was characterized by decreased commensal species and increased opportunistic pathogenic species. Severe illness was associated with higher abundance of four microbial species (i.e., Burkholderia contaminans, Bacteroides nordii, Bifidobacterium longum, and Blautia sp. CAG 257), six microbial pathways (e.g., glycolysis and fermentation), and 10 virulence genes. These severity-related microbial features were further associated with host immune response. For example, the abundance of Bu. contaminans was associated with higher levels of inflammation biomarkers and lower levels of immune cells. Furthermore, human-origin proteins identified from both blood and fecal samples suggested gut barrier dysfunction in COVID-19 patients. The circulating levels of lipopolysaccharide-binding protein increased in patients with severe illness and were associated with circulating inflammation biomarkers and immune cells. Besides, proteins of disease-related bacteria (e.g., B. longum) were detectable in blood samples from patients.
Our results suggest that the dysbiosis of the gut microbiome and the dysfunction of the gut barrier might play a role in the pathophysiology of COVID-19 by affecting host immune homeostasis.
The mouse has been widely used as a model organism for studying human diseases and for evaluating drug safety and efficacy. Many diseases and drug effects exhibit tissue specificity that may be ...reflected by tissue-specific gene-expression profiles. Here we construct a comprehensive mouse transcriptomic BodyMap across 17 tissues of six-weeks old C57BL/6JJcl mice using RNA-seq. We find different expression patterns between protein-coding and non-coding genes. Liver expressed the least complex transcriptomes, that is, the smallest number of genes detected in liver across all 17 tissues, whereas testis and ovary harbor more complex transcriptomes than other tissues. We report a comprehensive list of tissue-specific genes across 17 tissues, along with a list of 4,781 housekeeping genes in mouse. In addition, we propose a list of 27 consistently and highly expressed genes that can be used as reference controls in expression-profiling analysis. Our study provides a unique resource of mouse gene-expression profiles, which is helpful for further biomedical research.
Adenocarcinoma in situ and minimally invasive adenocarcinoma are the pre-invasive forms of lung adenocarcinoma. The genomic and immune profiles of these lesions are poorly understood. Here we report ...exome and transcriptome sequencing of 98 lung adenocarcinoma precursor lesions and 99 invasive adenocarcinomas. We have identified EGFR, RBM10, BRAF, ERBB2, TP53, KRAS, MAP2K1 and MET as significantly mutated genes in the pre/minimally invasive group. Classes of genome alterations that increase in frequency during the progression to malignancy are revealed. These include mutations in TP53, arm-level copy number alterations, and HLA loss of heterozygosity. Immune infiltration is correlated with copy number alterations of chromosome arm 6p, suggesting a link between arm-level events and the tumor immune environment.
In contrast to protein-coding genes, long-noncoding RNAs (lncRNAs) are much less well understood, despite increasing evidence indicating a wide range of their biological functions, and possible roles ...in various cancers. Based on public RNA-seq datasets of four solid cancer types, we here utilize Weighted Correlation Network Analysis (WGCNA) to propose a strategy for exploring the functions of lncRNAs altered in more than two cancer types, which we call onco-lncRNAs. Results indicate that cancer-expressed lncRNAs show high tissue specificity and are weakly expressed, more so than protein-coding genes. Most of the 236 onco-lncRNAs we identified have not been reported to have associations with cancers before. Our analysis exploits co-expression network to reveal that onco-lncRNAs likely play key roles in the multistep development of human cancers, covering a wide range of functions in genome stability maintenance, signaling, cell adhesion and motility, morphogenesis, cell cycle, immune and inflammatory response. These observations contribute to a more comprehensive understanding of cancer-associated lncRNAs, while demonstrating a novel and efficient strategy for subsequent functional studies of lncRNAs.
Eggs produced by the mature female parasite are responsible for the pathogenesis and transmission of schistosomiasis. Female schistosomes rely on a unique male-induced strategy to accomplish ...reproductive development, a process that is incompletely understood. Here we map detailed transcriptomic profiles of male and female Schistosoma japonicum across eight time points throughout the sexual developmental process from pairing to maturation. The dynamic gene expression pattern data reveal clear sex-related characteristics, indicative of an unambiguous functional division between males and females during their interplay. Cluster analysis, in situ hybridization and RNAi assays indicate that males likely use biogenic amine neurotransmitters through the nervous system to control and maintain pairing with females. In addition, the analyses indicate that reproductive development of females involves an insect-like hormonal regulation. These data sets and analyses serve as a foundation for deeper study of sexual development in this pathogen and identification of novel anti-schistosomal interventions.
Biodiversity within the animal kingdom is associated with extensive molecular diversity. The expansion of genomic, transcriptomic and proteomic data sets for invertebrate groups and species with ...unique biological traits necessitates reliable in silico tools for the accurate identification and annotation of molecules and molecular groups. However, conventional tools are inadequate for lesser-known organismal groups, such as eukaryotic pathogens (parasites), so that improved approaches are urgently needed. Here, we established a combined sequence- and structure-based workflow system to harness well-curated publicly available data sets and resources to identify, classify and annotate proteases and protease inhibitors of a highly pathogenic parasitic roundworm (nematode) of global relevance, called
(barber's pole worm). This workflow performed markedly better than conventional, sequence-based classification and annotation alone and allowed the first genome-wide characterisation of protease and protease inhibitor genes and gene products in this worm. In total, we identified 790 genes encoding 860 proteases and protease inhibitors representing 83 gene families. The proteins inferred included 280 metallo-, 145 cysteine, 142 serine, 121 aspartic and 81 "mixed" proteases as well as 91 protease inhibitors, all of which had marked physicochemical diversity and inferred involvements in >400 biological processes or pathways. A detailed investigation revealed a remarkable expansion of some protease or inhibitor gene families, which are likely linked to parasitism (e.g., host-parasite interactions, immunomodulation and blood-feeding) and exhibit stage- or sex-specific transcription profiles. This investigation provides a solid foundation for detailed explorations of the structures and functions of proteases and protease inhibitors of
and related nematodes, and it could assist in the discovery of new drug or vaccine targets against infections or diseases.
Abstract
One important aspect of precision medicine aims to deliver the right medicine to the right patient at the right dose at the right time based on the unique ‘omics’ features of each individual ...patient, thus maximizing drug efficacy and minimizing adverse drug reactions. However, fragmentation and heterogeneity of available data makes it challenging to readily obtain first-hand information regarding some particular diseases, drugs, genes and variants of interest. Therefore, we developed the Precision Medicine Knowledgebase (PreMedKB) by seamlessly integrating the four fundamental components of precision medicine: diseases, genes, variants and drugs. PreMedKB allows for search of comprehensive information within each of the four components, the relationships between any two or more components, and importantly, the interpretation of the clinical meanings of a patient's genetic variants. PreMedKB is an efficient and user-friendly tool to assist researchers, clinicians or patients in interpreting a patient's genetic profile in terms of discovering potential pathogenic variants, recommending therapeutic regimens, designing panels for genetic testing kits, and matching patients for clinical trials. PreMedKB is freely accessible and available at http://www.fudan-pgx.org/premedkb/index.html#/home.
Recent genome-sequencing studies have revealed dozens of genes frequently mutated in esophageal squamous cell carcinoma, but few genes are associated with patients' clinical outcomes. Novel ...prognostic biomarkers are urgently needed in the clinic. We collected both somatic mutations and clinical information of 442 Chinese esophageal squamous cell carcinoma patients from four published studies. Survival analysis was performed to reveal the clinical significance of the mutated genes. Dysregulation of the mutated genes was observed from public gene-expression data sets and its effects on cell migration and invasion were investigated with siRNA-mediated silencing. Our integrated analysis revealed 26 genes significantly and frequently mutated in esophageal squamous cell carcinoma. Importantly, mutations in ZFHX4, SPHKAP, NRXN1, KIAA1109, DNAH5 and KCNH7 were associated with poor survival. In addition, ZFHX4 was overexpressed in tumor tissues compared to normal controls, and knockdown of ZFHX4 in vitro significantly inhibited cell migration and invasion. Mutations in ZFHX4 were strongly associated with poor prognosis and the down-regulation of ZFHX4 inhibits the progression of esophageal squamous cell carcinoma. Further investigation is warranted to confirm the prognostic values of ZFHX4 in a prospective study.
Reference genome selection is a prerequisite for successful analysis of next generation sequencing (NGS) data. Current practice employs one of the two most recent human reference genome versions: ...HG19 or HG38. To date, the impact of genome version on SNV identification has not been rigorously assessed.
We conducted analysis comparing the SNVs identified based on HG19 vs HG38, leveraging whole genome sequencing (WGS) data from the genome-in-a-bottle (GIAB) project. First, SNVs were called using 26 different bioinformatics pipelines with either HG19 or HG38. Next, two tools were used to convert the called SNVs between HG19 and HG38. Lastly we calculated conversion rates, analyzed discordant rates between SNVs called with HG19 or HG38, and characterized the discordant SNVs.
The conversion rates from HG38 to HG19 (average 95%) were lower than the conversion rates from HG19 to HG38 (average 99%). The conversion rates varied slightly among the various calling pipelines. Around 1.5% SNVs were discordantly converted between HG19 or HG38. The conversions from HG38 to HG19 had more SNVs which failed conversion and more discordant SNVs than the opposite conversion (HG19 to HG38). Most of the discordant SNVs had low read depth, were low confidence SNVs as defined by GIAB, and/or were predominated by G/C alleles (52% observed versus 42% expected).
A significant number of SNVs could not be converted between HG19 and HG38. Based on careful review of our comparisons, we recommend HG38 (the newer version) for NGS SNV analysis. To summarize, our findings suggest caution when translating identified SNVs between different versions of the human reference genome.
Genomic structural variations (SV) are important determinants of genotypic and phenotypic changes in many organisms. However, the detection of SV from next-generation sequencing data remains ...challenging.
In this study, DNA from a Chinese family quartet is sequenced at three different sequencing centers in triplicate. A total of 288 derivative data sets are generated utilizing different analysis pipelines and compared to identify sources of analytical variability. Mapping methods provide the major contribution to variability, followed by sequencing centers and replicates. Interestingly, SV supported by only one center or replicate often represent true positives with 47.02% and 45.44% overlapping the long-read SV call set, respectively. This is consistent with an overall higher false negative rate for SV calling in centers and replicates compared to mappers (15.72%). Finally, we observe that the SV calling variability also persists in a genotyping approach, indicating the impact of the underlying sequencing and preparation approaches.
This study provides the first detailed insights into the sources of variability in SV identification from next-generation sequencing and highlights remaining challenges in SV calling for large cohorts. We further give recommendations on how to reduce SV calling variability and the choice of alignment methodology.