Abstract
ATTED-II (http://atted.jp) is a coexpression database for plant species to aid in the discovery of relationships of unknown genes within a species. As an advanced coexpression analysis ...method, multispecies comparisons have the potential to detect alterations in gene relationships within an evolutionary context. However, determining the validity of comparative coexpression studies is difficult without quantitative assessments of the quality of coexpression data. ATTED-II (version 9) provides 16 coexpression platforms for nine plant species, including seven species supported by both microarray- and RNA sequencing (RNAseq)-based coexpression data. Two independent sources of coexpression data enable the assessment of the reproducibility of coexpression. The latest coexpression data for Arabidopsis (Ath-m.c7-1 and Ath-r.c3-0) showed the highest reproducibility (Jaccard coefficient = 0.13) among previous coexpression data in ATTED-II. We also investigated the statistical basis of the mutual rank (MR) index as a coexpression measure by bootstrap sampling of experimental units. We found that the error distribution of the logit-transformed MR index showed normality with equal variances for each coexpression platform. Because the MR error was strongly correlated with the number of samples for the coexpression data, typical confidence intervals for the MR index can be estimated for any coexpression platform. These new, high-quality coexpression data can be analyzed with any tool in ATTED-II and combined with external resources to obtain insight into plant biology.
Full text
Available for:
IZUM, KILJ, NUK, PILJ, PNG, SAZU, UL, UM, UPUK
Abstract
The advent of RNA-sequencing and microarray technologies has led to rapid growth of transcriptome data generated for a wide range of organisms, under various cellular, organ and individual ...conditions. Since the number of possible combinations of intercellular and extracellular conditions is almost unlimited, cataloging all transcriptome conditions would be an immeasurable challenge. Gene coexpression refers to the similarity of gene expression patterns under various conditions, such as disease states, tissue types, and developmental stages. Since the quality of gene coexpression data depends on the quality and quantity of transcriptome data, timely usage of the growing data is key to promoting individual research in molecular biology. COXPRESdb (http://coxpresdb.jp) is a database providing coexpression information for 11 animal species. One characteristic feature of COXPRESdb is its ability to compare multiple coexpression data derived from different transcriptomics technologies and different species, which strongly reduces false positive relationships in individual gene coexpression data. Here, we summarized the current version of this database, including 23 coexpression platforms with the highest-level quality till date. Using various functionalities in COXPRESdb, the new coexpression data would support a broader area of research from molecular biology to medical sciences.
Motivation: Intrinsically disordered regions in proteins have no unique stable structures without their partner molecules, thus these regions sometimes prevent high-quality structure determination. ...Furthermore, proteins with disordered regions are often involved in important biological processes, and the disordered regions are considered to play important roles in molecular interactions. Therefore, identifying disordered regions is important to obtain high-resolution structural information and to understand the functional aspects of these proteins. Results: We developed a new prediction method for disordered regions in proteins based on the meta approach and implemented a web-server for this prediction method named ‘metaPrDOS’. The method predicts the disorder tendency of each residue using support vector machines from the prediction results of the seven independent predictors. Evaluation of the meta approach was performed using the CASP7 prediction targets to avoid an overestimation due to the inclusion of proteins used in the training set of some component predictors. As a result, the meta approach achieved higher prediction accuracy than all methods participating in CASP7. Availability: http://prdos.hgc.jp/meta/ Contact: t-ishida@hgc.jp
Abstract
ATTED-II (https://atted.jp) is a gene coexpression database for nine plant species based on publicly available RNAseq and microarray data. One of the challenges in constructing ...condition-independent coexpression data based on publicly available gene expression data is managing the inherent sampling bias. Here, we report ATTED-II version 11, wherein we adopted a coexpression calculation methodology to balance the samples using principal component analysis and ensemble calculation. This approach has two advantages. First, omitting principal components with low contribution rates reduces the main contributors of noise. Second, balancing large differences in contribution rates enables considering various sample conditions entirely. In addition, based on RNAseq- and microarray-based coexpression data, we provide species-representative, integrated coexpression information to enhance the efficiency of interspecies comparison of the coexpression data. These coexpression data are provided as a standardized z-score to facilitate integrated analysis with different data sources. We believe that with these improvements, ATTED-II is more valuable and powerful for supporting interspecies comparative studies and integrated analyses using heterogeneous data.
Full text
Available for:
IZUM, KILJ, NUK, PILJ, PNG, SAZU, UL, UM, UPUK
The identification of functional modules from protein-protein interaction (PPI) networks is an important step toward understanding the biological features of PPI networks. The detection of functional ...modules in PPI networks is often performed by identifying internally densely connected subnetworks, and often produces modules with "core" and "peripheral" proteins. The core proteins are the ones having dense connections to each other in a module. The difference between core and peripheral proteins is important to understand the functional roles of proteins in modules, but there are few methods to explicitly elucidate the internal structure of functional modules at gene level.
We propose NCMine, which is a novel network clustering method and visualization tool for the core-peripheral structure of functional modules. It extracts near-complete subgraphs from networks based on a node-weighting scheme using degree centrality, and reports subgroups as functional modules. We implemented this method as a plugin of Cytoscape, which is widely used to visualize and analyze biological networks. The plugin allows users to extract functional modules from PPI networks and interactively filter modules of interest. We applied the method to human PPI networks, and found several examples with the core-peripheral structure of modules that may be related to cancer development.
The Cytoscape plugin and tutorial are available at Cytoscape AppStore. (http://apps.cytoscape.org/apps/ncmine).
kengo@ecei.tohoku.ac.jpSupplementary information: Supplementary data are available at Bioinformatics online.
Abstract
NRF2 is a transcription activator that plays a key role in cytoprotection against oxidative stress. Although increased NRF2 activity is principally beneficial for our health, NRF2 activation ...in cancer cells is detrimental, as it drives their malignant progression. We previously found that CCAAT/enhancer-binding protein B (CEBPB) cooperates with NRF2 in NRF2-activated lung cancer and enhances tumour-initiating activity by promoting NOTCH3 expression. However, the general contribution of CEBPB in lung cancer is rather controversial, probably because the role of CEBPB depends on cooperating transcription factors in each cellular context. To understand how NRF2 shapes the function of CEBPB in NRF2-activated lung cancers and its biological consequence, we comprehensively explored NRF2-CEBPB–coregulated genes and found that genes involved in drug metabolism and detoxification were characteristically enriched. Indeed, CEBPB and NRF2 cooperatively contribute to the drug resistance. We also found that CEBPB is directly regulated by NRF2, which is likely to be advantageous for the coexpression and cooperative function of NRF2 and CEBPB. These results suggest that drug resistance of NRF2-activated lung cancers is achieved by the cooperative function of NRF2 and CEBPB.
Graphical Abstract
Graphical Abstract
Full text
Available for:
IZUM, KILJ, NUK, PILJ, PNG, SAZU, UL, UM, UPUK
Accumulating evidence has revealed unexpected phenotypic heterogeneity and diverse functions of neutrophils in several diseases. Coronavirus disease (COVID-19) can alter the leukocyte phenotype based ...on disease severity, including neutrophil activation in severe cases. However, the plasticity of neutrophil phenotypes and their relative impact on COVID-19 pathogenesis has not been well addressed. This study aimed to identify and validate the heterogeneity of neutrophils in COVID-19 and evaluate the functions of each subpopulation. We analyzed public single-cell RNA-seq, bulk RNA-seq, and proteome data from healthy donors and patients with COVID-19 to investigate neutrophil subpopulations and their response to disease pathogenesis. We identified eight neutrophil subtypes: pro-neutrophil, pre-neutrophil, immature neutrophil, and five mature neutrophil subpopulations. The subtypes exhibited distinct features, including diverse activation signatures and multiple enriched pathways. The pro-neutrophil subtype was associated with severe and fatal disease, while the pre-neutrophil subtype was particularly abundant in mild/moderate disease. One of the mature neutrophil subtypes showed consistently large fractions in patients with different disease severity. Bulk RNA-seq dataset analyses using a cellular deconvolution approach validated the relative abundances of neutrophil subtypes and the expansion of pro-neutrophils in severe COVID-19 patients. Cell-cell communication analysis revealed representative ligand-receptor interactions among the identified neutrophil subtypes. Further investigation into transcription factors and differential protein abundance revealed the regulatory network differences between healthy donors and patients with severe COVID-19. Overall, we demonstrated the complex interactions among heterogeneous neutrophil subtypes and other blood cell types during COVID-19 disease. Our work has great value in terms of both clinical and public health as it furthers our understanding of the phenotypic and functional heterogeneity of neutrophils and other cell populations in multiple diseases.
Full text
Available for:
IZUM, KILJ, NUK, PILJ, PNG, SAZU, UL, UM, UPUK
We report a family with progressive myoclonic epilepsy who underwent whole-exome sequencing but was negative for pathogenic variants. Similar clinical courses of a devastating neurodegenerative ...phenotype of two affected siblings were highly suggestive of a genetic etiology, which indicates that the survey of genetic variation by whole-exome sequencing was not comprehensive. To investigate the presence of a variant that remained unrecognized by standard genetic testing, PacBio long-read sequencing was performed. Structural variant (SV) detection using low-coverage (6×) whole-genome sequencing called 17,165 SVs (7,216 deletions and 9,949 insertions). Our SV selection narrowed down potential candidates to only five SVs (two deletions and three insertions) on the genes tagged with autosomal recessive phenotypes. Among them, a 12.4-kb deletion involving the CLN6 gene was the top candidate because its homozygous abnormalities cause neuronal ceroid lipofuscinosis. This deletion included the initiation codon and was found in a GC-rich region containing multiple repetitive elements. These results indicate the presence of a causal variant in a difficult-to-sequence region and suggest that such variants that remain enigmatic after the application of current whole-exome sequencing technology could be uncovered by unbiased application of long-read whole-genome sequencing.
Full text
Available for:
EMUNI, FIS, FZAB, GEOZS, GIS, IJS, IMTLJ, KILJ, KISLJ, MFDPS, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, SBMB, SBNM, UKNU, UL, UM, UPUK, VKSCE, ZAGLJ
Transcriptional dysregulation, which can be caused by genetic and epigenetic alterations, is a fundamental feature of many cancers. A key cytoprotective transcriptional activator, NRF2, is often ...aberrantly activated in non-small cell lung cancers (NSCLCs) and supports both aggressive tumorigenesis and therapeutic resistance. Herein, we find that persistently activated NRF2 in NSCLCs generates enhancers at gene loci that are not normally regulated by transiently activated NRF2 under physiological conditions. Elevated accumulation of CEBPB in NRF2-activated NSCLCs is found to be one of the prerequisites for establishment of the unique NRF2-dependent enhancers, among which the NOTCH3 enhancer is shown to be critical for promotion of tumor-initiating activity. Enhancer remodeling mediated by NRF2-CEBPB cooperativity promotes tumor-initiating activity and drives malignancy of NRF2-activated NSCLCs via establishment of the NRF2-NOTCH3 regulatory axis.