Sporadic angiosarcomas are aggressive vascular sarcomas whose rarity and genomic complexity present significant obstacles in deciphering the pathogenic significance of individual genetic alterations. ...Numerous fusion genes have been identified across multiple types of cancers, but their existence and significance remain unclear in sporadic angiosarcomas. In this study, we leveraged RNA-sequencing data from 13 human angiosarcomas and 76 spontaneous canine hemangiosarcomas to identify fusion genes associated with spontaneous vascular malignancies. Ten novel protein-coding fusion genes, including
and
, were identified in seven of the 13 human tumors, with two tumors showing mutations of
.
and
mutations were found in angiosarcomas without fusions or
mutations. We found 15 novel protein-coding fusion genes including
, and
in 11 of the 76 canine hemangiosarcomas; these fusion genes were seen exclusively in tumors of the angiogenic molecular subtype that contained recurrent mutations in
, and
. In particular, fusion genes and mutations of
cooccurred in tumors with higher frequency than expected by random chance, and they enriched gene signatures predicting activation of angiogenic pathways. Comparative transcriptomic analysis of human angiosarcomas and canine hemangiosarcomas identified shared molecular signatures associated with activation of PI3K/AKT/mTOR pathways. Our data suggest that genome instability induced by
mutations might create a predisposition for fusion events that may contribute to tumor progression by promoting selection and/or enhancing fitness through activation of convergent angiogenic pathways in this vascular malignancy. IMPLICATIONS: This study shows that, while drive events of malignant vasoformative tumors of humans and dogs include diverse mutations and stochastic rearrangements that create novel fusion genes, convergent transcriptional programs govern the highly conserved morphologic organization and biological behavior of these tumors in both species.
Quantifying the uncertainty of quantities of interest (QoIs) from physical systems is a primary objective in model validation. However, achieving this goal entails balancing the need for ...computational efficiency with the requirement for numerical accuracy. To address this trade-off, we propose a novel bi-fidelity formulation of variational auto-encoders (BF-VAE) designed to estimate the uncertainty associated with a QoI from low-fidelity (LF) and high-fidelity (HF) samples of the QoI. This model allows for the approximation of the statistics of the HF QoI by leveraging information derived from its LF counterpart. Specifically, we design a bi-fidelity auto-regressive model in the latent space which is integrated within the VAE’s probabilistic encoder–decoder structure. An effective algorithm is proposed to maximize the variational lower bound of the HF log-likelihood in the presence of limited HF data, resulting in the synthesis of HF realizations with a reduced computational cost. Additionally, we introduce the concept of the bi-fidelity information bottleneck (BF-IB) to provide an information-theoretic interpretation of the proposed BF-VAE model. Our numerical results demonstrate that the BF-VAE leads to considerably improved accuracy, as compared to a VAE trained using only HF data, when limited HF data is available.
•Introduce a bi-fidelity variational autoencoder model with a theoretically interpreted training objective criterion.•Design a novel algorithm to train the proposed model in the presence of scarce high-fidelity data.•Extend the information bottleneck theory to provide an interpretation of the proposed model.
The potential risk of fungal pathogen infection in agriculture has great adverse effects on agricultural development and human health. In this study, we explored the application of agarwood essential ...oil (AEO) and its main components in the development of agricultural fungicides. The compositions of AEO produced in Maoming City of Guangdong Province, China was firstly analysed by GC–MS. AEO and compounds thereof including carvacrol and longifolene were then used to perform a series of antifungal activity evaluations in vitro and in vivo against agricultural and foodborne pathogens. The in vitro results showed that the antifungal activities of AEO were comparable to those of hymexazol (a positive control), and carvacrol exhibited stronger efficacy. In vivo studies also showed that AEO and carvacrol could significantly inhibit the damage effect of Curvularia mebaldsii on the germination rate of wheat roots. Further, it was found that the germination rate exhibited a downward trend with the dose increase of AEO and carvacrol.
Abstract
Angiosarcomas are soft-tissue sarcomas that form malignant vascular tissues. Angiosarcomas are very rare, and due to their aggressive behavior and high metastatic propensity, they have poor ...clinical outcomes. Hemangiosarcomas commonly occur in domestic dogs, and share pathological and clinical features with human angiosarcomas. Typical pathognomonic features of this tumor are irregular vascular channels that are filled with blood and are lined by a mixture of malignant and nonmalignant endothelial cells. The current gold standard is the histological diagnosis of angiosarcoma; however, microscopic evaluation may be complicated, particularly when tumor cells are undetectable due to the presence of excessive amounts of nontumor cells or when tissue specimens have insufficient tumor content. In this study, we implemented machine learning applications from next-generation transcriptomic data of canine hemangiosarcoma tumor samples (n = 76) and nonmalignant tissues (n = 10) to evaluate their training performance for diagnostic utility. The 10-fold cross-validation test and multiple feature selection methods were applied. We found that extra trees and random forest learning models were the best classifiers for hemangiosarcoma in our testing datasets. We also identified novel gene signatures using the mutual information and Monte Carlo feature selection method. The extra trees model revealed high classification accuracy for hemangiosarcoma in validation sets. We demonstrate that high-throughput sequencing data of canine hemangiosarcoma are trainable for machine learning applications. Furthermore, our approach enables us to identify novel gene signatures as reliable determinants of hemangiosarcoma, providing significant insights into the development of potential applications for this vascular malignancy.
Bayesian optimization is a widely used technique for optimizing black-box functions, with Expected Improvement (EI) being the most commonly utilized acquisition function in this domain. While EI is ...often viewed as distinct from other information-theoretic acquisition functions, such as entropy search (ES) and max-value entropy search (MES), our work reveals that EI can be considered a special case of MES when approached through variational inference (VI). In this context, we have developed the Variational Entropy Search (VES) methodology and the VES-Gamma algorithm, which adapts EI by incorporating principles from information-theoretic concepts. The efficacy of VES-Gamma is demonstrated across a variety of test functions and read datasets, highlighting its theoretical and practical utilities in Bayesian optimization scenarios.
Hemangiosarcoma and angiosarcoma are soft-tissue sarcomas of blood vessel-forming cells in dogs and humans, respectively. These vasoformative sarcomas are aggressive and highly metastatic, with ...disorganized, irregular blood-filled vascular spaces. Our objective was to define molecular programs which support the niche that enables progression of canine hemangiosarcoma and human angiosarcoma. Dog-in-mouse hemangiosarcoma xenografts recapitulated the vasoformative and highly angiogenic morphology and molecular characteristics of primary tumors. Blood vessels in the tumors were complex and disorganized, and they were lined by both donor and host cells. In a series of xenografts, we observed that the transplanted hemangiosarcoma cells created exuberant myeloid hyperplasia and gave rise to lymphoproliferative tumors of mouse origin. Our functional analyses indicate that hemangiosarcoma cells generate a microenvironment that supports expansion and differentiation of hematopoietic progenitor populations. Furthermore, gene expression profiling data revealed hemangiosarcoma cells expressed a repertoire of hematopoietic cytokines capable of regulating the surrounding stromal cells. We conclude that canine hemangiosarcomas, and possibly human angiosarcomas, maintain molecular properties that provide hematopoietic support and facilitate stromal reactions, suggesting their potential involvement in promoting the growth of hematopoietic tumors.
Quantifying the uncertainty of quantities of interest (QoIs) from physical systems is a primary objective in model validation. However, achieving this goal entails balancing the need for ...computational efficiency with the requirement for numerical accuracy. To address this trade-off, we propose a novel bi-fidelity formulation of variational auto-encoders (BF-VAE) designed to estimate the uncertainty associated with a QoI from low-fidelity (LF) and high-fidelity (HF) samples of the QoI. This model allows for the approximation of the statistics of the HF QoI by leveraging information derived from its LF counterpart. Specifically, we design a bi-fidelity auto-regressive model in the latent space that is integrated within the VAE's probabilistic encoder-decoder structure. An effective algorithm is proposed to maximize the variational lower bound of the HF log-likelihood in the presence of limited HF data, resulting in the synthesis of HF realizations with a reduced computational cost. Additionally, we introduce the concept of the bi-fidelity information bottleneck (BF-IB) to provide an information-theoretic interpretation of the proposed BF-VAE model. Our numerical results demonstrate that BF-VAE leads to considerably improved accuracy, as compared to a VAE trained using only HF data, when limited HF data is available.
Least squares regression is a ubiquitous tool for building emulators (a.k.a. surrogate models) of problems across science and engineering for purposes such as design space exploration and uncertainty ...quantification. When the regression data are generated using an experimental design process (e.g., a quadrature grid) involving computationally expensive models, or when the data size is large, sketching techniques have shown promise to reduce the cost of the construction of the regression model while ensuring accuracy comparable to that of the full data. However, random sketching strategies, such as those based on leverage scores, lead to regression errors that are random and may exhibit large variability. To mitigate this issue, we present a novel boosting approach that leverages cheaper, lower-fidelity data of the problem at hand to identify the best sketch among a set of candidate sketches. This in turn specifies the sketch of the intended high-fidelity model and the associated data. We provide theoretical analyses of this bi-fidelity boosting (BFB) approach and discuss the conditions the low- and high-fidelity data must satisfy for a successful boosting. In doing so, we derive a bound on the residual norm of the BFB sketched solution relating it to its ideal, but computationally expensive, high-fidelity boosted counterpart. Empirical results on both manufactured and PDE data corroborate the theoretical analyses and illustrate the efficacy of the BFB solution in reducing the regression error, as compared to the non-boosted solution.
Approximate solutions to large least squares problems can be computed efficiently using leverage score-based row-sketches, but directly computing the leverage scores, or sampling according to them ...with naive methods, still requires an expensive manipulation and processing of the design matrix. In this paper we develop efficient leverage score-based sampling methods for matrices with certain Kronecker product-type structure; in particular we consider matrices that are monotone lower column subsets of Kronecker product matrices. Our discussion is general, encompassing least squares problems on infinite domains, in which case matrices formally have infinitely many rows. We briefly survey leverage score-based sampling guarantees from the numerical linear algebra and approximation theory communities, and follow this with efficient algorithms for sampling when the design matrix has Kronecker-type structure. Our numerical examples confirm that sketches based on exact leverage score sampling for our class of structured matrices achieve superior residual compared to approximate leverage score sampling methods.