Simulation of genomic sequences under the coalescent with recombination has conventionally been impractical for regions beyond tens of megabases. This work presents an algorithm, implemented as the ...program MaCS (Markovian Coalescent Simulator), that can efficiently simulate haplotypes under any arbitrary model of population history. We present several metrics comparing the performance of MaCS with other available simulation programs. Practical usage of MaCS is demonstrated through a comparison of measures of linkage disequilibrium between generated program output and real genotype data from populations considered to be structured.
An Analysis of Machine- and Human-Analytics in Classification Tam, Gary K. L.; Kothari, Vivek; Min Chen
IEEE transactions on visualization and computer graphics,
2017-Jan., 2017-01-00, 2017-1-00, 20170101, Letnik:
23, Številka:
1
Journal Article
Recenzirano
Odprti dostop
In this work, we present a study that traces the technical and cognitive processes in two visual analytics applications to a common theoretic model of soft knowledge that may be added into a visual ...analytics process for constructing a decision-tree model. Both case studies involved the development of classification models based on the "bag of features" approach. Both compared a visual analytics approach using parallel coordinates with a machine-learning approach using information theory. Both found that the visual analytics approach had some advantages over the machine learning approach, especially when sparse datasets were used as the ground truth. We examine various possible factors that may have contributed to such advantages, and collect empirical evidence for supporting the observation and reasoning of these factors. We propose an information-theoretic model as a common theoretic basis to explain the phenomena exhibited in these two case studies. Together we provide interconnected empirical and theoretical evidence to support the usefulness of visual analytics.
Innate and adaptive defense mechanisms protect the respiratory system from attack by microbes. Here, we present evidence that the bitter taste receptor T2R38 regulates the mucosal innate defense of ...the human upper airway. Utilizing immunofluorescent and live cell imaging techniques in polarized primary human sinonasal cells, we demonstrate that T2R38 is expressed in human upper respiratory epithelium and is activated in response to acyl-homoserine lactone quorum-sensing molecules secreted by Pseudomonas aeruginosa and other gram-negative bacteria. Receptor activation regulates calcium-dependent NO production, resulting in stimulation of mucociliary clearance and direct antibacterial effects. Moreover, common polymorphisms of the TAS2R38 gene were linked to significant differences in the ability of upper respiratory cells to clear and kill bacteria. Lastly, TAS2R38 genotype correlated with human sinonasal gram-negative bacterial infection. These data suggest that T2R38 is an upper airway sentinel in innate defense and that genetic variation contributes to individual differences in susceptibility to respiratory infection.
The primary goal in cluster analysis is to discover natural groupings of objects. The field of cluster analysis is crowded with diverse methods that make special assumptions about data and address ...different scientific aims. Despite its shortcomings in accuracy, hierarchical clustering is the dominant clustering method in bioinformatics. Biologists find the trees constructed by hierarchical clustering visually appealing and in tune with their evolutionary perspective. Hierarchical clustering operates on multiple scales simultaneously. This is essential, for instance, in transcriptome data, where one may be interested in making qualitative inferences about how lower-order relationships like gene modules lead to higher-order relationships like pathways or biological processes. The recently developed method of convex clustering preserves the visual appeal of hierarchical clustering while ameliorating its propensity to make false inferences in the presence of outliers and noise. The solution paths generated by convex clustering reveal relationships between clusters that are hidden by static methods such as k-means clustering. The current paper derives and tests a novel proximal distance algorithm for minimizing the objective function of convex clustering. The algorithm separates parameters, accommodates missing data, and supports prior information on relationships. Our program CONVEXCLUSTER incorporating the algorithm is implemented on ATI and nVidia graphics processing units (GPUs) for maximal speed. Several biological examples illustrate the strengths of convex clustering and the ability of the proximal distance algorithm to handle high-dimensional problems. CONVEXCLUSTER can be freely downloaded from the UCLA Human Genetics web site at http://www.genetics.ucla.edu/software/.
The challenges of successfully applying causal inference methods include: (i) satisfying underlying assumptions, (ii) limitations in data/models accommodated by the software and (iii) low power of ...common multiple testing approaches.
The causal inference test (CIT) is based on hypothesis testing rather than estimation, allowing the testable assumptions to be evaluated in the determination of statistical significance. A user-friendly software package provides P-values and optionally permutation-based FDR estimates (q-values) for potential mediators. It can handle single and multiple binary and continuous instrumental variables, binary or continuous outcome variables and adjustment covariates. Also, the permutation-based FDR option provides a non-parametric implementation.
Simulation studies demonstrate the validity of the cit package and show a substantial advantage of permutation-based FDR over other common multiple testing strategies.
The cit open-source R package is freely available from the CRAN website (https://cran.r-project.org/web/packages/cit/index.html) with embedded C ++ code that utilizes the GNU Scientific Library, also freely available (http://www.gnu.org/software/gsl/).
joshua.millstein@usc.edu
Supplementary data are available at Bioinformatics online.
Gastrointestinal (GI) cancer is a formidable malignancy with significant morbidity and mortality rates. Recent studies have shed light on the complex interplay between the nervous system and the GI ...system, influencing various aspects of GI tumorigenesis, such as the malignance of cancer cells, the conformation of tumor microenvironment (TME), and the resistance to chemotherapies. The discussion in this review first focused on exploring the intricate details of the biological function of the nervous system in the development of the GI tract and the progression of tumors within it. Meanwhile, the cancer cell-originated feedback regulation on the nervous system is revealed to play a crucial role in the growth and development of nerve cells within tumor tissues. This interaction is vital for understanding the complex relationship between the nervous system and GI oncogenesis. Additionally, the study identified various components within the TME that possess a significant influence on the occurrence and progression of GI cancer, including microbiota, immune cells, and fibroblasts. Moreover, we highlighted the transformation relationship between non-neuronal cells and neuronal cells during GI cancer progression, inspiring the development of strategies for nervous system-guided anti-tumor drugs. By further elucidating the deep mechanism of various neuroregulatory signals and neuronal intervention, we underlined the potential of these targeted drugs translating into effective therapies for GI cancer treatment. In summary, this review provides an overview of the mechanisms of neuromodulation and explores potential therapeutic opportunities, providing insights into the understanding and management of GI cancers.
Trapped Be+ ions are a leading platform for quantum information science (Gaebler et al 2016 Phys. Rev. Lett. 117 060505), but reactions with background gas species, such as H2 and H2O, result in ...qubit loss. Our experiment reveals that the BeOH+ ion is the final trapped ion species when both H2 and H2O exist in a vacuum system with cold, trapped Be+. The BeH+ product in the Be+ + H2 reaction further reacts with H2O to form BeOH+. To understand the loss mechanism, low-temperature reactions between sympathetically cooled BeD+ ions and H2O molecules have been investigated using an integrated, laser-cooled Be+ ion trap and high-resolution time-of-flight mass spectrometer (Schneider et al 2014 Phys. Rev. Appl. 2 034013). Among all the possible products, BeH2O+, H2DO+, BeOD+, and BeOH+, only the BeOH+ molecular ion was observed experimentally, with the assumed co-product of HD. Theoretical analyses based on explicitly correlated restricted coupled cluster singles, doubles, and perturbative triples (RCCSD(T)-F12) method with the augmented correlation-consistent polarized triple zeta (AVTZ) basis set reveal that two intuitive direct abstraction product channels, Be + H2DO+ and D + BeH2O+, are not energetically accessible at the present reaction temperature (∼150 K). Instead, a double displacement BeOH+ + HD product channel is accessible due to a large exothermicity of 1.885 eV through a submerged barrier in the reaction pathway. While the BeOD+ + H2 product channel has a similar exothermicity, the reaction pathway is dynamically unfavourable, as suggested by a sudden vector projection analysis. This work sheds light on the origin of the loss and contaminations of the laser-cooled Be+ ions in quantum-information experiments.
Activation of oncogenes by mechanisms other than genetic aberrations such as mutations, translocations, or amplifications is largely undefined. Here we report a novel isoform of the anaplastic ...lymphoma kinase (ALK) that is expressed in ∼11% of melanomas and sporadically in other human cancer types, but not in normal tissues. The novel ALK transcript initiates from a de novo alternative transcription initiation (ATI) site in ALK intron 19, and was termed ALK(ATI). In ALK(ATI)-expressing tumours, the ATI site is enriched for H3K4me3 and RNA polymerase II, chromatin marks characteristic of active transcription initiation sites. ALK(ATI) is expressed from both ALK alleles, and no recurrent genetic aberrations are found at the ALK locus, indicating that the transcriptional activation is independent of genetic aberrations at the ALK locus. The ALK(ATI) transcript encodes three proteins with molecular weights of 61.1, 60.8 and 58.7 kilodaltons, consisting primarily of the intracellular tyrosine kinase domain. ALK(ATI) stimulates multiple oncogenic signalling pathways, drives growth-factor-independent cell proliferation in vitro, and promotes tumorigenesis in vivo in mouse models. ALK inhibitors can suppress the kinase activity of ALK(ATI), suggesting that patients with ALK(ATI)-expressing tumours may benefit from ALK inhibitors. Our findings suggest a novel mechanism of oncogene activation in cancer through de novo alternative transcription initiation.
More than 50% of patients with chondrosarcomas exhibit gain-of-function mutations in either isocitrate dehydrogenase 1 (IDH1) or IDH2. In this study, we performed genome-wide CpG methylation ...sequencing of chondrosarcoma biopsies and found that IDH mutations were associated with DNA hypermethylation at CpG islands but not other genomic regions. Regions of CpG island hypermethylation were enriched for genes implicated in stem cell maintenance/differentiation and lineage specification. In murine 10T1/2 mesenchymal progenitor cells, expression of mutant IDH2 led to DNA hypermethylation and an impairment in differentiation that could be reversed by treatment with DNA-hypomethylating agents. Introduction of mutant IDH2 also induced loss of contact inhibition and generated undifferentiated sarcomas in vivo. The oncogenic potential of mutant IDH2 correlated with the ability to produce 2-hydroxyglutarate. Together, these data demonstrate that neomorphic IDH2 mutations can be oncogenic in mesenchymal cells.