Genetic prediction of complex traits has great promise for disease prevention, monitoring, and treatment. The development of accurate risk prediction models is hindered by the wide diversity of ...genetic architecture across different traits, limited access to individual level data for training and parameter tuning, and the demand for computational resources. To overcome the limitations of the most existing methods that make explicit assumptions on the underlying genetic architecture and need a separate validation data set for parameter tuning, we develop a summary statistics-based nonparametric method that does not rely on validation datasets to tune parameters. In our implementation, we refine the commonly used likelihood assumption to deal with the discrepancy between summary statistics and external reference panel. We also leverage the block structure of the reference linkage disequilibrium matrix for implementation of a parallel algorithm. Through simulations and applications to twelve traits, we show that our method is adaptive to different genetic architectures, statistically robust, and computationally efficient. Our method is available at
Hit selection and lead generation are crucial for the success of the resource-demanding lead-optimization phase in drug discovery, and represent a major research area of medicinal chemistry today. ...Ligand-binding efficiency, ligand complexity, ligand–target profile complementarity and chemical tractability are important parameters in hit selection. As synthesis and assay throughput improve, a large number of analogs based on the same scaffold can be rapidly synthesized and tested. Consequently, more chemistry resources could be devoted to scaffold modifications to expand the candidate pool in lead generation. Most recently discovered druggable targets are promiscuous toward lipophilic ligands, and the hydrophobic portions of hit compounds should be preferentially modified in analog and scaffold design.
This paper firstly introduces common wearable sensors, smart wearable devices and the key application areas. Since multi-sensor is defined by the presence of more than one model or channel, e.g. ...visual, audio, environmental and physiological signals. Hence, the fusion methods of multi-modality and multi-location sensors are proposed. Despite it has been contributed several works reviewing the stateoftheart on information fusion or deep learning, all of them only tackled one aspect of the sensor fusion applications, which leads to a lack of comprehensive understanding about it. Therefore, we propose using a more holistic approach in order to provide a more suitable starting point from which to develop a full understanding of the fusion methods of wearable sensors. Specifically, this review attempts to provide a more comprehensive survey of the most important aspects of multi-sensor applications for human activity recognition, including those recently added to the field for unsupervised learning and transfer learning. Finally, the open research issues that need further research and improvement are identified and discussed.
•Machine learning combines heterogeneous features into multi-sensor information fusion.•Deep learning algorithms have been proposed for automatic feature representation.•Transfer learning and domain adaptation keep the generalization performance up.•Hyper-parameters tunning has been used for providing effective fusion strategies.•Encryption and authentication approaches are necessary to protect privacy of users.
Autophagy acts as a cellular surveillance mechanism to combat invading pathogens. Viruses have evolved various strategies to block autophagy and even subvert it for their replication and release. ...Here, we demonstrated that ORF3a of the COVID-19 virus SARS-CoV-2 inhibits autophagy activity by blocking fusion of autophagosomes/amphisomes with lysosomes. The late endosome-localized ORF3a directly interacts with and sequestrates the homotypic fusion and protein sorting (HOPS) component VPS39, thereby preventing HOPS complex from interacting with the autophagosomal SNARE protein STX17. This blocks assembly of the STX17-SNAP29-VAMP8 SNARE complex, which mediates autophagosome/amphisome fusion with lysosomes. Expression of ORF3a also damages lysosomes and impairs their function. SARS-CoV-2 virus infection blocks autophagy, resulting in accumulation of autophagosomes/amphisomes, and causes late endosomal sequestration of VPS39. Surprisingly, ORF3a from the SARS virus SARS-CoV fails to interact with HOPS or block autophagy. Our study reveals a mechanism by which SARS-CoV-2 evades lysosomal destruction and provides insights for developing new strategies to treat COVID-19.
Display omitted
•SARS-CoV-2 virus infection or expression of ORF3a blocks formation of autolysosomes•SARS-CoV-2 ORF3a sequestrates the HOPS component VPS39 on late endosomes•SARS-CoV-2 ORF3a impairs the assembly of the STX17-SNAP29-VAMP8 SNARE complex•SARS virus ORF3a fails to interact with VPS39 or affect autophagy activity
Miao et al. demonstrate that late endosome-localized ORF3a of the COVID-19 virus SARS-CoV-2 sequestrates the HOPS component VPS39. ORF3a blocks autophagosome/amphisome fusion with lysosomes by preventing the assembly of the STX17-SNAP29-VAMP8 SNARE complex. SARS-CoV-2-infected cells also exhibit a defect in autophagosome maturation and sequestration of VPS39 on late endosomes.
Direct analysis of microbial communities in the environment and human body has become more convenient and reliable owing to the advancements of high-throughput sequencing techniques for 16S rRNA gene ...profiling. Inferring the correlation relationship among members of microbial communities is of fundamental importance for genomic survey study. Traditional Pearson correlation analysis treating the observed data as absolute abundances of the microbes may lead to spurious results because the data only represent relative abundances. Special care and appropriate methods are required prior to correlation analysis for these compositional data.
In this article, we first discuss the correlation definition of latent variables for compositional data. We then propose a novel method called CCLasso based on least squares with Formula: see text penalty to infer the correlation network for latent variables of compositional data from metagenomic data. An effective alternating direction algorithm from augmented Lagrangian method is used to solve the optimization problem. The simulation results show that CCLasso outperforms existing methods, e.g. SparCC, in edge recovery for compositional data. It also compares well with SparCC in estimating correlation network of microbe species from the Human Microbiome Project.
CCLasso is open source and freely available from https://github.com/huayingfang/CCLasso under GNU LGPL v3.
dengmh@pku.edu.cn
Supplementary data are available at Bioinformatics online.
Zero velocity updates (ZUPT) is an effective way for the foot-mounted inertial pedestrian navigation systems. For the ZUPT technique to work properly, it is necessary to correctly detect the stance ...phase of each gait cycle. An adaptive stance-phase detection method is proposed based solely on an inertial sensor, which deals with the measurement fluctuations in swing and stance phases differently, and applies a clustering algorithm to partition the potential gait phases into true and false clusters, thereby yielding a time threshold to eliminate the false gait phases. The roles of the detection parameters and the relationship between them are analyzed to offer some suggestions for parameter tuning. Detection performance is evaluated with multisubject experimental data collected at varying walking speeds. The evaluation results show that the proposed detection method performs well in the presence of measurement fluctuations, which can make the detection of stance phases more robust and the choice of detection parameters more flexible.
Low-rank modeling generally refers to a class of methods that solves problems by representing variables of interest as low-rank matrices. It has achieved great success in various fields including ...computer vision, data mining, signal processing, and bioinformatics. Recently, much progress has been made in theories, algorithms, and applications of low-rank modeling, such as exact low-rank matrix recovery via convex programming and matrix completion applied to collaborative filtering. These advances have brought more and more attention to this topic. In this article, we review the recent advances of low-rank modeling, the state-of-the-art algorithms, and the related applications in image analysis. We first give an overview of the concept of low-rank modeling and the challenging problems in this area. Then, we summarize the models and algorithms for low-rank matrix recovery and illustrate their advantages and limitations with numerical experiments. Next, we introduce a few applications of low-rank modeling in the context of image analysis. Finally, we conclude this article with some discussions.
Advances in single-cell RNA sequencing (scRNA-seq) have led to successes in discovering novel cell types and understanding cellular heterogeneity among complex cell populations through cluster ...analysis. However, cluster analysis is not able to reveal continuous spectrum of states and underlying gene expression programs (GEPs) shared across cell types. We introduce scAAnet, an autoencoder for single-cell non-linear archetypal analysis, to identify GEPs and infer the relative activity of each GEP across cells. We use a count distribution-based loss term to account for the sparsity and overdispersion of the raw count data and add an archetypal constraint to the loss function of scAAnet. We first show that scAAnet outperforms existing methods for archetypal analysis across different metrics through simulations. We then demonstrate the ability of scAAnet to extract biologically meaningful GEPs using publicly available scRNA-seq datasets including a pancreatic islet dataset, a lung idiopathic pulmonary fibrosis dataset and a prefrontal cortex dataset.
Wearable devices have been increasingly used in research to provide continuous physical activity monitoring, but how to effectively extract features remains challenging for researchers. To analyze ...the generated actigraphy data in large-scale population studies, we developed computationally efficient methods to derive sleep and activity features through a Hidden Markov Model-based sleep/wake identification algorithm, and circadian rhythm features through a Penalized Multi-band Learning approach adapted from machine learning. Unsupervised feature extraction is useful when labeled data are unavailable, especially in large-scale population studies. We applied these two methods to the UK Biobank wearable device data and used the derived sleep and circadian features as phenotypes in genome-wide association studies. We identified 53 genetic loci with p<5×10-8 including genes known to be associated with sleep disorders and circadian rhythms as well as novel loci associated with Body Mass Index, mental diseases and neurological disorders, which suggest shared genetic factors of sleep and circadian rhythms with physical and mental health. Further cross-tissue enrichment analysis highlights the important role of the central nervous system and the shared genetic architecture with metabolism-related traits and the metabolic system. Our study demonstrates the effectiveness of our unsupervised methods for wearable device data when additional training data cannot be easily acquired, and our study further expands the application of wearable devices in population studies and genetic studies to provide novel biological insights.