Drug discovery and development are long and financially taxing processes. On an average it takes 12–15 years and costs 1.2 billion USD for successful drug discovery and approval for clinical use. ...Many lead molecules are not developed further and their potential is not tapped to the fullest due to lack of resources or time constraints. In order for a drug to be approved by FDA for clinical use, it must have excellent therapeutic potential in the desired area of target with minimal toxicities as supported by both pre-clinical and clinical studies. The targeted clinical evaluations fail to explore other potential therapeutic applications of the candidate drug. Drug repurposing or repositioning is a fast and relatively cheap alternative to the lengthy and expensive de novo drug discovery and development. Drug repositioning utilizes the already available clinical trials data for toxicity and adverse effects, at the same time explores the drug's therapeutic potential for a different disease. This review addresses recent developments and future scope of drug repositioning strategy.
Display omitted
•Repurposed drugs categorized based on biological activity.•Various approaches to drug repurposing.•Various resources for drug repurposing.•Drug repurposing in various diseases latest information.
Accurate and efficient cell grouping is essential for analyzing single-cell transcriptome sequencing (scRNA-seq) data. However, the existing clustering techniques often struggle to provide timely and ...accurate cell type groupings when dealing with datasets with large-scale or imbalanced cell types. Therefore, there is a need for improved methods that can handle the increasing size of scRNA-seq datasets while maintaining high accuracy and efficiency.
We propose CDSKNN
(Community Detection based on a Stable K-Nearest Neighbor Graph Structure), a novel single-cell clustering framework integrating partition clustering algorithm and community detection algorithm, which achieves accurate and fast cell type grouping by finding a stable graph structure.
We evaluated the effectiveness of our approach by analyzing 15 tissues from the human fetal atlas. Compared to existing methods, CDSKNN effectively counteracts the high imbalance in single-cell data, enabling effective clustering. Furthermore, we conducted comparisons across multiple single-cell datasets from different studies and sequencing techniques. CDSKNN is of high applicability and robustness, and capable of balancing the complexities of across diverse types of data. Most importantly, CDSKNN exhibits higher operational efficiency on datasets at the million-cell scale, requiring an average of only 6.33 min for clustering 1.46 million single cells, saving 33.3% to 99% of running time compared to those of existing methods.
The CDSKNN is a flexible, resilient, and promising clustering tool that is particularly suitable for clustering imbalanced data and demonstrates high efficiency on large-scale scRNA-seq datasets.
•This study provides new insight of the environmental governance on community levels.•Higher cadmium content of soil affect the sustainability of community.•Community characteristics with cluster ...analysis help identify relevant recommendations.•Hierarchical control strategies are recommended to improve rural sanitation.
Equitable access to sanitation for all is a goal of sustainable development, notably in rural areas. However, studies including economic, socio-cultural, and other factors that have been conducted to comprehensively assess rural sanitation at the community level in developing countries are limited. This study aimed to investigate the current state of rural environmental sanitation on the community level and to evaluate the characteristics of the diverse sub-clusters. A multidimensional environmental sanitation survey was conducted on 400 communities in Chongqing, China in 2020, and the priority options for improving sub-clusters’ sanitation were explored using cluster analysis. Among all communities, more than 60 % had positive village appearance, 50.50 % had domestic sewage treatment, and the coverage rate of household sanitary toilets was 72.28 %. The average content for lead, cadmium (Cd), and chromium in soil was 26.01, 0.53, and 54.47 mg/kg, respectively. The communities were clustered into 3 groups (I, II, III) based on similar characteristics including basic information, village appearance, water, sanitation, hygiene management, bio-vector control, and soil pollution. The proportion of cluster I, II, and III was 39.25 % (157/400), 31.75 % (127/400), and 29.00 % (116/400), respectively. Each cluster had its sanitation characteristics, and significant differences among the sub-clusters were observed in Cd of soil (p = 0.001), domestic sewage disposal ratio (p < 0.001), and environmental health funding (p = 0.001). To sum up, the environmental sanitation in rural areas of Chongqing was better than that of the national average in China, but the Cd content of farmland soil was higher. Our study provides novel evidence for assessing the characteristics of rural sanitation at the community level, and communities with similar environmental health status were clustered into the same group, while the heterogeneity of different sub-cluster was thoroughly characterized. The hierarchical control strategy should be performed in communities based on different characteristics and deficiencies of the clusters to improve environmental sanitation.
Through a survey of rose diseases in the South Tropical Garden in Kunming, China, it was found that black spot was the most common and serious disease of rose cultivated in the open air there, with ...an incidence of more than 90%. In this study, fungus isolation was performed on leaf samples of five black spot susceptible varieties of rose from the South Tropical Garden by tissue isolation. 18 strains of fungus were initially obtained, and seven of them were finally identified to cause black spot symptoms on healthy leaves of rose after verification by Koch's rule. By observing the morphology of colonies and spores, and constructing a phylogenetic tree by combining molecular biology and multiple genes, two pathogenic fungus were identified, namely, Alternaria alternata and Gnomoniopsis rosae. G. rosae was the first pathogenic fungi of rose black spot isolated and identified in this study. The results of this study can provide a reference base for further research and control of the black spot disease of rose in Kunming.
Abstract
Introduction
Sleep bruxism (BXM) is the result of rhythmic muscular masticatory activity (RMMA) and can be captured by masseters surface electromyography (sEMG). Despite the multiple adverse ...negative consequences of BXM, a simple reliable home diagnostic device is currently unavailable, with in laboratory audio-video polysomnography (type I PSG) remaining the gold standard diagnostic tool. Mandibular movements (MM) recordings during sleep can readily identify RMMA, are simple to set up and can be easily repeated from night to night. Here, we aimed to identify stereotypical MM in patients with BXM, and to develop RMMA automatic detection and BXM diagnosis using an artificial intelligence-based approach.
Methods
MM were recorded by a dedicated sensor (Sunrise, Namur, Belgium) in 12 patients with BXM during type I PSG. The Sunrise system consists of a coin-sized hardware that is comfortably placed on the subject’s chin. Its embedded inertial measurement unit communicates via Bluetooth with a smartphone and automatically transfers MM signals to a cloud-based infrastructure at the end of the night. Data processing and analysis are then performed in Python programming language.
A time series cluster analysis was applied to sequences of masseters sEMG and MM signals during BXM episodes (n=300) and during spontaneous micro-arousals (n=300). Then, a convolutional neuronal network (CNN) was developed to identify BXM and distinguish it from spontaneous micro-arousals while exclusively relying on MM signal.
Results
Based on the cluster analysis, BXM periods were characterized by a specific pattern of MM signals (higher frequency and amplitude), which was closely associated with the sEMG signals but clearly differed from the MM signal patterns during micro-arousals.
CNN-based classifier distinguished the BXM events from other RMMAs during micro-arousals and respiratory efforts with an overall accuracy of 91%.
Conclusion
Sleep bruxism can be automatically identified, quantified, and characterized with mandibular movements analysis supported by artificial intelligence technology.
Support
This work was supported by the French National Research Agency (ANR-12-TECS-0010), in the framework of the “Investissements d’avenir” program (ANR-15-IDEX-02). https://life.univ-grenoble-alpes.fr.
Background: Clinical trials for traumatic spine injury (TSI) are difficult to conduct because of the amount of variation in neurologic impairments and patient characteristics within a population. ...Identifying homogeneous subgroups may improve the effective implementation of treatment strategies and patient trajectories. We hypothesize that using automated clustering can mimic or improve preexisting (traditional) classification Methods: Our objective was to identify the features influencing the optimized clusters creating each homogenous group of patients. Methods: Spectral cluster analysis was deployed to group similar patients on the basis of baseline prognostic factors: location of injury, severity of injury, baseline Functional Independence Measure (FIM) motor score and demographic data (age, body mass index). The FIM motor score at follow-up and the total length of stay were superimposed on identified clusters as outcome variables. Prognostic factors used in spectral cluster analysis and superimposed outcome variables were assessed statistically. The performance of the spectral algorithm was evaluated by the silhouette coefficient, the Davies-Bouldin index and the elbow method to determine the optimal number of clusters. The identified clusters were qualitatively and systematically described to explore patient-focused insights. Results: A total of 338 patients with TSI from the Rick Hansen Spinal Cord Injury Registry (RHSCIR) were analyzed from baseline to follow-up. Five distinct clusters were identified with a statistically significant difference (p < 0.05) for prognostic factors. Outcome variables were statistically significant among all clusters. Conclusion: Using automated clustering, we found significant differences between prognostic variables and superimposed outcome variables across all 5 clusters. Qualitative descriptions of distinguishing prognostic variables for each cluster revealed that the spectral cluster analysis provided clinically intuitive groupings of patients. Additionally, we observed a relationship between the location of the injury and the mechanism of injury among all clusters, which adds significant clinical categorizations of patients with TSI in their classification. This data-driven approach identified homogenous subgroups without depending on a priori decisions, which is a step toward improving traditional classification.
We consider statistical inference for regression when data are grouped into clusters, with regression model errors independent across clusters but correlated within clusters. Examples include data on ...individuals with clustering on village or region or other category such as industry, and state-year differences-in-differences studies with clustering on state. In such settings, default standard errors can greatly overstate estimator precision. Instead, if the number of clusters is large, statistical inference after OLS should be based on cluster-robust standard errors. We outline the basic method as well as many complications that can arise in practice. These include cluster-specific fixed effects, few clusters, multiway clustering, and estimators other than OLS.
K-Means is a simple clustering algorithm that has the ability to throw large amounts of data, partition datasets into several clusters k. The algorithm is quite easy to implement and run, relatively ...fast and efficient. Another division of K-Means still has several weaknesses, namely in determining the number of clusters, determining the cluster center. The results of the cluster formed from the K-means method is very dependent on the initiation of the initial cluster center value provided. This causes the results of the cluster to be a solution that is locally optimal. This research was conducted to overcome the weaknesses in the K-Means algorithm, namely: improvements to the K-Means algorithm produce better clusters, namely the application of Sum Of Squared Error (SSE) to help K-Means Clustering in determining the optimum number of clusters, From this modification process, it is expected that the cluster center obtained will produce clusters, where the cluster members have a high level of similarity. Improving the performance of the K-Means cluster will be applied to determining the number of clusters using the elbow method.
Defining clusters of related industries Delgado, Mercedes; Porter, Michael E.; Stern, Scott
Journal of economic geography,
01/2016, Letnik:
16, Številka:
1
Journal Article
Recenzirano
Clusters are geographic concentrations of industries related by knowledge, skills, inputs, demand and/or other linkages. There is an increasing need for cluster-based data to support research, ...facilitate comparisons of clusters across regions and support policymakers in defining regional strategies. This article develops a novel clustering algorithm that systematically generates and assesses sets of cluster definitions (i.e., groups of closely related industries). We implement the algorithm using 2009 data for U.S. industries (six-digit NAICS), and propose a new set of benchmark cluster definitions that incorporates measures of inter-industry linkages based on co-location patterns, input–output links, and similarities in labor occupations. We also illustrate the algorithm’s ability to compare alternative sets of cluster definitions by evaluating our new set against existing sets in the literature. We find that our proposed set outperforms other methods in capturing a wide range of inter-industry linkages, including the grouping of industries within the same three-digit NAICS.