Severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) infection is associated with a range of persistent symptoms impacting everyday functioning, known as post-COVID-19 condition or long ...COVID. We undertook a retrospective matched cohort study using a UK-based primary care database, Clinical Practice Research Datalink Aurum, to determine symptoms that are associated with confirmed SARS-CoV-2 infection beyond 12 weeks in non-hospitalized adults and the risk factors associated with developing persistent symptoms. We selected 486,149 adults with confirmed SARS-CoV-2 infection and 1,944,580 propensity score-matched adults with no recorded evidence of SARS-CoV-2 infection. Outcomes included 115 individual symptoms, as well as long COVID, defined as a composite outcome of 33 symptoms by the World Health Organization clinical case definition. Cox proportional hazards models were used to estimate adjusted hazard ratios (aHRs) for the outcomes. A total of 62 symptoms were significantly associated with SARS-CoV-2 infection after 12 weeks. The largest aHRs were for anosmia (aHR 6.49, 95% CI 5.02-8.39), hair loss (3.99, 3.63-4.39), sneezing (2.77, 1.40-5.50), ejaculation difficulty (2.63, 1.61-4.28) and reduced libido (2.36, 1.61-3.47). Among the cohort of patients infected with SARS-CoV-2, risk factors for long COVID included female sex, belonging to an ethnic minority, socioeconomic deprivation, smoking, obesity and a wide range of comorbidities. The risk of developing long COVID was also found to be increased along a gradient of decreasing age. SARS-CoV-2 infection is associated with a plethora of symptoms that are associated with a range of sociodemographic and clinical risk factors.
Abstract
Motivation
Existing microbiome-based disease prediction relies on the ability of machine learning methods to differentiate disease from healthy subjects based on the observed taxa abundance ...across samples. Despite numerous microbes have been implicated as potential biomarkers, challenges remain due to not only the statistical nature of microbiome data but also the lack of understanding of microbial interactions which can be indicative of the disease.
Results
We propose CACONET (classification of Compositional-Aware COrrelation NETworks), a computational framework that learns to classify microbial correlation networks and extracts potential signature interactions, taking as input taxa relative abundance across samples and their health status. By using Bayesian compositional-aware correlation inference, a collection of posterior correlation networks can be drawn and used for graph-level classification, thus incorporating uncertainty in the estimates. CACONET then employs a deep learning approach for graph classification, achieving excellent performance metrics by exploiting the correlation structure. We test the framework on both simulated data and a large real-world dataset pertaining to microbiome samples of colorectal cancer (CRC) and healthy subjects, and identify potential network substructure characteristic of CRC microbiota. CACONET is customizable and can be adapted to further improve its utility.
Availability and implementation
CACONET is available at https://github.com/yuanwxu/corr-net-classify.
Supplementary information
Supplementary data are available at Bioinformatics online.
Undetected atrial fibrillation (AF) is a major health concern. Blood biomarkers associated with AF could simplify patient selection for screening and further inform ongoing research towards ...stratified prevention and treatment of AF.
Forty common cardiovascular biomarkers were quantified in 638 consecutive patients referred to hospital mean ± standard deviation age 70 ± 12 years, 398 (62%) male, 294 (46%) with AF with known AF or ≥2 CHA2DS2-VASc risk factors. Paroxysmal or silent AF was ruled out by 7-day ECG monitoring. Logistic regression with forward selection and machine learning algorithms were used to determine clinical risk factors, imaging parameters, and biomarkers associated with AF. Atrial fibrillation was significantly associated with age bootstrapped odds ratio (OR) per year = 1.060, 95% confidence interval (1.04-1.10); P = 0.001, male sex OR = 2.022 (1.28-3.56); P = 0.008, body mass index BMI, OR per unit = 1.060 (1.02-1.12); P = 0.003, elevated brain natriuretic peptide BNP, OR per fold change = 1.293 (1.11-1.63); P = 0.002, elevated fibroblast growth factor-23 FGF-23, OR = 1.667 (1.36-2.34); P = 0.001, and reduced TNF-related apoptosis-induced ligand-receptor 2 TRAIL-R2, OR = 0.242 (0.14-0.32); P = 0.001, but not other biomarkers. Biomarkers improved the prediction of AF compared with clinical risk factors alone (net reclassification improvement = 0.178; P < 0.001). Both logistic regression and machine learning predicted AF well during validation area under the receiver-operator curve = 0.684 (0.62-0.75) and 0.697 (0.63-0.76), respectively.
Three simple clinical risk factors (age, sex, and BMI) and two biomarkers (elevated BNP and elevated FGF-23) identify patients with AF. Further research is warranted to elucidate FGF-23 dependent mechanisms of AF.
Previous in vitro and postmortem research suggests that inflammation may lead to structural brain changes via activation of microglia and/or astrocytic dysfunction in a range of neuropsychiatric ...disorders.
To investigate the relationship between inflammation and changes in brain structures in vivo and to explore a transcriptome-driven functional basis with relevance to mental illness.
This study used multistage linked analyses, including mendelian randomization (MR), gene expression correlation, and connectivity analyses. A total of 20 688 participants in the UK Biobank, which includes clinical, genomic, and neuroimaging data, and 6 postmortem brains from neurotypical individuals in the Allen Human Brain Atlas (AHBA), including RNA microarray data. Data were extracted in February 2021 and analyzed between March and October 2021.
Genetic variants regulating levels and activity of circulating interleukin 1 (IL-1), IL-2, IL-6, C-reactive protein (CRP), and brain-derived neurotrophic factor (BDNF) were used as exposures in MR analyses.
Brain imaging measures, including gray matter volume (GMV) and cortical thickness (CT), were used as outcomes. Associations were considered significant at a multiple testing-corrected threshold of P < 1.1 × 10-4. Differential gene expression in AHBA data was modeled in brain regions mapped to areas significant in MR analyses; genes were tested for biological and disease overrepresentation in annotation databases and for connectivity in protein-protein interaction networks.
Of 20 688 participants in the UK Biobank sample, 10 828 (52.3%) were female, and the mean (SD) age was 55.5 (7.5) years. In the UK Biobank sample, genetically predicted levels of IL-6 were associated with GMV in the middle temporal cortex (z score, 5.76; P = 8.39 × 10-9), inferior temporal (z score, 3.38; P = 7.20 × 10-5), fusiform (z score, 4.70; P = 2.60 × 10-7), and frontal (z score, -3.59; P = 3.30 × 10-5) cortex together with CT in the superior frontal region (z score, -5.11; P = 3.22 × 10-7). No significant associations were found for IL-1, IL-2, CRP, or BDNF after correction for multiple comparison. In the AHBA sample, 5 of 6 participants (83%) were male, and the mean (SD) age was 42.5 (13.4) years. Brain-wide coexpression analysis showed a highly interconnected network of genes preferentially expressed in the middle temporal gyrus (MTG), which further formed a highly connected protein-protein interaction network with IL-6 (enrichment test of expected vs observed network given the prevalence and degree of interactions in the STRING database: 43 nodes/30 edges observed vs 8 edges expected; mean node degree, 1.4; genome-wide significance, P = 4.54 × 10-9). MTG differentially expressed genes that were functionally enriched for biological processes in schizophrenia, autism spectrum disorder, and epilepsy.
In this study, genetically determined IL-6 was associated with brain structure and potentially affects areas implicated in developmental neuropsychiatric disorders, including schizophrenia and autism.
The association between allergic diseases and autoimmune disorders is not well established. Our objective was to determine incidence rates of autoimmune disorders in allergic rhinitis/conjunctivitis ...(ARC), atopic eczema and asthma, and to investigate for co-occurring patterns.
This was a retrospective cohort study (1990-2018) employing data extracted from The Health Improvement Network (UK primary care database). The exposure group comprised ARC, atopic eczema and asthma (all ages). For each exposed patient, up to two randomly selected age- and sex-matched controls with no documented allergic disease were used. Adjusted incidence rate ratios (aIRRs) were calculated using Poisson regression. A cross-sectional study was also conducted employing Association Rule Mining (ARM) to investigate disease clusters.
782 320, 1 393 570 and 1 049 868 patients with ARC, atopic eczema and asthma, respectively, were included. aIRRs of systemic lupus erythematosus (SLE), Sjögren's syndrome, vitiligo, rheumatoid arthritis, psoriasis, pernicious anaemia, inflammatory bowel disease, coeliac disease and autoimmune thyroiditis were uniformly higher in the three allergic diseases compared with controls. Specifically, aIRRs of SLE (1.45) and Sjögren's syndrome (1.88) were higher in ARC; aIRRs of SLE (1.44), Sjögren's syndrome (1.61) and myasthenia (1.56) were higher in asthma; and aIRRs of SLE (1.86), Sjögren's syndrome (1.48), vitiligo (1.54) and psoriasis (2.41) were higher in atopic eczema. There was no significant effect of the three allergic diseases on multiple sclerosis or of ARC and atopic eczema on myasthenia. Using ARM, allergic diseases clustered with multiple autoimmune disorders. Three age- and sex-related clusters were identified, with a relatively complex pattern in females ≥55 years old.
The long-term risks of autoimmune disorders are significantly higher in patients with allergic diseases. Allergic diseases and autoimmune disorders show age- and sex-related clustering patterns.
Ontologies are widely used in biological and biomedical research. Their success lies in their combination of four main features present in almost all ontologies: provision of standard identifiers for ...classes and relations that represent the phenomena within a domain; provision of a vocabulary for a domain; provision of metadata that describes the intended meaning of the classes and relations in ontologies; and the provision of machine-readable axioms and definitions that enable computational access to some aspects of the meaning of classes and relations. While each of these features enables applications that facilitate data integration, data access and analysis, a great potential lies in the possibility of combining these four features to support integrative analysis and interpretation of multimodal data. Here, we provide a functional perspective on ontologies in biology and biomedicine, focusing on what ontologies can do and describing how they can be used in support of integrative research. We also outline perspectives for using ontologies in data-driven science, in particular their application in structured data mining and machine learning applications.
Link prediction in a complex network is a problem of fundamental interest in network science and has attracted increasing attention in recent years. It aims to predict missing (or future) links ...between two entities in a complex system that are not already connected. Among existing methods, local similarity indices are most popular that take into account the information of common neighbours to estimate the likelihood of existence of a connection between two nodes. In this paper, we propose global and quasi-local extensions of some commonly used local similarity indices. We have performed extensive numerical simulations on publicly available datasets from diverse domains demonstrating that the proposed extensions not only give superior performance, when compared to their respective local indices, but also outperform some of the current, state-of-the-art, local and global link-prediction methods.
Echocardiography plays an essential role in the diagnosis and assessment of cardiovascular disease. Measurements derived from echocardiography are also used to determine the severity of disease, its ...progression over time, and to aid in the choice of optimal therapy. It is therefore clinically important that echocardiographic measurements be reproducible, repeatable, and reliable. There are a variety of statistical tests available to assess these parameters, and in this article the authors summarize those available for use by echocardiographers to improve their clinical practice. Correlation coefficients, linear regression, Bland-Altman plots, and the coefficient of variation are explored, along with their limitations. The authors also provide an online tool for the easy calculation of these statistics in the clinical environment (www.birmingham.ac.uk/echo). Quantifying and enhancing the reproducibility of echocardiography has important potential to improve the value of echocardiography as the basis for good clinical decision-making.
•Good reproducibility, repeatability, and reliability are essential for echo studies.•Straightforward statistical evaluation can improve echocardiography practice.•A free online tool is available at www.birmingham.ac.uk/echo.
We present Uberon, an integrated cross-species ontology consisting of over 6,500 classes representing a variety of anatomical entities, organized according to traditional anatomical classification ...criteria. The ontology represents structures in a species-neutral way and includes extensive associations to existing species-centric anatomical ontologies, allowing integration of model organism and human data. Uberon provides a necessary bridge between anatomical structures in different taxa for cross-species inference. It uses novel methods for representing taxonomic variation, and has proved to be essential for translational phenotype analyses. Uberon is available at http://uberon.org.