Proteases play vital roles in a range of biological processes, such as cell cycle, cell growth and differentiation, apoptosis, haemostasis and signalling. Fundamental to our knowledge of protease ...action is an understanding of how the active site operates; this has been examined through extensive studies of the substrate specificity of the enzymes. Kinetic and structural analyses have shown that the binding of a particular substrate residue at a protease subsite can have either a positive or negative influence on the binding of particular residues at other subsites. This phenomenon has been termed subsite cooperativity and has been observed in a wide range of proteases, often between non-adjacent subsites. This review aims to highlight studies where subsite cooperativity has been observed, experimental techniques used in the past and potential methods that can be employed to comprehensively examine this phenomenon. Further understanding of how the protease active site recognises and chooses its substrates for cleavage will have a significant impact on the development of pharmaceuticals that target these enzymes.
Many problems in bioinformatics involve classification based on features such as sequence, structure or morphology. Given multiple classifiers, two crucial questions arise: how does their performance ...compare, and how can they best be combined to produce a better classifier? A classifier can be evaluated in terms of sensitivity and specificity using benchmark, or gold standard, data, that is, data for which the true classification is known. However, a gold standard is not always available. Here we demonstrate that a Bayesian model for comparing medical diagnostics without a gold standard can be successfully applied in the bioinformatics domain, to genomic scale data sets. We present a new implementation, which unlike previous implementations is applicable to any number of classifiers. We apply this model, for the first time, to the problem of finding the globally optimal logical combination of classifiers.
We compared three classifiers of protein subcellular localisation, and evaluated our estimates of sensitivity and specificity against estimates obtained using a gold standard. The method overestimated sensitivity and specificity with only a small discrepancy, and correctly ranked the classifiers. Diagnostic tests for swine flu were then compared on a small data set. Lastly, classifiers for a genome-wide association study of macular degeneration with 541094 SNPs were analysed. In all cases, run times were feasible, and results precise. The optimal logical combination of classifiers was also determined for all three data sets. Code and data are available from http://bioinformatics.monash.edu.au/downloads/.
The examples demonstrate the methods are suitable for both small and large data sets, applicable to the wide range of bioinformatics classification problems, and robust to dependence between classifiers. In all three test cases, the globally optimal logical combination of the classifiers was found to be their union, according to three out of four ranking criteria. We propose as a general rule of thumb that the union of classifiers will be close to optimal.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Glial fibrillary acidic protein (GFAP) is an intermediate filament (IF) protein specific to central nervous system (CNS) astrocytes. It has been the subject of intense interest due to its association ...with neurodegenerative diseases, and because of growing evidence that IF proteins not only modulate cellular structure, but also cellular function. Moreover, GFAP has a family of splicing isoforms apparently more complex than that of other CNS IF proteins, consistent with it possessing a range of functional and structural roles. The gene consists of 9 exons, and to date all isoforms associated with 3' end splicing have been identified from modifications within intron 7, resulting in the generation of exon 7a (GFAPδ/ε) and 7b (GFAPκ). To better understand the nature and functional significance of variation in this region, we used a Bayesian multiple change-point approach to identify conserved regions. This is the first successful application of this method to a single gene--it has previously only been used in whole-genome analyses. We identified several highly or moderately conserved regions throughout the intron 7/7a/7b regions, including untranslated regions and regulatory features, consistent with the biology of GFAP. Several putative unconfirmed features were also identified, including a possible new isoform. We then integrated multiple computational analyses on both the DNA and protein sequences from the mouse, rat and human, showing that the major isoform, GFAPα, has highly conserved structure and features across the three species, whereas the minor isoforms GFAPδ/ε and GFAPκ have low conservation of structure and features at the distal 3' end, both relative to each other and relative to GFAPα. The overall picture suggests distinct and tightly regulated functions for the 3' end isoforms, consistent with complex astrocyte biology. The results illustrate a computational approach for characterising splicing isoform families, using both DNA and protein sequences.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Background
Electronic patient-reported outcome (ePRO) systems are increasingly used in clinical trials to provide evidence of efficacy and tolerability of treatment from the patient perspective. The ...aim of this study is twofold: (1) to describe how we developed an electronic platform for patients to report their symptoms, and (2) to develop and undertake usability testing of an ePRO solution for use in a study of cell therapy seeking to provide early evidence of efficacy and tolerability of treatment and test the feasibility of the system for use in later phase studies.
Methods
An ePRO system was designed to be used in a single arm, multi-centre, phase II basket trial investigating the safety and activity of the use of ORBCEL-C™ in the treatment of patients with inflammatory conditions. ORBCEL-C™ is an enriched Mesenchymal Stromal Cells product isolated from human umbilical cord tissue using CD362+ cell selection. Usability testing sessions were conducted using cognitive interviews and the ‘Think Aloud’ method with patient advisory group members and Research Nurses to assess the usability of the system.
Results
Nine patient partners and seven research nurses took part in one usability testing session. Measures of fatigue and health-related quality of life, the PRO-CTCAE™ and FACT-GP5 global tolerability question were included in the ePRO system. Alert notifications to the clinical team were triggered by PRO-CTCAE™ and FACT-GP5 scores. Patient participants liked the simplicity and responsiveness of the patient-facing app. Two patients were unable to complete the testing session, due to technical issues. Research Nurses suggested minor modifications to improve functionality and the layout of the clinician dashboard and the training materials.
Conclusion
By testing the effectiveness, efficiency, and satisfaction of our novel ePRO system (PROmics
R
), we learnt that most people with an inflammatory condition found it easy to report their symptoms using an app on their own device. Their experiences using the PROmics
R
ePRO system within a trial environment will be further explored in our upcoming feasibility testing. Research nurses were also positive and found the clinical dashboard easy-to-use. Using ePROs in early phase trials is important in order to provide evidence of therapeutic responses and tolerability, increase the evidence based, and inform methodology development.
Trial registration
: ISRCTN, ISRCTN80103507. Registered 01 April 2022,
https://www.isrctn.com/ISRCTN80103507
Plain English summary
More and more patients tell clinicians how they feel by completing questionnaires electronically. Therefore, it is important to assess how easy it is for patients to do this. In this study, we describe how we developed an electronic platform for patients to report their symptoms and how we tested the usability of this platform with patient partners and research nurses. Once the electronic platform was developed, quality of life and symptoms questionnaires were programmed onto it. Alerts were sent to the clinical team if specific scores were obtained on the symptoms questionnaires. Although two patient partners were not able to finish the testing session because of technical issues, the ones who completed the session liked its simplicity and responsiveness. The research nurses also liked the system and only suggested minor modifications. Following this testing, we refined the electronic platform to test it further in a larger study which investigates the safety and use of a drug. We hope that thanks to this electronic platform, we will obtain useful information on the safety and efficacy of treatment.
Short-term follow-up in the Fluid and Catheter Treatment Trial (FACTT) suggested differential mortality by race with conservative fluid management, but no significant interaction.
In a post hoc ...analysis of FACTT including 1-year follow-up, we sought to estimate long-term mortality by race and test for an interaction between fluids and race.
We performed a post hoc analysis of FACTT and the Economic Analysis of Pulmonary Artery Catheters (EAPAC) study (which included 655 of the 1,000 FACTT patients with near-complete 1-year follow up). We fit a multistate Markov model to estimate 1-year mortality for all non-Hispanic black and white randomized FACTT subjects. The model estimated the distribution of time from randomization to hospital discharge or hospital death (available on all patients) and estimated the distribution of time from hospital discharge to death using data on patients after hospital discharge for patients in EAPAC. The 1-year mortality was found by combining these estimates.
Non-Hispanic black (n = 217, 25%) or white identified subjects (n = 641, 75%) were included. There was a significant interaction between race and fluid treatment (P = 0.012). One-year mortality was lower for black subjects assigned to conservative fluids (38 vs. 54%; mean mortality difference, 16%; 95% confidence interval, 2-30%; P = 0.027 between conservative and liberal). Conversely, 1-year mortality for white subjects was 35% versus 30% for conservative versus liberal arms (mean mortality difference, -4.8%; 95% confidence interval, -13% to 3%; P = 0.23).
In our cohort, conservative fluid management may have improved 1-year mortality for non-Hispanic black patients with ARDS. However, we found no long-term benefit of conservative fluid management in white subjects.
Quantitative high-throughput data deposited in consortia such as International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA) present opportunities and challenges for ...computational analyses.
We present a computational strategy to systematically rank and investigate a large number (2
-2
) of clinically testable gene sets, using combinatorial gene subset generation and disease-free survival (DFS) analyses. This approach integrates protein-protein interaction networks, gene expression, DNA methylation, and copy number data, in association with DFS profiles from patient clinical records.
As a case study, we applied this pipeline to systematically analyze the role of
in prostate cancer (PCa). We have previously found this gene to have multiple roles in disease and homeostasis, and here we investigate the role of the associated
gene/protein networks in PCa, using our methodology in combination with PCa patient clinical profiles from ICGC and TCGA databases. Relationships between gene signatures and relapse were analyzed using Kaplan-Meier (KM) log-rank analysis and multivariable Cox regression. Relative expression versus pooled mean from diploid population was used for
-statistics calculation. Gene/protein interaction network analyses generated 11 core genes associated with
; combinatorial ranking of the power set of these core genes identified two gene sets (out of 2
- 1 = 2,047 combinations) with significant correlation with disease relapse (KM log rank
< 0.05). For the more significant of these two sets, referred to as the optimal gene set (OGS), patients have median survival 62.7 months with OGS alterations compared to >150 months without OGS alterations (
= 0.0248, hazard ratio = 2.213, 95% confidence interval = 1.1-4.098). Two genes comprising OGS (
and
) are strongly associated with
in the retinoic acid (RA) pathways, suggesting a major role of RA signaling in early PCa progression. Our pipeline complements human expertise in the search for prognostic biomarkers in large-scale datasets.
Understanding the complexity of cardiac physiology requires system-level studies of multiple cardiac cell types. Frequently, however, the end result of published research lacks the detail of the ...collaborative and integrative experimental design process, and the underlying conceptual framework. We review the recent progress in systems modelling and omics analysis of the heterocellular heart environment through complementary forward and inverse approaches, illustrating these conceptual and experimental frameworks with case studies from our own research program. The forward approach begins by collecting curated information from the niche cardiac biology literature, and connecting the dots to form mechanistic network models that generate testable system-level predictions. The inverse approach starts from the vast pool of public omics data in recent cardiac biological research, and applies bioinformatics analysis to produce novel candidates for further investigation. We also discuss the possibility of combining these two approaches into a hybrid framework, together with the benefits and challenges. These interdisciplinary research frameworks illustrate the interplay between computational models, omics analysis, and wet lab experiments, which holds the key to making real progress in improving human cardiac wellbeing.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UL, UM, UPCLJ, UPUK
Caspase-8 is the initiator caspase of the extrinsic apoptosis pathway and also has a role in non-apoptotic physiologies. Identifying endogenous substrates for caspase-8 by using integrated ...bioinformatics and biological approaches is required to delineate the diverse roles of this caspase. We describe a number of novel putative caspase-8 substrates using the Prediction of Protease Specificity (PoPS) program, one of which is histone deacetylase 7 (HDAC7). HDAC7 is cleaved faster than any other caspase-8 substrate described to date. It is also cleaved in primary CD4+CD8+ thymocytes undergoing extrinsic apoptosis. By using naturally occurring caspase inhibitors that have evolved exquisite specificity at concentrations found within the cell, we could unequivocally assign the cleavage activity to caspase-8. Importantly, cleavage of HDAC7 alters its subcellular localization and abrogates its Nur77 repressor function. Thus we demonstrate a direct role for initiator caspase-mediated proteolysis in promoting gene transcription.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
Existing de novo software platforms have largely overlooked a valuable resource, the expertise of the intended biologist users. Typical data representations such as long gene lists, or highly dense ...and overlapping transcription factor networks often hinder biologists from relating these results to their expertise.
VISIONET, a streamlined visualisation tool built from experimental needs, enables biologists to transform large and dense overlapping transcription factor networks into sparse human-readable graphs via numerically filtering. The VISIONET interface allows users without a computing background to interactively explore and filter their data, and empowers them to apply their specialist knowledge on far more complex and substantial data sets than is currently possible. Applying VISIONET to the Tbx20-Gata4 transcription factor network led to the discovery and validation of Aldh1a2, an essential developmental gene associated with various important cardiac disorders, as a healthy adult cardiac fibroblast gene co-regulated by cardiogenic transcription factors Gata4 and Tbx20.
We demonstrate with experimental validations the utility of VISIONET for expertise-driven gene discovery that opens new experimental directions that would not otherwise have been identified.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK