Mobile apps for mental health have the potential to overcome access barriers to mental health care, but there is little information on whether patients use the interventions as intended and the ...impact they have on mental health outcomes.
The objective of our study was to document and compare use patterns and clinical outcomes across the United States between 3 different self-guided mobile apps for depression.
Participants were recruited through Web-based advertisements and social media and were randomly assigned to 1 of 3 mood apps. Treatment and assessment were conducted remotely on each participant's smartphone or tablet with minimal contact with study staff. We enrolled 626 English-speaking adults (≥18 years old) with mild to moderate depression as determined by a 9-item Patient Health Questionnaire (PHQ-9) score ≥5, or if their score on item 10 was ≥2. The apps were (1) Project: EVO, a cognitive training app theorized to mitigate depressive symptoms by improving cognitive control, (2) iPST, an app based on an evidence-based psychotherapy for depression, and (3) Health Tips, a treatment control. Outcomes were scores on the PHQ-9 and the Sheehan Disability Scale. Adherence to treatment was measured as number of times participants opened and used the apps as instructed.
We randomly assigned 211 participants to iPST, 209 to Project: EVO, and 206 to Health Tips. Among the participants, 77.0% (482/626) had a PHQ-9 score >10 (moderately depressed). Among the participants using the 2 active apps, 57.9% (243/420) did not download their assigned intervention app but did not differ demographically from those who did. Differential treatment effects were present in participants with baseline PHQ-9 score >10, with the cognitive training and problem-solving apps resulting in greater effects on mood than the information control app (χ22=6.46, P=.04).
Mobile apps for depression appear to have their greatest impact on people with more moderate levels of depression. In particular, an app that is designed to engage cognitive correlates of depression had the strongest effect on depressed mood in this sample. This study suggests that mobile apps reach many people and are useful for more moderate levels of depression.
Clinicaltrials.gov NCT00540865; https://www.clinicaltrials.gov/ct2/show/NCT00540865 (Archived by WebCite at http://www.webcitation.org/6mj8IPqQr).
The predictive accuracy of a survival model can be summarized using extensions of the proportion of variation explained by the model, or R2, commonly used for continuous response models, or using ...extensions of sensitivity and specificity, which are commonly used for binary response models. In this article we propose new time-dependent accuracy summaries based on time-specific versions of sensitivity and specificity calculated over risk sets. We connect the accuracy summaries to a previously proposed global concordance measure, which is a variant of Kendall's tau. In addition, we show how standard Cox regression output can be used to obtain estimates of time-dependent sensitivity and specificity, and time-dependent receiver operating characteristic (ROC) curves. Semiparametric estimation methods appropriate for both proportional and nonproportional hazards data are introduced, evaluated in simulations, and illustrated using two familiar survival data sets.
Diabetic kidney disease is the leading cause of chronic and end-stage kidney disease in the United States and worldwide. Changes in demographics and treatments may affect the prevalence and clinical ...manifestations of diabetic kidney disease.
To characterize the clinical manifestations of kidney disease among US adults with diabetes over time.
Serial cross-sectional studies of adults aged 20 years or older with diabetes mellitus participating in National Health and Nutrition Examination Surveys from 1988 through 2014.
Diabetes was defined as hemoglobin A1c greater than 6.5% or use of glucose-lowering medications.
Albuminuria (urine albumin-to-creatinine ratio ≥30 mg/g), macroalbuminuria (urine albumin-to-creatinine ratio ≥300 mg/g), reduced estimated glomerular filtration rate (eGFR <60 mL/min/1.73 m2), and severely reduced eGFR (<30 mL/min/1.73 m2), incorporating data on biological variability to estimate the prevalence of persistent abnormalities.
There were 6251 adults with diabetes included (1431 from 1988-1994, 1443 from 1999-2004, 1280 from 2005-2008, and 2097 from 2009-2014). The prevalence of any diabetic kidney disease, defined as persistent albuminuria, persistent reduced eGFR, or both, did not significantly change over time from 28.4% (95% CI, 23.8%-32.9%) in 1988-1994 to 26.2% (95% CI, 22.6%-29.9%) in 2009-2014 (prevalence ratio, 0.95 95% CI, 0.86-1.06 adjusting for age, sex, and race/ethnicity; P = .39 for trend). However, the prevalence of albuminuria decreased progressively over time from 20.8% (95% CI, 16.3%-25.3%) in 1988-1994 to 15.9% (95% CI, 12.7%-19.0%) in 2009-2014 (adjusted prevalence ratio, 0.76 95% CI, 0.65-0.89; P < .001 for trend). In contrast, the prevalence of reduced eGFR increased from 9.2% (95% CI, 6.2%-12.2%) in 1988-1994 to 14.1% (95% CI, 11.3%-17.0%) in 2009-2014 (adjusted prevalence ratio, 1.61 95% CI, 1.33-1.95 comparing 2009-2014 with 1988-1994; P < .001 for trend), with a similar pattern for severely reduced eGFR (adjusted prevalence ratio, 2.86 95% CI, 1.38-5.91; P = .004 for trend). Significant heterogeneity in the temporal trend for albuminuria was noted by age (P = .049 for interaction) and race/ethnicity (P = .007 for interaction), with a decreasing prevalence of albuminuria observed only among adults younger than 65 years and non-Hispanic whites, whereas the prevalence of reduced GFR increased without significant differences by age or race/ethnicity. In 2009-2014, approximately 8.2 million adults with diabetes (95% CI, 6.5-9.9 million adults) had albuminuria, reduced eGFR, or both.
Among US adults with diabetes from 1988 to 2014, the overall prevalence of diabetic kidney disease did not change significantly, whereas the prevalence of albuminuria declined and the prevalence of reduced eGFR increased.
Many medical decisions involve the use of dynamic information collected on individual patients toward predicting likely transitions in their future health status. If accurate predictions are ...developed, then a prognostic model can identify patients at greatest risk for future adverse events and may be used clinically to define populations appropriate for targeted intervention. In practice, a prognostic model is often used to guide decisions at multiple time points over the course of disease, and classification performance (i.e., sensitivity and specificity) for distinguishing high-risk v. low-risk individuals may vary over time as an individual’s disease status and prognostic information change. In this tutorial, we detail contemporary statistical methods that can characterize the time-varying accuracy of prognostic survival models when used for dynamic decision making. Although statistical methods for evaluating prognostic models with simple binary outcomes are well established, methods appropriate for survival outcomes are less well known and require time-dependent extensions of sensitivity and specificity to fully characterize longitudinal biomarkers or models. The methods we review are particularly important in that they allow for appropriate handling of censored outcomes commonly encountered with event time data. We highlight the importance of determining whether clinical interest is in predicting cumulative (or prevalent) cases over a fixed future time interval v. predicting incident cases over a range of follow-up times and whether patient information is static or updated over time. We discuss implementation of time-dependent receiver operating characteristic approaches using relevant R statistical software packages. The statistical summaries are illustrated using a liver prognostic model to guide transplantation in primary biliary cirrhosis.
Digital technologies such as smartphones are transforming the way scientists conduct biomedical research. Several remotely conducted studies have recruited thousands of participants over a span of a ...few months allowing researchers to collect real-world data at scale and at a fraction of the cost of traditional research. Unfortunately, remote studies have been hampered by substantial participant attrition, calling into question the representativeness of the collected data including generalizability of outcomes. We report the findings regarding recruitment and retention from eight remote digital health studies conducted between 2014-2019 that provided individual-level study-app usage data from more than 100,000 participants completing nearly 3.5 million remote health evaluations over cumulative participation of 850,000 days. Median participant retention across eight studies varied widely from 2-26 days (median across all studies = 5.5 days). Survival analysis revealed several factors significantly associated with increase in participant retention time, including (i) referral by a clinician to the study (increase of 40 days in median retention time); (ii) compensation for participation (increase of 22 days, 1 study); (iii) having the clinical condition of interest in the study (increase of 7 days compared with controls); and (iv) older age (increase of 4 days). Additionally, four distinct patterns of daily app usage behavior were identified by unsupervised clustering, which were also associated with participant demographics. Most studies were not able to recruit a sample that was representative of the race/ethnicity or geographical diversity of the US. Together these findings can help inform recruitment and retention strategies to enable equitable participation of populations in future digital health research.
Vertebroplasty is commonly used to treat painful, osteoporotic vertebral compression fractures.
In this multicenter trial, we randomly assigned 131 patients who had one to three painful osteoporotic ...vertebral compression fractures to undergo either vertebroplasty or a simulated procedure without cement (control group). The primary outcomes were scores on the modified Roland-Morris Disability Questionnaire (RDQ) (on a scale of 0 to 23, with higher scores indicating greater disability) and patients' ratings of average pain intensity during the preceding 24 hours at 1 month (on a scale of 0 to 10, with higher scores indicating more severe pain). Patients were allowed to cross over to the other study group after 1 month.
All patients underwent the assigned intervention (68 vertebroplasties and 63 simulated procedures). The baseline characteristics were similar in the two groups. At 1 month, there was no significant difference between the vertebroplasty group and the control group in either the RDQ score (difference, 0.7; 95% confidence interval CI, -1.3 to 2.8; P=0.49) or the pain rating (difference, 0.7; 95% CI, -0.3 to 1.7; P=0.19). Both groups had immediate improvement in disability and pain scores after the intervention. Although the two groups did not differ significantly on any secondary outcome measure at 1 month, there was a trend toward a higher rate of clinically meaningful improvement in pain (a 30% decrease from baseline) in the vertebroplasty group (64% vs. 48%, P=0.06). At 3 months, there was a higher crossover rate in the control group than in the vertebroplasty group (51% vs. 13%, P<0.001) corrected. There was one serious adverse event in each group.
Improvements in pain and pain-related disability associated with osteoporotic compression fractures in patients treated with vertebroplasty were similar to the improvements in a control group. (ClinicalTrials.gov number, NCT00068822.)
The stepped wedge cluster randomized design has received increasing attention in pragmatic clinical trials and implementation science research. The key feature of the design is the unidirectional ...crossover of clusters from the control to intervention conditions on a staggered schedule, which induces confounding of the intervention effect by time. The stepped wedge design first appeared in the Gambia hepatitis study in the 1980s. However, the statistical model used for the design and analysis was not formally introduced until 2007 in an article by Hussey and Hughes. Since then, a variety of mixed-effects model extensions have been proposed for the design and analysis of these trials. In this article, we explore these extensions under a unified perspective. We provide a general model representation and regard various model extensions as alternative ways to characterize the secular trend, intervention effect, as well as sources of heterogeneity. We review the key model ingredients and clarify their implications for the design and analysis. The article serves as an entry point to the evolving statistical literatures on stepped wedge designs.
High-dose erythropoietin has been shown to have a neuroprotective effect in preclinical models of neonatal brain injury, and phase 2 trials have suggested possible efficacy; however, the benefits and ...safety of this therapy in extremely preterm infants have not been established.
In this multicenter, randomized, double-blind trial of high-dose erythropoietin, we assigned 941 infants who were born at 24 weeks 0 days to 27 weeks 6 days of gestation to receive erythropoietin or placebo within 24 hours after birth. Erythropoietin was administered intravenously at a dose of 1000 U per kilogram of body weight every 48 hours for a total of six doses, followed by a maintenance dose of 400 U per kilogram three times per week by subcutaneous injection through 32 completed weeks of postmenstrual age. Placebo was administered as intravenous saline followed by sham injections. The primary outcome was death or severe neurodevelopmental impairment at 22 to 26 months of postmenstrual age. Severe neurodevelopmental impairment was defined as severe cerebral palsy or a composite motor or composite cognitive score of less than 70 (which corresponds to 2 SD below the mean, with higher scores indicating better performance) on the Bayley Scales of Infant and Toddler Development, third edition.
A total of 741 infants were included in the per-protocol efficacy analysis: 376 received erythropoietin and 365 received placebo. There was no significant difference between the erythropoietin group and the placebo group in the incidence of death or severe neurodevelopmental impairment at 2 years of age (97 children 26% vs. 94 children 26%; relative risk, 1.03; 95% confidence interval, 0.81 to 1.32; P = 0.80). There were no significant differences between the groups in the rates of retinopathy of prematurity, intracranial hemorrhage, sepsis, necrotizing enterocolitis, bronchopulmonary dysplasia, or death or in the frequency of serious adverse events.
High-dose erythropoietin treatment administered to extremely preterm infants from 24 hours after birth through 32 weeks of postmenstrual age did not result in a lower risk of severe neurodevelopmental impairment or death at 2 years of age. (Funded by the National Institute of Neurological Disorders and Stroke; PENUT ClinicalTrials.gov number, NCT01378273.).
Stepped wedge design is a popular research design that enables a rigorous evaluation of candidate interventions by using a staggered cluster randomization strategy. While analytical methods were ...developed for designing stepped wedge trials, the prior focus has been solely on testing for the average treatment effect. With a growing interest on formal evaluation of the heterogeneity of treatment effects across patient subpopulations, trial planning efforts need appropriate methods to accurately identify sample sizes or design configurations that can generate evidence for both the average treatment effect and variations in subgroup treatment effects. To fill in that important gap, this article derives novel variance formulas for confirmatory analyses of treatment effect heterogeneity, that are applicable to both cross‐sectional and closed‐cohort stepped wedge designs. We additionally point out that the same framework can be used for more efficient average treatment effect analyses via covariate adjustment, and allows the use of familiar power formulas for average treatment effect analyses to proceed. Our results further sheds light on optimal design allocations of clusters to maximize the weighted precision for assessing both the average and heterogeneous treatment effects. We apply the new methods to the Lumbar Imaging with Reporting of Epidemiology Trial, and carry out a simulation study to validate our new methods.