Abstract
Black Veterans have higher a incidence of localized and metastatic prostate cancer compared to White Veterans yet are underrepresented in reports of frequencies of somatic and germline ...alterations. This retrospective analysis of somatic and putative germline alterations was conducted in a large cohort of Veterans with prostate cancer (N = 835 Black, 1613 White) who underwent next generation sequencing through the VA Precision Oncology Program, which facilitates molecular testing for Veterans with metastatic cancer. No differences were observed in gene alterations for FDA approved targetable therapies (13.5% in Black Veterans vs. 15.5% in White Veterans, P = .21), nor in any potentially actionable alterations (25.5% vs. 28.7%, P =.1). Black Veterans had higher rates of BRAF (5.5% vs. 2.6%, P < .001) alterations, White Veterans TMPRSS2 fusions (27.2% vs. 11.7%, P < .0001). Putative germline alteration rates were higher in White Veterans (12.0% vs. 6.1%, P < .0001). Racial disparities in outcome are unlikely attributable to acquired somatic alterations in actionable pathways.
Black veterans have higher incidence of prostate cancer than White veterans but are underrepresented in reports of frequencies of somatic and germline alterations. This retrospective analysis of somatic and putative germline alterations was conducted in a large cohort of veterans with prostate cancer who underwent next-generation sequencing through the VA Precision Oncology Program. No differences were observed in actionable alterations, suggesting disparities in prostate cancer outcomes are not attributable to acquired somatic alterations in actionable pathways.
Display omitted
•Multi-attribute multi-value classification.•Marker decorated context classification.•Optimize sentence window for medication disposition context classification.
This article ...summarizes our approach to extracting medication and corresponding attributes from clinical notes, which is the focus of track 1 of the 2022 National Natural Language Processing (NLP) Clinical Challenges(n2c2) shared task.
The dataset was prepared using Contextualized Medication Event Dataset (CMED), including 500 notes from 296 patients. Our system consisted of three components: medication named entity recognition (NER), event classification (EC), and context classification (CC). These three components were built using transformer models with slightly different architecture and input text engineering. A zero-shot learning solution for CC was also explored.
Our best performance systems achieved micro-average F1 scores of 0.973, 0.911, and 0.909 for the NER, EC, and CC, respectively.
In this study, we implemented a deep learning-based NLP system and demonstrated that our approach of (1) utilizing special tokens helps our model to distinguish multiple medications mentions in the same context; (2) aggregating multiple events of a single medication into multiple labels improves our model’s performance.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
Despite impressive success of machine learning algorithms in clinical natural language processing (cNLP), rule-based approaches still have a prominent role. In this paper, we introduce medspaCy, an ...extensible, open-source cNLP library based on spaCy framework that allows flexible integration of rule-based and machine learning-based algorithms adapted to clinical text. MedspaCy includes a variety of components that meet common cNLP needs such as context analysis and mapping to standard terminologies. By utilizing spaCy's clear and easy-to-use conventions, medspaCy enables development of custom pipelines that integrate easily with other spaCy-based modules. Our toolkit includes several core components and facilitates rapid development of pipelines for clinical text.
In this study we sought to explore the possibility of using patient centered care (PCC) documentation as a measure of the delivery of PCC in a health system.
We first selected 6 VA medical centers ...based on their scores for a measure of support for self-management subscale from a national patient satisfaction survey (the Survey for Healthcare Experience-Patients). We accessed clinical notes related to either smoking cessation or weight management consults. We then annotated this dataset of notes for documentation of PCC concepts including: patient goals, provider support for goal progress, social context, shared decision making, mention of caregivers, and use of the patient's voice. We examined the association of documentation of PCC with patients' perception of support for self-management with regression analyses.
Two health centers had < 50 notes related to either tobacco cessation or weight management consults and were removed from further analysis. The resulting dataset includes 477 notes related to 311 patients total from 4 medical centers. For a majority of patients (201 out of 311; 64.8%) at least one PCC concept was present in their clinical notes. The most common PCC concepts documented were patient goals (patients n = 126; 63% clinical notes n = 302; 63%), patient voice (patients n = 165, 82%; clinical notes n = 323, 68%), social context (patients n = 105, 52%; clinical notes n = 181, 38%), and provider support for goal progress (patients n = 124, 62%; clinical notes n = 191, 40%). Documentation of goals for weight loss notes was greater at health centers with higher satisfaction scores compared to low. No such relationship was found for notes related to tobacco cessation.
Providers document PCC concepts in their clinical notes. In this pilot study we explored the feasibility of using this data as a means to measure the degree to which care in a health center is patient centered.
clinical EHR notes are a rich source of information about PCC that could potentially be used to assess PCC over time and across systems with scalable technologies such as natural language processing.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Electronically linked datasets have become an important part of clinical research. Information from multiple sources can be used to identify comorbid conditions and patient outcomes, measure use of ...healthcare services, and enrich demographic and clinical variables of interest. Innovative approaches for creating research infrastructure beyond a traditional data system are necessary.
Records from a large healthcare system's enterprise data warehouse (EDW) were linked to a statewide population database, and a master subject index was created. The authors evaluate the linkage, along with the impact of missing information in EDW records and the coverage of the population database. The makeup of the EDW and population database provides a subset of cancer records that exist in both resources, which allows a cancer-specific evaluation of the linkage.
About 3.4 million records (60.8%) in the EDW were linked to the population database with a minimum accuracy of 96.3%. It was estimated that approximately 24.8% of target records were absent from the population database, which enabled the effect of the amount and type of information missing from a record on the linkage to be estimated. However, 99% of the records from the oncology data mart linked; they had fewer missing fields and this correlated positively with the number of patient visits.
A general-purpose research infrastructure was created which allows disease-specific cohorts to be identified. The usefulness of creating an index between institutions is that it allows each institution to maintain control and confidentiality of their own information.
Many patients with bipolar disorder (BD) are initially misdiagnosed with major depressive disorder (MDD) and are treated with antidepressants, whose potential iatrogenic effects are widely discussed. ...It is unknown whether MDD is a comorbidity of BD or its earlier stage, and no consensus exists on individual conversion predictors, delaying BD's timely recognition and treatment. We aimed to build a predictive model of MDD to BD conversion and to validate it across a multi-national network of patient databases using the standardization afforded by the Observational Medical Outcomes Partnership (OMOP) common data model. Five "training" US databases were retrospectively analyzed: IBM MarketScan CCAE, MDCR, MDCD, Optum EHR, and Optum Claims. Cyclops regularized logistic regression models were developed on one-year MDD-BD conversion with all standard covariates from the HADES PatientLevelPrediction package. Time-to-conversion Kaplan-Meier analysis was performed up to a decade after MDD, stratified by model-estimated risk. External validation of the final prediction model was performed across 9 patient record databases within the Observational Health Data Sciences and Informatics (OHDSI) network internationally. The model's area under the curve (AUC) varied 0.633-0.745 (µ = 0.689) across the five US training databases. Nine variables predicted one-year MDD-BD transition. Factors that increased risk were: younger age, severe depression, psychosis, anxiety, substance misuse, self-harm thoughts/actions, and prior mental disorder. AUCs of the validation datasets ranged 0.570-0.785 (µ = 0.664). An assessment algorithm was built for MDD to BD conversion that allows distinguishing as much as 100-fold risk differences among patients and validates well across multiple international data sources.
Cognitive processing therapy (CPT) and prolonged exposure therapy (PE) are effective psychotherapies for post-traumatic stress disorder (PTSD). However, these treatments also have high rates of ...dropout and non-response. Therefore, patients may need a second course of treatment. We compared outcomes for patients who switched between CPT/PE and those who repeated CPT/PE during a second course of treatment. We collected data from Iraq and Afghanistan war veterans (n = 2,958) who received a second course of CPT/PE in the Veterans Health Administration from 2001 to 2017 and had symptom outcomes (PTSD checklist; PCL). We measured the association between treatment sequence and change in PCL score over the second course of treatment using hierarchical Bayesian regression, adjusted for sociodemographic and clinical characteristics. All treatment sequences showed a significant reduction in PCL score over time (β = −4.80; HDI
95
: −5.74, −3.86). Veterans who switched from CPT to PE had modestly greater PCL reductions during the second course than those who repeated CPT. However, no significant difference in PCL change during the second course was observed between veterans who repeated PE and those who switched from PE to CPT. Veterans participating in a second course of CPT/PE can benefit, and switching treatment may be slightly more beneficial following CPT.
Full text
Available for:
BFBNIB, DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, UILJ, UKNU, UL, UM, UPUK
Accurate information is needed to direct healthcare systems' efforts to control methicillin-resistant Staphylococcus aureus (MRSA). Assembling complete and correct microbiology data is vital to ...understanding and addressing the multiple drug-resistant organisms in our hospitals.
Herein, we describe a system that securely gathers microbiology data from the Department of Veterans Affairs (VA) network of databases. Using natural language processing methods, we applied an information extraction process to extract organisms and susceptibilities from the free-text data. We then validated the extraction against independently derived electronic data and expert annotation.
We estimate that the collected microbiology data are 98.5% complete and that methicillin-resistant Staphylococcus aureus was extracted accurately 99.7% of the time.
Applying natural language processing methods to microbiology records appears to be a promising way to extract accurate and useful nosocomial pathogen surveillance data. Both scientific inquiry and the data's reliability will be dependent on the surveillance system's capability to compare from multiple sources and circumvent systematic error. The dataset constructed and methods used for this investigation could contribute to a comprehensive infectious disease surveillance system or other pressing needs.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Angiotensin-converting enzyme inhibitors (ACEIs) and angiotensin receptor blockers (ARBs) have been postulated to affect susceptibility to COVID-19. Observational studies so far have lacked rigorous ...ascertainment adjustment and international generalisability. We aimed to determine whether use of ACEIs or ARBs is associated with an increased susceptibility to COVID-19 in patients with hypertension.
In this international, open science, cohort analysis, we used electronic health records from Spain (Information Systems for Research in Primary Care SIDIAP) and the USA (Columbia University Irving Medical Center data warehouse CUIMC and Department of Veterans Affairs Observational Medical Outcomes Partnership VA-OMOP) to identify patients aged 18 years or older with at least one prescription for ACEIs and ARBs (target cohort) or calcium channel blockers (CCBs) and thiazide or thiazide-like diuretics (THZs; comparator cohort) between Nov 1, 2019, and Jan 31, 2020. Users were defined separately as receiving either monotherapy with these four drug classes, or monotherapy or combination therapy (combination use) with other antihypertensive medications. We assessed four outcomes: COVID-19 diagnosis; hospital admission with COVID-19; hospital admission with pneumonia; and hospital admission with pneumonia, acute respiratory distress syndrome, acute kidney injury, or sepsis. We built large-scale propensity score methods derived through a data-driven approach and negative control experiments across ten pairwise comparisons, with results meta-analysed to generate 1280 study effects. For each study effect, we did negative control outcome experiments using a possible 123 controls identified through a data-rich algorithm. This process used a set of predefined baseline patient characteristics to provide the most accurate prediction of treatment and balance among patient cohorts across characteristics. The study is registered with the EU Post-Authorisation Studies register, EUPAS35296.
Among 1 355 349 antihypertensive users (363 785 ACEI or ARB monotherapy users, 248 915 CCB or THZ monotherapy users, 711 799 ACEI or ARB combination users, and 473 076 CCB or THZ combination users) included in analyses, no association was observed between COVID-19 diagnosis and exposure to ACEI or ARB monotherapy versus CCB or THZ monotherapy (calibrated hazard ratio HR 0·98, 95% CI 0·84–1·14) or combination use exposure (1·01, 0·90–1·15). ACEIs alone similarly showed no relative risk difference when compared with CCB or THZ monotherapy (HR 0·91, 95% CI 0·68–1·21; with heterogeneity of >40%) or combination use (0·95, 0·83–1·07). Directly comparing ACEIs with ARBs demonstrated a moderately lower risk with ACEIs, which was significant with combination use (HR 0·88, 95% CI 0·79–0·99) and non-significant for monotherapy (0·85, 0·69–1·05). We observed no significant difference between drug classes for risk of hospital admission with COVID-19, hospital admission with pneumonia, or hospital admission with pneumonia, acute respiratory distress syndrome, acute kidney injury, or sepsis across all comparisons.
No clinically significant increased risk of COVID-19 diagnosis or hospital admission-related outcomes associated with ACEI or ARB use was observed, suggesting users should not discontinue or change their treatment to decrease their risk of COVID-19.
Wellcome Trust, UK National Institute for Health Research, US National Institutes of Health, US Department of Veterans Affairs, Janssen Research & Development, IQVIA, South Korean Ministry of Health and Welfare Republic, Australian National Health and Medical Research Council, and European Health Data and Evidence Network.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP