Advancing from statistical associations of complex traits with genetic markers to understanding the functional genetic variants that influence traits is often a complex process. Fine-mapping can ...select and prioritize genetic variants for further study, yet the multitude of analytical strategies and study designs makes it challenging to choose an optimal approach. We review the strengths and weaknesses of different fine-mapping approaches, emphasizing the main factors that affect performance. Topics include interpreting results from genome-wide association studies (GWAS), the role of linkage disequilibrium, statistical fine-mapping approaches, trans-ethnic studies, genomic annotation and data integration, and other analysis and design issues.
The prostate cancer (PCa) risk-associated SNP rs11672691 is positively associated with aggressive disease at diagnosis. We showed that rs11672691 maps to the promoter of a short isoform of long ...noncoding RNA PCAT19 (PCAT19-short), which is in the third intron of the long isoform (PCAT19-long). The risk variant is associated with decreased and increased levels of PCAT19-short and PCAT19-long, respectively. Mechanistically, the risk SNP region is bifunctional with both promoter and enhancer activity. The risk variants of rs11672691 and its LD SNP rs887391 decrease binding of transcription factors NKX3.1 and YY1 to the promoter of PCAT19-short, resulting in weaker promoter but stronger enhancer activity that subsequently activates PCAT19-long. PCAT19-long interacts with HNRNPAB to activate a subset of cell-cycle genes associated with PCa progression, thereby promoting PCa tumor growth and metastasis. Taken together, these findings reveal a risk SNP-mediated promoter-enhancer switching mechanism underlying both initiation and progression of aggressive PCa.
Display omitted
•rs11672691 risk region is bifunctional with both promoter and enhancer activity•This SNP modulates the bifunctionality and reciprocal expression of PCAT19 isoforms•PCAT19-long interacts with HNRNPAB to activate a subset of cell-cycle genes•PCAT19-long regulates cell proliferation, tumor growth, and metastasis
Transcription factor binding site remodeling by a risk allele for aggressive prostate cancer results in conversion of a promoter to an enhancer with downstream consequences on long noncoding RNA isoform expression and oncogenesis.
Our purpose was to detect pancreatic ductal adenocarcinoma (PDAC) at the prediagnostic stage (3–36 months before clinical diagnosis) using radiomics-based machine-learning (ML) models, and to compare ...performance against radiologists in a case-control study.
Volumetric pancreas segmentation was performed on prediagnostic computed tomography scans (CTs) (median interval between CT and PDAC diagnosis: 398 days) of 155 patients and an age-matched cohort of 265 subjects with normal pancreas. A total of 88 first-order and gray-level radiomic features were extracted and 34 features were selected through the least absolute shrinkage and selection operator–based feature selection method. The dataset was randomly divided into training (292 CTs: 110 prediagnostic and 182 controls) and test subsets (128 CTs: 45 prediagnostic and 83 controls). Four ML classifiers, k-nearest neighbor (KNN), support vector machine (SVM), random forest (RM), and extreme gradient boosting (XGBoost), were evaluated. Specificity of model with highest accuracy was further validated on an independent internal dataset (n = 176) and the public National Institutes of Health dataset (n = 80). Two radiologists (R4 and R5) independently evaluated the pancreas on a 5-point diagnostic scale.
Median (range) time between prediagnostic CTs of the test subset and PDAC diagnosis was 386 (97–1092) days. SVM had the highest sensitivity (mean; 95% confidence interval) (95.5; 85.5–100.0), specificity (90.3; 84.3–91.5), F1-score (89.5; 82.3–91.7), area under the curve (AUC) (0.98; 0.94–0.98), and accuracy (92.2%; 86.7–93.7) for classification of CTs into prediagnostic versus normal. All 3 other ML models, KNN, RF, and XGBoost, had comparable AUCs (0.95, 0.95, and 0.96, respectively). The high specificity of SVM was generalizable to both the independent internal (92.6%) and the National Institutes of Health dataset (96.2%). In contrast, interreader radiologist agreement was only fair (Cohen’s kappa 0.3) and their mean AUC (0.66; 0.46–0.86) was lower than each of the 4 ML models (AUCs: 0.95–0.98) (P < .001). Radiologists also recorded false positive indirect findings of PDAC in control subjects (n = 83) (7% R4, 18% R5).
Radiomics-based ML models can detect PDAC from normal pancreas when it is beyond human interrogation capability at a substantial lead time before clinical diagnosis. Prospective validation and integration of such models with complementary fluid-based biomarkers has the potential for PDAC detection at a stage when surgical cure is a possibility.
Display omitted
Artificial intelligence can detect pancreas cancer on computed tomography well before its detection by radiologists and before the onset of cancer symptoms when surgical cure is a possibility.
There is a strong correlation between the morphological features of new tumor vessels and malignancy. However, angiogenic heterogeneity necessitates 3D microvascular data of tumor microvessels for ...more reliable quantification. To provide more accurate information regarding vessel morphological features and improve breast lesion characterization, we introduced a quantitative 3D high-definition microvasculature imaging (q3D-HDMI) as a new easily applicable and robust tool to morphologically characterize microvasculature networks in breast tumors using a contrast-free ultrasound-based imaging approach.
In this prospective study, from January 2020 through December 2021, a newly developed q3D-HDMI technique was evaluated on participants with ultrasound-identified suspicious breast lesions recommended for core needle biopsy. The morphological features of breast tumor microvessels were extracted from the q3D-HDMI. Leave-one-out cross-validation (LOOCV) was applied to test the combined diagnostic performance of multiple morphological parameters of breast tumor microvessels. Receiver operating characteristic (ROC) curves were used to evaluate the prediction performance of the generated pooled model.
Ninety-three participants (mean age 52 ± 17 years, 91 women) with 93 breast lesions were studied. The area under the ROC curve (AUC) generated with q3D-HDMI was 95.8% (95% CI 0.901-1.000), yielding a sensitivity of 91.7% and a specificity of 98.2%, that was significantly higher than the AUC generated with the q2D-HDMI (p = 0.02). When compared to q2D-HDMI, the tumor microvessel morphological parameters obtained from q3D-HDMI provides distinctive information that increases accuracy in differentiating breast tumors.
The proposed quantitative volumetric imaging technique augments conventional breast ultrasound evaluation by increasing specificity in differentiating malignant from benign breast masses.
We have developed a novel Bayesian method, PurBayes, to estimate tumor purity and detect intratumor heterogeneity based on next-generation sequencing data of paired tumor-normal tissue samples, which ...uses finite mixture modeling methods. We demonstrate our approach using simulated data and discuss its performance under varying conditions.
PurBayes is implemented as an R package, and source code is available for download through CRAN at http://cran.r-project.org/package=PurBayes.
larson.nicholas@mayo.edu
Supplementary data are available online at Bioinformatics online.
Causal variants for rare genetic diseases are often rare in the general population. Rare variants may also contribute to common complex traits and can have much larger per-allele effect sizes than ...common variants, although power to detect these associations can be limited. Sequencing costs have steadily declined with technological advancements, making it feasible to adopt whole-exome and whole-genome profiling for large biobank-scale sample sizes. These large amounts of sequencing data provide both opportunities and challenges for rare-variant association analysis. Herein, we review the basic concepts of rare-variant analysis methods, the current state-of-the-art methods in utilizing variant annotations or external controls to improve the statistical power, and particular challenges facing rare variant analysis such as accounting for population structure, extremely unbalanced case-control design. We also review recent advances and challenges in rare variant analysis for familial sequencing data and for more complex phenotypes such as survival data. Finally, we discuss other potential directions for further methodology investigation.
For many disease conditions, tissue samples are colored with multiple dyes and stains to add contrast and location information for specific proteins to accurately identify and diagnose disease. This ...presents a computational challenge for digital pathology, as whole-slide images (WSIs) need to be properly overlaid (i.e. registered) to identify co-localized features. Traditional image registration methods sometimes fail due to the high variation of cell density and insufficient texture information in WSIs-particularly at high magnifications. In this paper, we proposed a robust image registration strategy to align re-stained WSIs precisely and efficiently. This method is applied to 30 pairs of immunohistochemical (IHC) stains and their hematoxylin and eosin (H&E) counterparts. Our approach advances the existing methods in three key ways. First, we introduce refinements to existing image registration methods. Second, we present an effective weighting strategy using kernel density estimation to mitigate registration errors. Third, we account for the linear relationship across WSI levels to improve accuracy. Our experiments show significant decreases in registration errors when matching IHC and H&E pairs, enabling subcellular-level analysis on stained and re-stained histological images. We also provide a tool to allow users to develop their own registration benchmarking experiments.
In this study, we propose a mechanical analysis for estimating the internal pressure of a finitely deformed spherical compartment from Lamb wave measurements. The proposed analysis produces a ...dispersion relation associating Lamb wave speed with pressure using limited material parameters (only a strain stiffening term). The analysis was validated on ultrasound bladder vibrometry (UBV) experiments collected from 9 ex vivo porcine bladders before and after formalin cross-linking. Estimated pressures were compared with pressures measured directly by a pressure transducer. The proposed analysis proved broadly effective at estimating pressure from UBV based Lamb wave without calibration as demonstrated by the observed concordance between estimated and measured pressures (Lin's CCC = 0.82 (0.66-0.91). Theoretical limitations and potential refinements to improve the accuracy and generality of the approach are discussed.
Evaluating the association of multiple genetic variants with a trait of interest by use of kernel-based methods has made a significant impact on how genetic association analyses are conducted. An ...advantage of kernel methods is that they tend to be robust when the genetic variants have effects that are a mixture of positive and negative effects, as well as when there is a small fraction of causal variants. Another advantage is that kernel methods fit within the framework of mixed models, providing flexible ways to adjust for additional covariates that influence traits. Herein, we review the basic ideas behind the use of kernel methods for genetic association analysis as well as recent methodological advancements for different types of traits, multivariate traits, pedigree data, and longitudinal data. Finally, we discuss opportunities for future research.
Breast cancer is the second-leading cause of mortality among women around the world. Ultrasound (US) is one of the noninvasive imaging modalities used to diagnose breast lesions and monitor the ...prognosis of cancer patients. It has the highest sensitivity for diagnosing breast masses, but it shows increased false negativity due to its high operator dependency. Underserved areas do not have sufficient US expertise to diagnose breast lesions, resulting in delayed management of breast lesions. Deep learning neural networks may have the potential to facilitate early decision-making by physicians by rapidly yet accurately diagnosing and monitoring their prognosis. This article reviews the recent research trends on neural networks for breast mass ultrasound, including and beyond diagnosis. We discussed original research recently conducted to analyze which modes of ultrasound and which models have been used for which purposes, and where they show the best performance. Our analysis reveals that lesion classification showed the highest performance compared to those used for other purposes. We also found that fewer studies were performed for prognosis than diagnosis. We also discussed the limitations and future directions of ongoing research on neural networks for breast ultrasound.