Pathologists use visual classification of glomerular lesions to assess samples from patients with diabetic nephropathy (DN). The results may vary among pathologists. Digital algorithms may reduce ...this variability and provide more consistent image structure interpretation.
We developed a digital pipeline to classify renal biopsies from patients with DN. We combined traditional image analysis with modern machine learning to efficiently capture important structures, minimize manual effort and supervision, and enforce biologic prior information onto our model. To computationally quantify glomerular structure despite its complexity, we simplified it to three components consisting of nuclei, capillary lumina and Bowman spaces; and Periodic Acid-Schiff positive structures. We detected glomerular boundaries and nuclei from whole slide images using convolutional neural networks, and the remaining glomerular structures using an unsupervised technique developed expressly for this purpose. We defined a set of digital features which quantify the structural progression of DN, and a recurrent network architecture which processes these features into a classification.
Our digital classification agreed with a senior pathologist whose classifications were used as ground truth with moderate Cohen's kappa κ = 0.55 and 95% confidence interval 0.50, 0.60. Two other renal pathologists agreed with the digital classification with κ
= 0.68, 95% interval 0.50, 0.86 and κ
= 0.48, 95% interval 0.32, 0.64. Our results suggest computational approaches are comparable to human visual classification methods, and can offer improved precision in clinical decision workflows. We detected glomerular boundaries from whole slide images with 0.93±0.04 balanced accuracy, glomerular nuclei with 0.94 sensitivity and 0.93 specificity, and glomerular structural components with 0.95 sensitivity and 0.99 specificity.
Computationally derived, histologic image features hold significant diagnostic information that may augment clinical diagnostics.
We demonstrate a simple and effective automated method for the localization of glomeruli in large (~1 gigapixel) histopathological whole-slide images (WSIs) of thin renal tissue sections and ...biopsies, using an adaptation of the well-known local binary patterns (LBP) image feature vector to train a support vector machine (SVM) model. Our method offers high precision (>90%) and reasonable recall (>70%) for glomeruli from WSIs, is readily adaptable to glomeruli from multiple species, including mouse, rat, and human, and is robust to diverse slide staining methods. Using 5 Intel(R) Core(TM) i7-4790 CPUs with 40 GB RAM, our method typically requires ~15 sec for training and ~2 min to extract glomeruli reproducibly from a WSI. Deploying a deep convolutional neural network trained for glomerular recognition in tandem with the SVM suffices to reduce false positives to below 3%. We also apply our LBP-based descriptor to successfully detect pathologic changes in a mouse model of diabetic nephropathy. We envision potential clinical and laboratory applications for this approach in the study and diagnosis of glomerular disease, and as a means of greatly accelerating the construction of feature sets to fuel deep learning studies into tissue structure and pathology.
Lupus podocytopathy (LP) is an entity that is increasingly being reported in the literature on systemic lupus erythematosus (SLE). LP is characterized by nephrotic syndrome in SLE patients with ...diffuse glomerular podocyte foot process effacement and no immune complex deposits along the capillary loops. Histologically, LP typically mimics minimal change disease or primary focal segmental glomerulosclerosis (FSGS) on a background of ISN/RPS class I or II lupus nephritis. In situations where there are coexistent glomerular diseases, however, LP may be easily masked by background lesions and overlapping clinical symptoms.
We report the case of a 24-year-old woman with type I diabetes, hypertension, psoriasis/rash, and intermittent arthritis who presented with abrupt onset of severe nephrotic proteinuria and renal insufficiency. Renal biopsy revealed nodular glomerulosclerosis and FSGS. Immune deposits were not identified by immunofluorescence or electron microscopy. Ultrastructurally, there was diffuse glomerular basement membrane thickening and over 90% podocyte foot process effacement. With no prior established diagnosis of SLE, the patient was initially diagnosed with diabetic nephropathy with coexistent FSGS, and the patient was started on angiotensin-converting enzyme inhibitors (ACEI) and diuretics. However, nephrotic proteinuria persisted and renal function deteriorated. The patient concurrently developed hemolytic anemia with pancytopenia.
Subsequent to the biopsy, serologic results showed positive autoantibodies against double strand DNA (dsDNA), Smith antigen, ribonucleoprotein (RNP), and Histone. A renal biopsy was repeated, revealing essentially similar findings to those of the previous biopsy. Integrating serology and clinical presentation, SLE was favored. The pathology findings were re-evaluated and considered to be most consistent with LP and coexistent diabetic nephropathy, with superimposed FSGS either as a component of LP or as a lesion secondary to diabetes or hypertension.
The patient was started on high-dose prednisone at 60 mg/day, with subsequent addition of mycophenolate mofetil and ACEI, while prednisone was gradually tapered.
The patient's proteinuria, serum creatinine, complete blood counts, skin rash, and arthritis were all significantly improved.
The diagnosis of LP when confounded by other glomerular diseases that may cause nephrotic syndrome can be challenging. Sufficient awareness of this condition is necessary for the appropriate diagnosis and treatment.
Quantitative histomorphometry (QH) refers to the application of advanced computational image analysis to reproducibly describe disease appearance on digitized histopathology images. QH thus could ...serve as an important complementary tool for pathologists in interrogating and interpreting cancer morphology and malignancy. In the US, annually, over 60,000 prostate cancer patients undergo radical prostatectomy treatment. Around 10,000 of these men experience biochemical recurrence within 5 years of surgery, a marker for local or distant disease recurrence. The ability to predict the risk of biochemical recurrence soon after surgery could allow for adjuvant therapies to be prescribed as necessary to improve long term treatment outcomes. The underlying hypothesis with our approach, co-occurring gland angularity (CGA), is that in benign or less aggressive prostate cancer, gland orientations within local neighborhoods are similar to each other but are more chaotically arranged in aggressive disease. By modeling the extent of the disorder, we can differentiate surgically removed prostate tissue sections from (a) benign and malignant regions and (b) more and less aggressive prostate cancer. For a cohort of 40 intermediate-risk (mostly Gleason sum 7) surgically cured prostate cancer patients where half suffered biochemical recurrence, the CGA features were able to predict biochemical recurrence with 73% accuracy. Additionally, for 80 regions of interest chosen from the 40 studies, corresponding to both normal and cancerous cases, the CGA features yielded a 99% accuracy. CGAs were shown to be statistically signicantly (Formula: see text) better at predicting BCR compared to state-of-the-art QH methods and postoperative prostate cancer nomograms.
With the increasing ability to routinely and rapidly digitize whole slide images with slide scanners, there has been interest in developing computerized image analysis algorithms for automated ...detection of disease extent from digital pathology images. The manual identification of presence and extent of breast cancer by a pathologist is critical for patient management for tumor staging and assessing treatment response. However, this process is tedious and subject to inter- and intra-reader variability. For computerized methods to be useful as decision support tools, they need to be resilient to data acquired from different sources, different staining and cutting protocols and different scanners. The objective of this study was to evaluate the accuracy and robustness of a deep learning-based method to automatically identify the extent of invasive tumor on digitized images. Here, we present a new method that employs a convolutional neural network for detecting presence of invasive tumor on whole slide images. Our approach involves training the classifier on nearly 400 exemplars from multiple different sites, and scanners, and then independently validating on almost 200 cases from The Cancer Genome Atlas. Our approach yielded a Dice coefficient of 75.86%, a positive predictive value of 71.62% and a negative predictive value of 96.77% in terms of pixel-by-pixel evaluation compared to manually annotated regions of invasive ductal carcinoma.
Precise detection of invasive cancer on whole-slide images (WSI) is a critical first step in digital pathology tasks of diagnosis and grading. Convolutional neural network (CNN) is the most popular ...representation learning method for computer vision tasks, which have been successfully applied in digital pathology, including tumor and mitosis detection. However, CNNs are typically only tenable with relatively small image sizes (200 × 200 pixels). Only recently, Fully convolutional networks (FCN) are able to deal with larger image sizes (500 × 500 pixels) for semantic segmentation. Hence, the direct application of CNNs to WSI is not computationally feasible because for a WSI, a CNN would require billions or trillions of parameters. To alleviate this issue, this paper presents a novel method, High-throughput Adaptive Sampling for whole-slide Histopathology Image analysis (HASHI), which involves: i) a new efficient adaptive sampling method based on probability gradient and quasi-Monte Carlo sampling, and, ii) a powerful representation learning classifier based on CNNs. We applied HASHI to automated detection of invasive breast cancer on WSI. HASHI was trained and validated using three different data cohorts involving near 500 cases and then independently tested on 195 studies from The Cancer Genome Atlas. The results show that (1) the adaptive sampling method is an effective strategy to deal with WSI without compromising prediction accuracy by obtaining comparative results of a dense sampling (∼6 million of samples in 24 hours) with far fewer samples (∼2,000 samples in 1 minute), and (2) on an independent test dataset, HASHI is effective and robust to data from multiple sites, scanners, and platforms, achieving an average Dice coefficient of 76%.
To determine the feasibility of using a computer-aided diagnosis (CAD) system to differentiate among triple-negative breast cancer, estrogen receptor (ER)-positive cancer, human epidermal growth ...factor receptor type 2 (HER2)-positive cancer, and benign fibroadenoma lesions on dynamic contrast material-enhanced (DCE) magnetic resonance (MR) images.
This is a retrospective study of prospectively acquired breast MR imaging data collected from an institutional review board-approved, HIPAA-compliant study between 2002 and 2007. Written informed consent was obtained from all patients. The authors collected DCE MR images from 65 women with 76 breast lesions who had been recruited into a larger study of breast MR imaging. The women had triple-negative (n = 21), ER-positive (n = 25), HER2-positive (n = 18), or fibroadenoma (n = 12) lesions. All lesions were classified as Breast Imaging Reporting and Data System category 4 or higher on the basis of previous imaging. Images were subject to quantitative feature extraction, feed-forward feature selection by means of linear discriminant analysis, and lesion classification by using a support vector machine classifier. The area under the receiver operating characteristic curve (Az) was calculated for each of five lesion classification tasks involving triple-negative breast cancers.
For each pair-wise lesion type comparison, linear discriminant analysis helped identify the most discriminatory features, which in conjunction with a support vector machine classifier yielded an Az of 0.73 (95% confidence interval CI: 0.59, 0.87) for triple-negative cancer versus all non-triple-negative lesions, 0.74 (95% CI: 0.60, 0.88) for triple-negative cancer versus ER- and HER2-positive cancer, 0.77 (95% CI: 0.63, 0.91) for triple-negative versus ER-positive cancer, 0.74 (95% CI: 0.58, 0.89) for triple-negative versus HER2-positive cancer, and 0.97 (95% CI: 0.91, 1.00) for triple-negative cancer versus fibroadenoma.
Triple-negative cancers possess certain characteristic features on DCE MR images that can be captured and quantified with CAD, enabling good discrimination of triple-negative cancers from non-triple-negative cancers, as well as between triple-negative cancers and benign fibroadenomas. Such CAD algorithms may provide added diagnostic benefit in identifying the highly aggressive triple-negative cancer phenotype with DCE MR imaging in high-risk women.
Gene-expression companion diagnostic tests, such as the Oncotype DX test, assess the risk of early stage Estrogen receptor (ER) positive (+) breast cancers, and guide clinicians in the decision of ...whether or not to use chemotherapy. However, these tests are typically expensive, time consuming, and tissue-destructive.
In this paper, we evaluate the ability of computer-extracted nuclear morphology features from routine hematoxylin and eosin (H&E) stained images of 178 early stage ER+ breast cancer patients to predict corresponding risk categories derived using the Oncotype DX test. A total of 216 features corresponding to the nuclear shape and architecture categories from each of the pathologic images were extracted and four feature selection schemes: Ranksum, Principal Component Analysis with Variable Importance on Projection (PCA-VIP), Maximum-Relevance, Minimum Redundancy Mutual Information Difference (MRMR MID), and Maximum-Relevance, Minimum Redundancy - Mutual Information Quotient (MRMR MIQ), were employed to identify the most discriminating features. These features were employed to train 4 machine learning classifiers: Random Forest, Neural Network, Support Vector Machine, and Linear Discriminant Analysis, via 3-fold cross validation.
The four sets of risk categories, and the top Area Under the receiver operating characteristic Curve (AUC) machine classifier performances were: 1) Low ODx and Low mBR grade vs. High ODx and High mBR grade (Low-Low vs. High-High) (AUC = 0.83), 2) Low ODx vs. High ODx (AUC = 0.72), 3) Low ODx vs. Intermediate and High ODx (AUC = 0.58), and 4) Low and Intermediate ODx vs. High ODx (AUC = 0.65). Trained models were tested independent validation set of 53 cases which comprised of Low and High ODx risk, and demonstrated per-patient accuracies ranging from 75 to 86%.
Our results suggest that computerized image analysis of digitized H&E pathology images of early stage ER+ breast cancer might be able predict the corresponding Oncotype DX risk categories.
The identification of phenotypic changes in breast cancer (BC) histopathology on account of corresponding molecular changes is of significant clinical importance in predicting disease outcome. One ...such example is the presence of lymphocytic infiltration (LI) in histopathology, which has been correlated with nodal metastasis and distant recurrence in HER2+ BC patients. In this paper, we present a computer-aided diagnosis (CADx) scheme to automatically detect and grade the extent of LI in digitized HER2+ BC histopathology. Lymphocytes are first automatically detected by a combination of region growing and Markov random field algorithms. Using the centers of individual detected lymphocytes as vertices, three graphs (Voronoi diagram, Delaunay triangulation, and minimum spanning tree) are constructed and a total of 50 image-derived features describing the arrangement of the lymphocytes are extracted from each sample. A nonlinear dimensionality reduction scheme, graph embedding (GE), is then used to project the high-dimensional feature vector into a reduced 3-D embedding space. A support vector machine classifier is used to discriminate samples with high and low LI in the reduced dimensional embedding space. A total of 41 HER2+ hematoxylin-and-eosin-stained images obtained from 12 patients were considered in this study. For more than 100 three-fold cross-validation trials, the architectural feature set successfully distinguished samples of high and low LI levels with a classification accuracy greater than 90%. The popular unsupervised Varma-Zisserman texton-based classification scheme was used for comparison and yielded a classification accuracy of only 60%. Additionally, the projection of the 50 image-derived features for all 41 tissue samples into a reduced dimensional space via GE allowed for the visualization of a smooth manifold that revealed a continuum between low, intermediate, and high levels of LI. Since it is known that extent of LI in BC biopsy specimens is a prognostic indicator, our CADx scheme will potentially help clinicians determine disease outcome and allow them to make better therapy recommendations for patients with HER2+ BC.