An International Ki67 Reproducibility Study POLLEY, Mei-Yin C; LEUNG, Samuel C. Y; GOWN, Allen M ...
JNCI : Journal of the National Cancer Institute,
12/2013, Letnik:
105, Številka:
24
Journal Article
Recenzirano
Odprti dostop
In breast cancer, immunohistochemical assessment of proliferation using the marker Ki67 has potential use in both research and clinical management. However, lack of consistency across laboratories ...has limited Ki67's value. A working group was assembled to devise a strategy to harmonize Ki67 analysis and increase scoring concordance. Toward that goal, we conducted a Ki67 reproducibility study.
Eight laboratories received 100 breast cancer cases arranged into 1-mm core tissue microarrays-one set stained by the participating laboratory and one set stained by the central laboratory, both using antibody MIB-1. Each laboratory scored Ki67 as percentage of positively stained invasive tumor cells using its own method. Six laboratories repeated scoring of 50 locally stained cases on 3 different days. Sources of variation were analyzed using random effects models with log2-transformed measurements. Reproducibility was quantified by intraclass correlation coefficient (ICC), and the approximate two-sided 95% confidence intervals (CIs) for the true intraclass correlation coefficients in these experiments were provided.
Intralaboratory reproducibility was high (ICC = 0.94; 95% CI = 0.93 to 0.97). Interlaboratory reproducibility was only moderate (central staining: ICC = 0.71, 95% CI = 0.47 to 0.78; local staining: ICC = 0.59, 95% CI = 0.37 to 0.68). Geometric mean of Ki67 values for each laboratory across the 100 cases ranged 7.1% to 23.9% with central staining and 6.1% to 30.1% with local staining. Factors contributing to interlaboratory discordance included tumor region selection, counting method, and subjective assessment of staining positivity. Formal counting methods gave more consistent results than visual estimation.
Substantial variability in Ki67 scoring was observed among some of the world's most experienced laboratories. Ki67 values and cutoffs for clinical decision-making cannot be transferred between laboratories without standardizing scoring methodology because analytical validity is limited.
Purpose
Alterations to mismatch repair (MMR) pathways are a known cause of cancer, particularly colorectal and endometrial carcinomas. Recently, checkpoint inhibitors have been approved for use in ...MMR-deficient cancers of any type (Prasad et al. in JAMA Oncol 4:157–158, 2018). Functional studies in breast cancer have shown associations between MMR loss, resistance to aromatase inhibitors and sensitivity to palbociclib (Haricharan et al. in Cancer Discov 7:1168–1183, 2017). Herein, we investigate the clinical meaning of MMR deficiency in breast cancer by immunohistochemical assessment of MSH2, MSH6, MLH1 and PMS2 on a large series of breast cancers linked to detailed biomarker and long-term outcome data.
Methods
Cases were classified as MMR intact when all four markers expressed nuclear reactivity, but MMR-deficient when at least one of the four biomarkers displayed loss of nuclear staining in the presence of positive internal stromal controls on the tissue microarray core.
Results
Among the 1635 cases with interpretable staining, we identified 31 (1.9%) as MMR-deficient. In our cohort, MMR deficiency was present across all major breast cancer subtypes, and was associated with high-grade, low-progesterone receptor expression and high tumor-infiltrating lymphocyte counts. MMR deficiency is significantly associated with inferior overall (HR 2.29, 95% CI 1.02–5.17,
p
= 0.040) and disease-specific survival (HR 2.71, 95% CI 1.00–7.35,
p
= 0.042) in the 431 estrogen receptor-positive patients who were uniformly treated with tamoxifen as their sole adjuvant systemic therapy.
Conclusion
Overall, this study supports the concept that breast cancer patients with MMR deficiency as assessed by immunohistochemistry may be good candidates for alternative treatment approaches such as immune checkpoint or CDK4 inhibitors.
We studied 2780 adults in Hong Kong who received CoronaVac inactivated virus vaccine (Sinovac) and BNT162b2 mRNA vaccine (“Comirnaty”, BioNTech/Fosun Pharma). We compared rates of antibody waning ...over time using an enzyme-linked immunosorbent assay for spike receptor binding domain and a surrogate virus neutralization test. We found stronger and more durable antibody responses to two doses of the mRNA vaccine, and slightly stronger initial antibody responses to each vaccine in younger adults and women. The weaker and less durable responses following CoronaVac support earlier provision of third doses to persons who previously received two doses of this vaccine.
Ki67 expression has been a valuable prognostic variable in breast cancer, but has not seen broad adoption due to lack of standardization between institutions. Automation could represent a solution. ...Here we investigate the reproducibility of Ki67 measurement between three image analysis platforms with supervised classifiers performed by the same operator, by multiple operators, and finally we compare their accuracy in prognostic potential. Two breast cancer patient cohorts were used for this study. The standardization was done with the 30 cases of ER+ breast cancer that were used in phase 3 of International Ki67 in Breast Cancer Working Group initiatives where blocks were centrally cut and stained for Ki67. The outcome cohort was from 149 breast cancer cases from the Yale Pathology archives. A tissue microarray was built from representative tissue blocks with median follow-up of 120 months. The Mib-1 antibody (Dako) was used to detect Ki67 (dilution 1:100). HALO (IndicaLab), QuantCenter (3DHistech), and QuPath (open source software) digital image analysis (DIA) platforms were used to evaluate Ki67 expression. Intraclass correlation coefficient (ICC) was used to measure reproducibility. Between-DIA platform reproducibility was excellent (ICC: 0.933, CI: 0.879–0.966). Excellent reproducibility was found between all DIA platforms and the reference standard Ki67 values of Spectrum Webscope (QuPath-Spectrum Webscope ICC: 0.970, CI: 0.936–0.986; HALO-Spectrum Webscope ICC: 0.968, CI: 0.933–0.985; QuantCenter-Spectrum Webscope ICC: 0.964, CI: 0.919–0.983). All platforms showed excellent intra-DIA reproducibility (QuPath ICC: 0.992, CI: 0.986–0.996; HALO ICC: 0.972, CI: 0.924–0.988; QuantCenter ICC: 0.978, CI: 0.932–0.991). Comparing each DIA against outcome, the hazard ratios were similar. The inter-operator reproducibility was particularly high (ICC: 0.962–0.995). Our results showed outstanding reproducibility both within and between-DIA platforms, including one freely available DIA platform (QuPath). We also found the platforms essentially indistinguishable with respect to prediction of breast cancer patient outcome. Results justify multi-institutional DIA studies to assess clinical utility.
This study shows excellent reproducibility among 3 different digital image (DIA) analysis platforms in Ki67 scoring. An outstanding concordance was found among four operators using the same calibrated DIA platform suggesting a highly reproducible Ki67 scoring method. Finally, the authors also found that all three DIA platforms were essentially indistinguishable with respect to prediction of breast cancer patient outcome.
Aims
Vulvar squamous cell carcinoma (VSCC) can be subdivided by human papillomavirus (HPV) status into two clinicopathological entities. Studies on the prognostic significance of HPV in VSCC are ...discordant.
Methods and results
We performed a retrospective analysis of overall survival (OS), disease‐specific survival (DSS) and progression‐free survival (PFS) in 217 patients with VSCC. Cases were extracted from an era of more aggressive en‐bloc radical dissections (1985–95) and more localized radical surgery through separate vulvar and groin excisions (1996–2005). p16 immunohistochemistry was used as a surrogate for HPV status. HPV status could be determined in 197 tumours, 118 HPV‐independent and 79 HPV‐associated tumours. Patients with HPV‐associated tumours were younger (mean 58.8 versus 71.6 years for HPV‐independent tumours, P < 0.0001) and more likely to have prior abnormal cervical cytology (41.1 versus 5.6% for HPV‐independent tumours, P < 0.0001). In univariable analysis, patients with HPV‐associated tumours had superior PFS hazard ratio (HR): 0.37, 95% confidence interval (CI): 0.18–0.70, DSS (HR: 0.19, 95% CI: 0.08–0.41) and OS (HR: 0.35, 95% CI: 0.21–0.59). This was driven by worse outcomes (PFS, DSS and OS) for patients with HPV‐independent tumours compared with HPV‐associated tumours who underwent surgery after 1995. After adjusting for age and stage in multivariable analysis, patients with HPV‐associated tumours showed superior PFS (HR: 0.25, 95% CI: 0.07–0.77) and DSS (HR: 0.21, 95% CI: 0.04–0.78).
Conclusions
VSCC can be stratified into two prognostically different diseases based on p16 immunostaining. HPV status was associated only with prognosis in the cohort that underwent surgery after 1995, suggesting that more conservative surgery may have led to worse outcomes for patients with HPV‐independent tumours.
Depression is a global health priority. Maintaining and delaying depressive symptoms in older adults is a key to healthy aging. This study aimed to identify depressive symptom trajectories, ...predictors and mortality, while also exploring the relationship between air quality and depressive symptoms in older adults in the Hong Kong community over 14 years.
This study is a longitudinal study in Hong Kong. The target population was community-dwelling older adults over age 65. Depressive symptoms were measured by the Geriatric Depression Scale (GDS-15). Group-based trajectory model was used to identify heterogeneity in longitudinal changes over 14 years and examine the associations between baseline variables and trajectories for different cohort members using multinomial logistic regression. The Kaplan-Meier method was employed to conduct survival analysis and explore the variations in survival probabilities over time among different trajectory group. Linear mixed model was used to explore the relationship between air quality and depressive symptoms.
A total of 2828 older adults were included. Three different trajectories of depressive symptoms in older people were identified: relatively stable (15.4%), late increase (67.1%) and increase (17.5%). Female, more number of chronic diseases, poor cognitive function, and poor health-related quality of life (HRQOL) were significantly associated with other less favorable trajectories compared with participants with stable levels of depressive symptoms. The late increase group had a lower mortality rate than the relatively stable and increased groups. Lower baseline ambient air pollutant exposure to NO
over 14 years was significantly associated with fewer depressive symptoms.
In this study, we found that a late increase in depressive symptoms was the predominant trend in older Chinese people in Hong Kong. Poorer HRQOL was predictive of less favorable trajectories of depressive symptoms. Ambient air pollution was associated with depressive symptoms. This novel observation strengthens the epidemiological evidence of longitudinal changes in depressive symptoms and associations with late-life exposure to air pollution.
Supercooling of water complicates phase change dynamics, the understanding of which remains limited yet vital to energy-related and aerospace processes. Here, we investigate the freezing and jumping ...dynamics of supercooled water droplets on superhydrophobic surfaces, induced by a remarkable vaporization momentum, in a low-pressure environment. The vaporization momentum arises from the vaporization at droplet's free surface, progressed and intensified by recalescence, subsequently inducing droplet compression and finally self-jumping. By incorporating liquid-gas-solid phase changes involving vaporization, freezing recalescence, and liquid-solid interactions, we resolve the vaporization momentum and droplet dynamics, revealing a size-scaled jumping velocity and a nucleation-governed jumping direction. A droplet-size-defined regime map is established, distinguishing the vaporization-momentum-dominated self-jumping from evaporative drying and overpressure-initiated levitation, all induced by depressurization and vaporization. Our findings illuminate the role of supercooling and low-pressure mediated phase change in shaping fluid transport dynamics, with implications for passive anti-icing, advanced cooling, and climate physics.
The nuclear proliferation biomarker Ki67 has potential prognostic, predictive, and monitoring roles in breast cancer. Unacceptable between-laboratory variability has limited its clinical value. The ...International Ki67 in Breast Cancer Working Group investigated whether Ki67 immunohistochemistry can be analytically validated and standardized across laboratories using automated machine-based scoring. Sets of pre-stained core-cut biopsy sections of 30 breast tumors were circulated to 14 laboratories for scanning and automated assessment of the average and maximum percentage of tumor cells positive for Ki67. Seven unique scanners and 10 software platforms were involved in this study. Pre-specified analyses included evaluation of reproducibility between all laboratories (primary) as well as among those using scanners from a single vendor (secondary). The primary reproducibility metric was intraclass correlation coefficient between laboratories, with success considered to be intraclass correlation coefficient >0.80. Intraclass correlation coefficient for automated average scores across 16 operators was 0.83 (95% credible interval: 0.73-0.91) and intraclass correlation coefficient for maximum scores across 10 operators was 0.63 (95% credible interval: 0.44-0.80). For the laboratories using scanners from a single vendor (8 score sets), intraclass correlation coefficient for average automated scores was 0.89 (95% credible interval: 0.81-0.96), which was similar to the intraclass correlation coefficient of 0.87 (95% credible interval: 0.81-0.93) achieved using these same slides in a prior visual-reading reproducibility study. Automated machine assessment of average Ki67 has the potential to achieve between-laboratory reproducibility similar to that for a rigorously standardized pathologist-based visual assessment of Ki67. The observed intraclass correlation coefficient was worse for maximum compared to average scoring methods, suggesting that maximum score methods may be suboptimal for consistent measurement of proliferation. Automated average scoring methods show promise for assessment of Ki67 scoring, but requires further standardization and subsequent clinical validation.
Although an important biomarker in breast cancer, Ki67 lacks scoring standardization, which has limited its clinical use. Our previous study found variability when laboratories used their own scoring ...methods on centrally stained tissue microarray slides. In this current study, 16 laboratories from eight countries calibrated to a specific Ki67 scoring method and then scored 50 centrally MIB-1 stained tissue microarray cases. Simple instructions prescribed scoring pattern and staining thresholds for determination of the percentage of stained tumor cells. To calibrate, laboratories scored 18 'training' and 'test' web-based images. Software tracked object selection and scoring. Success for the calibration was prespecified as Root Mean Square Error of scores compared with reference <0.6 and Maximum Absolute Deviation from reference <1.0 (log2-transformed data). Prespecified success criteria for tissue microarray scoring required intraclass correlation significantly >0.70 but aiming for observed intraclass correlation ≥0.90. Laboratory performance showed non-significant but promising trends of improvement through the calibration exercise (mean Root Mean Square Error decreased from 0.6 to 0.4, Maximum Absolute Deviation from 1.6 to 0.9; paired t-test: P=0.07 for Root Mean Square Error, 0.06 for Maximum Absolute Deviation). For tissue microarray scoring, the intraclass correlation estimate was 0.94 (95% credible interval: 0.90-0.97), markedly and significantly >0.70, the prespecified minimum target for success. Some discrepancies persisted, including around clinically relevant cutoffs. After calibrating to a common scoring method via a web-based tool, laboratories can achieve high inter-laboratory reproducibility in Ki67 scoring on centrally stained tissue microarray slides. Although these data are potentially encouraging, suggesting that it may be possible to standardize scoring of Ki67 among pathology laboratories, clinically important discrepancies persist. Before this biomarker could be recommended for clinical use, future research will need to extend this approach to biopsies and whole sections, account for staining variability, and link to outcomes.
To prospectively study the relationship among vertebral marrow fat content, marrow diffusion indexes, and marrow and erector spinae muscle perfusion indexes in female subjects with varying bone ...mineral density.
Institutional study approval and informed consent were obtained. Dual x-ray absorptiometry, proton magnetic resonance (MR) spectroscopy, diffusion-weighted MR imaging, and dynamic contrast material-enhanced MR imaging of the lumbar spine and erector spinae muscle were performed in 110 women (mean age, 73 years; range, 67-84 years). Marrow fat content, marrow apparent diffusion coefficient (ADC), and perfusion indexes (maximum enhancement and enhancement slope) of marrow and erector spinae muscle were compared among three bone density groups (normal, osteopenic, and osteoporotic). The t test comparisons and Pearson correlations were applied.
Seven subjects were excluded, which yielded a final cohort of 103 subjects: 18 with normal bone density, 30 with osteopenia, and 55 with osteoporosis. Vertebral marrow fat content was significantly increased in the osteoporotic group (67.8% +/- 8.5 standard deviation) when compared with that of the normal bone density group (59.2% +/- 10.0, P = .002). Vertebral marrow perfusion indexes were significantly decreased in the osteoporotic group (enhancement slope, 1.10%/sec +/- 0.51) compared with those of the osteopenic group (1.45%/sec +/- 0.51, P = .01) and normal bone density group (1.70%/sec +/- 0.52, P < .001). Erector spinae muscle perfusion indexes did not decrease as bone density decreased. The ADC of vertebral marrow did not change with bone density.
The subjects experienced a decrease in vertebral marrow maximum enhancement and enhancement slope and an increase in marrow fat content as bone density decreased. The reduction in perfusion indexes occurred only within the vertebral body and not in the paravertebral tissues supplied by the same artery.