Although an important biomarker in breast cancer, Ki67 lacks scoring standardization, which has limited its clinical use. Our previous study found variability when laboratories used their own scoring methods on centrally stained tissue microarray slides. In the current study, 16 laboratories from eight countries calibrated to a specific Ki67 scoring method and then scored 50 centrally MIB-1 stained tissue microarray cases. Simple instructions prescribed scoring pattern and staining thresholds for determination of the percentage of stained tumor cells. To calibrate, laboratories scored 18 'training' and 'test' web-based images. Software tracked object selection and scoring. Success for the calibration was prespecified as Root Mean Square Error of scores compared with reference <0.6 and Maximum Absolute Deviation from reference <1.0 (log2-transformed data). Prespecified success criteria for tissue microarray scoring required intraclass correlation significantly >0.70 but aiming for observed intraclass correlation ≥0.90. Laboratory performance showed non-significant but promising trends of improvement through the calibration exercise (mean Root Mean Square Error decreased from 0.6 to 0.4, Maximum Absolute Deviation from 1.6 to 0.9; paired t-test: P=0.07 for Root Mean Square Error, 0.06 for Maximum Absolute Deviation). For tissue microarray scoring, the intraclass correlation estimate was 0.94 (95% credible interval: 0.90-0.97), markedly and significantly >0.70, the prespecified minimum target for success. Some discrepancies persisted, including around clinically relevant cutoffs. After calibrating to a common scoring method via a web-based tool, laboratories can achieve high inter-laboratory reproducibility in Ki67 scoring on centrally stained tissue microarray slides. Although these data are potentially encouraging, suggesting that it may be possible to standardize scoring of Ki67 among pathology laboratories, clinically important discrepancies persist.
Before this biomarker could be recommended for clinical use, future research will need to extend this approach to biopsies and whole sections, account for staining variability, and link to outcomes.
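The calibration criteria described above (Root Mean Square Error <0.6 and Maximum Absolute Deviation <1.0 on log2-transformed data) can be illustrated with a minimal sketch. The function names, the per-laboratory list layout, and the small `offset` added before the log transform (to guard against scores of 0%) are assumptions made for illustration and are not taken from the study.

```python
import math


def calibration_metrics(scores, reference, offset=0.1):
    """Return (RMSE, maximum absolute deviation) on the log2 scale.

    scores, reference: Ki67 percentages for the same images, in the same order.
    offset: hypothetical constant added before log2 so that 0% is defined.
    """
    if not scores or len(scores) != len(reference):
        raise ValueError("scores and reference must be non-empty and equal in length")
    diffs = [
        math.log2(s + offset) - math.log2(r + offset)
        for s, r in zip(scores, reference)
    ]
    rmse = math.sqrt(sum(d * d for d in diffs) / len(diffs))
    max_abs_dev = max(abs(d) for d in diffs)
    return rmse, max_abs_dev


def passes_calibration(scores, reference, rmse_limit=0.6, mad_limit=1.0, offset=0.1):
    """Apply the two prespecified thresholds quoted in the abstract."""
    rmse, max_abs_dev = calibration_metrics(scores, reference, offset)
    return rmse < rmse_limit and max_abs_dev < mad_limit
```

Because the data are log2-transformed, a Maximum Absolute Deviation of 1.0 corresponds to a score that is double or half the reference value.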
Uncontrolled proliferation is a hallmark of cancer. In breast cancer, immunohistochemical assessment of the proportion of cells staining for the nuclear antigen Ki67 has become the most widely used method for comparing proliferation between tumor samples. Potential uses include prognosis, prediction of relative responsiveness or resistance to chemotherapy or endocrine therapy, estimation of residual risk in patients on standard therapy and as a dynamic biomarker of treatment efficacy in samples taken before, during, and after neoadjuvant therapy, particularly neoadjuvant endocrine therapy. Increasingly, Ki67 is measured in these scenarios for clinical research, including as a primary efficacy endpoint for clinical trials, and sometimes for clinical management. At present, the enormous variation in analytical practice markedly limits the value of Ki67 in each of these contexts. On March 12, 2010, an international panel of investigators with substantial expertise in the assessment of Ki67 and in the development of biomarker guidelines was convened in London by the cochairs of the Breast International Group and North American Breast Cancer Group Biomarker Working Party to consider evidence for potential applications. Comprehensive recommendations on preanalytical and analytical assessment, and interpretation and scoring of Ki67 were formulated based on current evidence. These recommendations are geared toward achieving a harmonized methodology, creating greater between-laboratory and between-study comparability, and allowing earlier valid applications of this marker in clinical practice.
An International Ki67 Reproducibility Study. POLLEY, Mei-Yin C; LEUNG, Samuel C. Y; GOWN, Allen M ...
JNCI: Journal of the National Cancer Institute, 12/2013, Volume 105, Issue 24
Journal Article, peer reviewed, open access
In breast cancer, immunohistochemical assessment of proliferation using the marker Ki67 has potential use in both research and clinical management. However, lack of consistency across laboratories has limited Ki67's value. A working group was assembled to devise a strategy to harmonize Ki67 analysis and increase scoring concordance. Toward that goal, we conducted a Ki67 reproducibility study.
Eight laboratories received 100 breast cancer cases arranged into 1-mm core tissue microarrays: one set stained by the participating laboratory and one set stained by the central laboratory, both using antibody MIB-1. Each laboratory scored Ki67 as percentage of positively stained invasive tumor cells using its own method. Six laboratories repeated scoring of 50 locally stained cases on 3 different days. Sources of variation were analyzed using random effects models with log2-transformed measurements. Reproducibility was quantified by intraclass correlation coefficient (ICC), and approximate two-sided 95% confidence intervals (CIs) for the true intraclass correlation coefficients in these experiments were provided.
Intralaboratory reproducibility was high (ICC = 0.94; 95% CI = 0.93 to 0.97). Interlaboratory reproducibility was only moderate (central staining: ICC = 0.71, 95% CI = 0.47 to 0.78; local staining: ICC = 0.59, 95% CI = 0.37 to 0.68). Geometric mean of Ki67 values for each laboratory across the 100 cases ranged 7.1% to 23.9% with central staining and 6.1% to 30.1% with local staining. Factors contributing to interlaboratory discordance included tumor region selection, counting method, and subjective assessment of staining positivity. Formal counting methods gave more consistent results than visual estimation.
Substantial variability in Ki67 scoring was observed among some of the world's most experienced laboratories. Ki67 values and cutoffs for clinical decision-making cannot be transferred between laboratories without standardizing scoring methodology because analytical validity is limited.
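As a rough illustration of how an intraclass correlation coefficient summarizes agreement on log2-transformed scores, the sketch below uses the classical one-way random-effects ANOVA estimator. This is a simplification chosen for brevity: the abstract states only that random effects models were used, so the estimator, the function name, the table layout (one row per case, one column per laboratory), and the `offset` added before the log transform are all assumptions.

```python
import math


def icc_oneway(scores_by_case, offset=0.1):
    """One-way random-effects ICC on log2-transformed Ki67 percentages.

    scores_by_case: list of rows, one per case; each row holds the scores
    given to that case by every laboratory (same number per row).
    offset: hypothetical constant added before log2 so that 0% is defined.
    """
    n = len(scores_by_case)
    k = len(scores_by_case[0]) if n else 0
    if n < 2 or k < 2 or any(len(row) != k for row in scores_by_case):
        raise ValueError("need at least 2 cases and 2 scores per case, all rows equal length")

    logs = [[math.log2(x + offset) for x in row] for row in scores_by_case]
    grand_mean = sum(sum(row) for row in logs) / (n * k)
    case_means = [sum(row) / k for row in logs]

    # Mean square between cases and mean square within cases.
    msb = k * sum((m - grand_mean) ** 2 for m in case_means) / (n - 1)
    msw = sum(
        (x - m) ** 2 for row, m in zip(logs, case_means) for x in row
    ) / (n * (k - 1))

    denominator = msb + (k - 1) * msw
    if denominator == 0:
        raise ValueError("ICC is undefined when all scores are identical")
    return (msb - msw) / denominator
```

Values near 1 mean that most of the variation comes from real differences between cases. Lower values mean that disagreement between laboratories accounts for a larger share of the total variation.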
The nuclear proliferation biomarker Ki67 has potential prognostic, predictive, and monitoring roles in breast cancer. Unacceptable between-laboratory variability has limited its clinical value. The International Ki67 in Breast Cancer Working Group investigated whether Ki67 immunohistochemistry can be analytically validated and standardized across laboratories using automated machine-based scoring. Sets of pre-stained core-cut biopsy sections of 30 breast tumors were circulated to 14 laboratories for scanning and automated assessment of the average and maximum percentage of tumor cells positive for Ki67. Seven unique scanners and 10 software platforms were involved in this study. Pre-specified analyses included evaluation of reproducibility between all laboratories (primary) as well as among those using scanners from a single vendor (secondary). The primary reproducibility metric was intraclass correlation coefficient between laboratories, with success considered to be intraclass correlation coefficient >0.80. Intraclass correlation coefficient for automated average scores across 16 operators was 0.83 (95% credible interval: 0.73-0.91) and intraclass correlation coefficient for maximum scores across 10 operators was 0.63 (95% credible interval: 0.44-0.80). For the laboratories using scanners from a single vendor (8 score sets), intraclass correlation coefficient for average automated scores was 0.89 (95% credible interval: 0.81-0.96), which was similar to the intraclass correlation coefficient of 0.87 (95% credible interval: 0.81-0.93) achieved using these same slides in a prior visual-reading reproducibility study. Automated machine assessment of average Ki67 has the potential to achieve between-laboratory reproducibility similar to that for a rigorously standardized pathologist-based visual assessment of Ki67.
The observed intraclass correlation coefficient was worse for maximum compared to average scoring methods, suggesting that maximum score methods may be suboptimal for consistent measurement of proliferation. Automated average scoring methods show promise for assessment of Ki67 scoring, but require further standardization and subsequent clinical validation.
Aims
The nuclear proliferation marker Ki67 assayed by immunohistochemistry has multiple potential uses in breast cancer, but an unacceptable level of interlaboratory variability has hampered its clinical utility. The International Ki67 in Breast Cancer Working Group has undertaken a systematic programme to determine whether Ki67 measurement can be analytically validated and standardised among laboratories. This study addresses whether acceptable scoring reproducibility can be achieved on excision whole sections.
Methods and results
Adjacent sections from 30 primary ER+ breast cancers were centrally stained for Ki67 and sections were circulated among 23 pathologists in 12 countries. All pathologists scored Ki67 by two methods: (i) global: four fields of 100 tumour cells each were selected to reflect observed heterogeneity in nuclear staining; (ii) hot‐spot: the field with highest apparent Ki67 index was selected and up to 500 cells scored. The intraclass correlation coefficient (ICC) for the global method (0.87; 95% confidence interval (CI) = 0.799–0.93) marginally met the prespecified success criterion (lower 95% CI ≥ 0.8), while the ICC for the hot‐spot method (0.83; 95% CI = 0.74–0.90) did not. Visually, interobserver concordance in location of selected hot‐spots varies between cases. The median times for scoring were 9 and 6 min for global and hot‐spot methods, respectively.
Conclusions
The global scoring method demonstrates adequate reproducibility to warrant next steps towards evaluation for technical and clinical validity in appropriate cohorts of cases. The time taken for scoring by either method is practical using counting software we are making publicly available. Establishment of external quality assessment schemes is likely to improve the reproducibility between laboratories further.
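The two scoring methods compared in this study can be sketched as simple cell-count arithmetic. The sketch assumes that the global score pools counts across the four selected fields with equal weight; the abstract does not state how fields are combined, so that choice, the `Field` type, and the function names are illustrative assumptions rather than the Working Group's counting software.

```python
from dataclasses import dataclass


@dataclass
class Field:
    """Cell counts from one microscopic field (hypothetical structure)."""
    positive: int  # Ki67-positive tumour cells counted
    total: int     # all tumour cells counted in the field


def ki67_index(fields):
    """Percentage of positive tumour cells, pooled over the given fields."""
    total = sum(f.total for f in fields)
    if total == 0:
        raise ValueError("no tumour cells counted")
    return 100 * sum(f.positive for f in fields) / total


def global_score(fields):
    """Global method: several fields chosen to reflect staining heterogeneity."""
    return ki67_index(fields)


def hot_spot_score(field):
    """Hot-spot method: a single field with the highest apparent Ki67 index."""
    return ki67_index([field])


# Example: four fields of 100 cells each versus one hot-spot of 500 cells.
fields = [Field(10, 100), Field(20, 100), Field(30, 100), Field(40, 100)]
example_global = global_score(fields)
example_hot_spot = hot_spot_score(Field(200, 500))
```

In this made-up example the hot-spot value exceeds the global value because the hot-spot method samples only the most proliferative area of the section.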
Summary Background We compared standard adjuvant anthracycline chemotherapy with anthracycline–taxane combination chemotherapy in women with operable node-positive breast cancer. Here we report the final, 10-year follow-up analysis of disease-free survival, overall survival, and long-term safety. Methods BCIRG 001 was an open-label, phase 3, multicentre trial in which 1491 patients aged 18–70 years with node-positive, early breast cancer and a Karnofsky score of 80% or more were randomly assigned to adjuvant treatment with docetaxel, doxorubicin, and cyclophosphamide (TAC) or fluorouracil, doxorubicin, and cyclophosphamide (FAC) every 3 weeks for six cycles. Randomisation was stratified according to institution and number of involved axillary lymph nodes per patient (one to three vs four or more). Disease-free survival was the primary endpoint and was defined as the interval between randomisation and breast cancer relapse, second primary cancer, or death, whichever occurred first. Efficacy analyses were based on the intention-to-treat principle. BCIRG 001 is registered with ClinicalTrials.gov, number NCT00688740. Findings Enrolment took place between June 11, 1997, and June 3, 1999; 745 patients were assigned to receive TAC and 746 patients were assigned to receive FAC. After a median follow-up of 124 months (IQR 90–126), disease-free survival was 62% (95% CI 58–65) for patients in the TAC group and 55% (51–59) for patients in the FAC group (hazard ratio [HR] 0·80, 95% CI 0·68–0·93; log-rank p=0·0043). 10-year overall survival was 76% (95% CI 72–79) for patients in the TAC group and 69% (65–72) for patients in the FAC group (HR 0·74, 0·61–0·90; log-rank p=0·0020). TAC improved disease-free survival relative to FAC irrespective of nodal, hormone receptor, and HER2 status, although not all differences were significant in these subgroup analyses.
Grade 3–4 heart failure occurred in 26 (3%) patients in the TAC group and 17 (2%) patients in the FAC group, and caused death in two patients in the TAC group and four patients in the FAC group. A substantial decrease in left ventricular ejection fraction (defined as a relative decrease from baseline of 20% or more) was seen in 58 (17%) patients who received TAC and 41 (15%) patients who received FAC. Six patients who received TAC developed leukaemia or myelodysplasia, as did three patients who received FAC. Interpretation Our results provide evidence that the initial therapeutic outcomes seen at the 5-year follow-up with a docetaxel-containing adjuvant regimen are maintained at 10 years. However, a substantial percentage of patients had a decrease in left ventricular ejection fraction, probably caused by anthracycline therapy, which warrants further investigation. Funding Sanofi.
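The definition of a substantial decrease in left ventricular ejection fraction quoted above (a relative decrease from baseline of 20% or more) differs from an absolute drop of 20 percentage points. A minimal sketch of the relative calculation follows; the function name and argument layout are assumptions made for illustration.

```python
def substantial_lvef_decrease(baseline, follow_up, threshold=0.20):
    """True when LVEF fell by at least `threshold` relative to baseline.

    baseline, follow_up: ejection fractions in percent (for example 60 and 48).
    """
    if baseline <= 0:
        raise ValueError("baseline LVEF must be positive")
    relative_decrease = (baseline - follow_up) / baseline
    return relative_decrease >= threshold
```

For example, a fall from 60% to 48% is a relative decrease of 20% and meets the definition, although the absolute change is only 12 percentage points.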
It is generally assumed that all estrogen-receptor-positive (ER+) breast cancers proliferate in response to estrogen and, therefore, examples of the estrogen-induced regression of ER+ cancers are paradoxical. This review re-examines the estrogen regression paradox for the Luminal A subtype of ER+ breast cancers. The proliferative response to estrogen is shown to depend on the level of ER. Mechanistically, a window of opportunity study of pre-operative estradiol suggested that with higher levels of ER, estradiol could activate the DREAM-MMB (Dimerization partner, Retinoblastoma-like proteins, E2F4, and MuvB–MYB-MuvB) pathway to decrease proliferation. The response of breast epithelium and the incidence of breast cancers during hormonal variations that occur during the menstrual cycle and at the menopausal transition, respectively, suggest that a single hormone, either estrogen, progesterone or androgen, could activate the DREAM pathway, leading to reversible cell cycle arrest. Conversely, the presence of two hormones could switch the DREAM-MMB complex to a pro-proliferative pathway. Using publicly available data, we examine the gene expression changes after aromatase inhibitors and ICI 182,780 to provide support for the hypothesis. This review suggests that it might be possible to integrate all current hormonal therapies for Luminal A tumors within a single theoretical schema.
Ki67 has potential clinical importance in breast cancer but has yet to see broad acceptance due to inter-laboratory variability. Here we tested an open source and calibrated automated digital image analysis (DIA) platform to: (i) investigate the comparability of Ki67 measurement across corresponding core biopsy and resection specimen cases, and (ii) assess section to section differences in Ki67 scoring. Two sets of 60 previously stained slides containing 30 core-cut biopsy and 30 corresponding resection specimens from 30 estrogen receptor-positive breast cancer patients were sent to 17 participating labs for automated assessment of average Ki67 expression. The blocks were centrally cut and immunohistochemically (IHC) stained for Ki67 (MIB-1 antibody). The QuPath platform was used to evaluate tumoral Ki67 expression. Calibration of the DIA method was performed as in published studies. A guideline for building an automated Ki67 scoring algorithm was sent to participating labs. Very high correlation and no systematic error (p = 0.08) were found between consecutive Ki67 IHC sections. Ki67 scores were higher for core biopsy slides compared to paired whole sections from resections (p ≤ 0.001; median difference: 5.31%). The systematic discrepancy between core biopsy and corresponding whole sections was likely due to pre-analytical factors (tissue handling, fixation). Therefore, Ki67 IHC should be tested on core biopsy samples to best reflect the biological status of the tumor.