The Sister Study was designed to address gaps in the study of environment and breast cancer by taking advantage of more frequent breast cancer diagnoses among women with a sister history of breast ...cancer and the presumed enrichment of shared environmental and genetic exposures.
The Sister Study sought a large cohort of women never diagnosed with breast cancer but who had a sister (full or half) diagnosed with breast cancer.
A multifaceted national effort employed novel strategies to recruit a diverse cohort, and collected biological and environmental samples and extensive data on potential breast cancer risk factors.
The Sister Study enrolled 50,884 U.S. and Puerto Rican women 35-74y of age (median 56 y). Although the majority were non-Hispanic white, well educated, and economically well off, substantial numbers of harder-to-recruit women also enrolled (race/ethnicity other than non-Hispanic white: 16%; no college degree: 35%; household income <$50,000: 26%). Although all had a biologic sister with breast cancer, 16.5% had average or lower risk of breast cancer according to the Breast Cancer Risk Assessment Tool (Gail score). Most were postmenopausal (66%), parous with a first full-term pregnancy <30y of age (79%), never-smokers (56%) with body mass indexes (BMIs) of <29.9
kg/m
(70%). Few (5%) reported any cancer prior to enrollment.
The Sister Study is a unique cohort designed to efficiently study environmental and genetic risk factors for breast cancer. Extensive exposure data over the life-course and baseline specimens provide important opportunities for studying breast cancer and other health outcomes in women. Collaborations are welcome. https://doi.org/10.1289/EHP1923.
We carried out a genome-wide association study among Chinese women to identify risk variants for breast cancer. After analyzing 607,728 SNPs in 1,505 cases and 1,522 controls, we selected 29 SNPs for ...a fast-track replication in an independent set of 1,554 cases and 1,576 controls. We further investigated four replicated loci in a third set of samples comprising 3,472 cases and 900 controls. SNP rs2046210 at 6q25.1, located upstream of the gene encoding estrogen receptor α (ESR1), showed strong and consistent association with breast cancer across all three stages. Adjusted odds ratio (95% CI) were 1.36 (1.24-1.49) and 1.59 (1.40-1.82), respectively, for genotypes A/G and A/A versus G/G (P for trend 2.0 × 10−15) in the pooled analysis of samples from all three stages. We also found a similar, albeit weaker, association in an independent study comprising 1,591 cases and 1,466 controls of European ancestry (Ptrend = 0.01). These results strongly implicate 6q25.1 as a susceptibility locus for breast cancer.
Women may have incomplete understanding of a breast cancer diagnosis, leading to inaccurate reporting in epidemiological studies. However, it is not feasible to obtain consent for medical records ...from all women participating in a study. Therefore, it is important to determine how well self-reported breast cancer characteristics correspond with what is found in medical records, but few studies have evaluated agreement of self-reported breast cancer characteristics with abstracted medical records.
We calculated the positive predictive value (PPV) of self-reports compared to medical records and explored whether participant characteristics may have influenced reporting accuracy. We analyzed data from 2518 reported breast cancer cases from the Sister Study, a large nationwide cohort of women with a family history of breast cancer.
Medical records or pathology reports were obtained for 2066 of 2518 (82%) women who reported incident breast cancer. Breast cancer was confirmed for over 99% (n = 2054) of women with medical records. Confirmation rates were high for invasive, ductal, hormone receptor positive, and HER2 negative breast cancers, with little variation by race/ethnicity or age. Self-reported in situ breast cancer had a lower PPV (64.2%), with medical records showing invasive breast cancer instead, especially for older and Hispanic women. Hormone receptor (ER and PR) negative and HER2 positive self-reports had lower PPVs (83.0%, 71.6%, and 66.1% respectively). Hispanic women and women ages 65 or older at diagnosis were less able to accurately report breast cancer stage, excluding stage I.
Accuracy of reporting overall breast cancer and common subtypes is high. Despite having a family history of breast cancer and voluntarily enrolling in a study evaluating breast cancer risk factors, participants may have greater difficulty distinguishing between in situ and invasive breast cancer and may less accurately report other less common subtypes. Discrepancies may reflect women's poor understanding of information conveyed by health care providers or lack of consistent terminology used to describe subtypes.
Gene expression analysis has identified several breast cancer subtypes, including basal-like, human epidermal growth factor receptor-2 positive/estrogen receptor negative (HER2+/ER-), luminal A, and ...luminal B.
To determine population-based distributions and clinical associations for breast cancer subtypes.
Immunohistochemical surrogates for each subtype were applied to 496 incident cases of invasive breast cancer from the Carolina Breast Cancer Study (ascertained between May 1993 and December 1996), a population-based, case-control study that oversampled premenopausal and African American women. Subtype definitions were as follows: luminal A (ER+ and/or progesterone receptor positive PR+, HER2-), luminal B (ER+ and/or PR+, HER2+), basal-like (ER-, PR-, HER2-, cytokeratin 5/6 positive, and/or HER1+), HER2+/ER- (ER-, PR-, and HER2+), and unclassified (negative for all 5 markers).
We examined the prevalence of breast cancer subtypes within racial and menopausal subsets and determined their associations with tumor size, axillary nodal status, mitotic index, nuclear pleomorphism, combined grade, p53 mutation status, and breast cancer-specific survival.
The basal-like breast cancer subtype was more prevalent among premenopausal African American women (39%) compared with postmenopausal African American women (14%) and non-African American women (16%) of any age (P<.001), whereas the luminal A subtype was less prevalent (36% vs 59% and 54%, respectively). The HER2+/ER- subtype did not vary with race or menopausal status (6%-9%). Compared with luminal A, basal-like tumors had more TP53 mutations (44% vs 15%, P<.001), higher mitotic index (odds ratio OR, 11.0; 95% confidence interval CI, 5.6-21.7), more marked nuclear pleomorphism (OR, 9.7; 95% CI, 5.3-18.0), and higher combined grade (OR, 8.3; 95% CI, 4.4-15.6). Breast cancer-specific survival differed by subtype (P<.001), with shortest survival among HER2+/ER- and basal-like subtypes.
Basal-like breast tumors occurred at a higher prevalence among premenopausal African American patients compared with postmenopausal African American and non-African American patients in this population-based study. A higher prevalence of basal-like breast tumors and a lower prevalence of luminal A tumors could contribute to the poor prognosis of young African American women with breast cancer.
Etiologic differences between subtypes of breast cancer defined by estrogen receptor (ER) and progesterone receptor (PR) status are not well understood. The authors evaluated associations of ...hormone-related factors with breast cancer subtypes in a population-based case-control study involving 1,409 ER-positive (ER+)/PR-positive (PR+) cases, 712 ER-negative (ER-)/PR-negative (PR-) cases, 301 ER+/PR- cases, 254 ER-/PR+ cases, and 3,474 controls aged 20-70 years in Shanghai, China (phase I, 1996-1998; phase II, 2002-2005). Polytomous logistic regression and Wald tests for heterogeneity across subtypes were conducted. Breast cancer risks associated with age at menarche, age at menopause, breastfeeding, age at first livebirth, waist-to-hip ratio, and oral contraceptive use did not differ by hormone receptor status. Among postmenopausal women, higher parity (≥2 children vs. 1) was associated with reduced risk (odds ratio (OR) = 0.69, 95% confidence interval (CI): 0.52, 0.91) and higher body mass index (BMI; weight (kg)/height (m)(2)) with increased risk (highest quartile: OR = 2.40, 95% CI: 1.65, 3.47) of the ER+/PR+ subtype but was unrelated to the ER-/PR- subtype (for parity, P(heterogeneity) = 0.02; for BMI, P(heterogeneity) < 0.01). Hormone replacement therapy (OR = 2.25, 95% CI: 1.40, 3.62) and alcohol consumption (OR = 1.59, 95% CI: 1.01, 2.51) appeared to be preferentially associated with the ER+/PR- subtype. These findings indicate that BMI, parity, hormone replacement therapy, and alcohol consumption may play different roles in subtypes of breast cancer. More research is needed to better understand the etiology of 2 relatively rare subtypes, ER+/PR- tumors and ER-/PR+ tumors.
Our study describes breast cancer risk loci using a cross-ancestry GWAS approach. We first identify variants that are associated with breast cancer at P < 0.05 from African ancestry GWAS ...meta-analysis (9241 cases and 10193 controls), then meta-analyze with European ancestry GWAS data (122977 cases and 105974 controls) from the Breast Cancer Association Consortium. The approach identifies four loci for overall breast cancer risk 1p13.3, 5q31.1, 15q24 (two independent signals), and 15q26.3 and two loci for estrogen receptor-negative disease (1q41 and 7q11.23) at genome-wide significance. Four of the index single nucleotide polymorphisms (SNPs) lie within introns of genes (KCNK2, C5orf56, SCAMP2, and SIN3A) and the other index SNPs are located close to GSTM4, AMPD2, CASTOR2, and RP11-168G16.2. Here we present risk loci with consistent direction of associations in African and European descendants. The study suggests that replication across multiple ancestry populations can help improve the understanding of breast cancer genetics and identify causal variants.
Recombination, together with mutation, gives rise to genetic variation in populations. Here we leverage the recent mixture of people of African and European ancestry in the Americas to build a ...genetic map measuring the probability of crossing over at each position in the genome, based on about 2.1 million crossovers in 30,000 unrelated African Americans. At intervals of more than three megabases it is nearly identical to a map built in Europeans. At finer scales it differs significantly, and we identify about 2,500 recombination hotspots that are active in people of West African ancestry but nearly inactive in Europeans. The probability of a crossover at these hotspots is almost fully controlled by the alleles an individual carries at PRDM9 (P value < 10(-245)). We identify a 17-base-pair DNA sequence motif that is enriched in these hotspots, and is an excellent match to the predicted binding target of PRDM9 alleles common in West Africans and rare in Europeans. Sites of this motif are predicted to be risk loci for disease-causing genomic rearrangements in individuals carrying these alleles. More generally, this map provides a resource for research in human genetic variation and evolution.
Abstract
Background
Expansion of genome-wide association studies across population groups is needed to improve our understanding of shared and unique genetic contributions to breast cancer. We ...performed association and replication studies guided by a priori linkage findings from African ancestry (AA) relative pairs.
Methods
We performed fixed-effect inverse-variance weighted meta-analysis under three significant AA breast cancer linkage peaks (3q26-27, 12q22-23, and 16q21-22) in 9241 AA cases and 10 193 AA controls. We examined associations with overall breast cancer as well as estrogen receptor (ER)-positive and negative subtypes (193,132 SNPs). We replicated associations in the African-ancestry Breast Cancer Genetic Consortium (AABCG).
Results
In AA women, we identified two associations on chr12q for overall breast cancer (rs1420647, OR = 1.15, p = 2.50×10−6; rs12322371, OR = 1.14, p = 3.15×10−6), and one for ER-negative breast cancer (rs77006600, OR = 1.67, p = 3.51×10−6). On chr3, we identified two associations with ER-negative disease (rs184090918, OR = 3.70, p = 1.23×10−5; rs76959804, OR = 3.57, p = 1.77×10−5) and on chr16q we identified an association with ER-negative disease (rs34147411, OR = 1.62, p = 8.82×10−6). In the replication study, the chr3 associations were significant and effect sizes were larger (rs184090918, OR: 6.66, 95% CI: 1.43, 31.01; rs76959804, OR: 5.24, 95% CI: 1.70, 16.16).
Conclusion
The two chr3 SNPs are upstream to open chromatin ENSR00000710716, a regulatory feature that is actively regulated in mammary tissues, providing evidence that variants in this chr3 region may have a regulatory role in our target organ. Our study provides support for breast cancer variant discovery using prioritization based on linkage evidence.
Abstract
Polygenic risk scores (PRSs) are useful for predicting breast cancer risk, but the prediction accuracy of existing PRSs in women of African ancestry (AA) remains relatively low. We aim to ...develop optimal PRSs for the prediction of overall and estrogen receptor (ER) subtype-specific breast cancer risk in AA women. The AA dataset comprised 9235 cases and 10 184 controls from four genome-wide association study (GWAS) consortia and a GWAS study in Ghana. We randomly divided samples into training and validation sets. We built PRSs using individual-level AA data by a forward stepwise logistic regression and then developed joint PRSs that combined (1) the PRSs built in the AA training dataset and (2) a 313-variant PRS previously developed in women of European ancestry. PRSs were evaluated in the AA validation set. For overall breast cancer, the odds ratio per standard deviation of the joint PRS in the validation set was 1.34 95% confidence interval (CI): 1.27–1.42 with the area under receiver operating characteristic curve (AUC) of 0.581. Compared with women with average risk (40th–60th PRS percentile), women in the top decile of the PRS had a 1.98-fold increased risk (95% CI: 1.63–2.39). For PRSs of ER-positive and ER-negative breast cancer, the AUCs were 0.608 and 0.576, respectively. Compared with existing methods, the proposed joint PRSs can improve prediction of breast cancer risk in AA women.
The rapid pace of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2; COVID-19) pandemic presents challenges to the real-time collection of population-scale data to inform near-term ...public health needs as well as future investigations. We established the COronavirus Pandemic Epidemiology (COPE) consortium to address this unprecedented crisis on behalf of the epidemiology research community. As a central component of this initiative, we have developed a COVID Symptom Study (previously known as the COVID Symptom Tracker) mobile application as a common data collection tool for epidemiologic cohort studies with active study participants. This mobile application collects information on risk factors, daily symptoms, and outcomes through a user-friendly interface that minimizes participant burden. Combined with our efforts within the general population, data collected from nearly 3 million participants in the United States and United Kingdom are being used to address critical needs in the emergency response, including identifying potential hot spots of disease and clinically actionable risk factors. The linkage of symptom data collected in the app with information and biospecimens already collected in epidemiology cohorts will position us to address key questions related to diet, lifestyle, environmental, and socioeconomic factors on susceptibility to COVID-19, clinical outcomes related to infection, and long-term physical, mental health, and financial sequalae. We call upon additional epidemiology cohorts to join this collective effort to strengthen our impact on the current health crisis and generate a new model for a collaborative and nimble research infrastructure that will lead to more rapid translation of our work for the betterment of public health.