The vacuolated lens (vl) mouse mutation arose on the C3H/HeSnJ background and results in lethality, neural tube defects (NTDs) and cataracts. The vl phenotypes are due to a deletion/frameshift ...mutation in the orphan GPCR, Gpr161. A recent study using a null allele demonstrated that Gpr161 functions in primary cilia and represses the Shh pathway. We show the hypomorphic Gpr161vl allele does not severely affect the Shh pathway. To identify additional pathways regulated by Gpr161 during neurulation, we took advantage of naturally occurring genetic variation in the mouse. Previously Gpr161vl-C3H was crossed to different inbred backgrounds including MOLF/EiJ and the Gpr161vl mutant phenotypes were rescued. Five modifiers were mapped (Modvl: Modifier of vl) including Modvl5MOLF. In this study we demonstrate the Modvl5MOLF congenic rescues the Gpr161vl-associated lethality and NTDs but not cataracts. Bioinformatics determined the transcription factor, Cdx1, is the only annotated gene within the Modvl5 95% CI co-expressed with Gpr161 during neurulation and not expressed in the eye. Using Cdx1 as an entry point, we identified the retinoid acid (RA) and canonical Wnt pathways as downstream targets of Gpr161. QRT-PCR, ISH and IHC determined that expression of RA and Wnt genes are down-regulated in Gpr161vl/vl but rescued by the Modvl5MOLF congenic during neurulation. Intraperitoneal RA injection restores expression of canonical Wnt markers and rescues Gpr161vl/vl NTDs. These results establish the RA and canonical Wnt as pathways downstream of Gpr161 during neurulation, and suggest that Modvl5MOLF bypasses the Gpr161vl mutation by restoring the activity of these pathways.
•Modvl5MOLF is a genetic locus that rescues lethality and NTDs in Gpr161vl/vl.•Bioinformatics identified Cdx1 as a QTG that contributes to the Modvl5 locus.•Shh signaling and Gpr161 ciliary localization are not severely affected in Gpr161vl/vl.•RA and canonical Wnt signaling are downstream to Gpr161 in regulating neurulation.•Modvl5MOLF bypasses Gpr161vl by restoring the activity of RA and Wnt signaling.
Several regions of the genome show pleiotropic associations with multiple cancers. We sought to evaluate whether 181 single-nucleotide polymorphisms previously associated with various cancers in ...genome-wide association studies were also associated with melanoma risk.
We evaluated 2,131 melanoma cases and 20,353 controls from three studies in the Population Architecture using Genomics and Epidemiology (PAGE) study (EAGLE-BioVU, MEC, WHI) and two collaborating studies (HPFS, NHS). Overall and sex-stratified analyses were performed across studies.
We observed statistically significant associations with melanoma for two lung cancer SNPs in the TERT-CLPTM1L locus (Bonferroni-corrected p<2.8x10-4), replicating known pleiotropic effects at this locus. In sex-stratified analyses, we also observed a potential male-specific association between prostate cancer risk variant rs12418451 and melanoma risk (OR=1.22, p=8.0x10-4). No other variants in our study were associated with melanoma after multiple comparisons adjustment (p>2.8e-4).
We provide confirmatory evidence of pleiotropic associations with melanoma for two SNPs previously associated with lung cancer, and provide suggestive evidence for a male-specific association with melanoma for prostate cancer variant rs12418451. This SNP is located near TPCN2, an ion transport gene containing SNPs which have been previously associated with hair pigmentation but not melanoma risk. Previous evidence provides biological plausibility for this association, and suggests a complex interplay between ion transport, pigmentation, and melanoma risk that may vary by sex. If confirmed, these pleiotropic relationships may help elucidate shared molecular pathways between cancers and related phenotypes.
Aims/hypothesis
Elevated levels of fasting glucose and fasting insulin in non-diabetic individuals are markers of dysregulation of glucose metabolism and are strong risk factors for type 2 diabetes. ...Genome-wide association studies have discovered over 50 SNPs associated with these traits. Most of these loci were discovered in European populations and have not been tested in a well-powered multi-ethnic study. We hypothesised that a large, ancestrally diverse, fine-mapping genetic study of glycaemic traits would identify novel and population-specific associations that were previously undetectable by European-centric studies.
Methods
A multiethnic study of up to 26,760 unrelated individuals without diabetes, of predominantly Hispanic/Latino and African ancestries, were genotyped using the Metabochip. Transethnic meta-analysis of racial/ethnic-specific linear regression analyses were performed for fasting glucose and fasting insulin. We attempted to replicate 39 fasting glucose and 17 fasting insulin loci. Genetic fine-mapping was performed through sequential conditional analyses in 15 regions that included both the initially reported SNP association(s) and denser coverage of SNP markers. In addition, Metabochip-wide analyses were performed to discover novel fasting glucose and fasting insulin loci. The most significant SNP associations were further examined using bioinformatic functional annotation.
Results
Previously reported SNP associations were significantly replicated (
p
≤ 0.05) in 31/39 fasting glucose loci and 14/17 fasting insulin loci. Eleven glycaemic trait loci were refined to a smaller list of potentially causal variants through transethnic meta-analysis. Stepwise conditional analysis identified two loci with independent secondary signals (
G6PC2
-rs477224 and
GCK-
rs2908290), which had not previously been reported. Population-specific conditional analyses identified an independent signal in
G6PC2
tagged by the rare variant rs77719485 in African ancestry. Further Metabochip-wide analysis uncovered one novel fasting insulin locus at
SLC17A2
-rs75862513.
Conclusions/interpretation
These findings suggest that while glycaemic trait loci often have generalisable effects across the studied populations, transethnic genetic studies help to prioritise likely functional SNPs, identify novel associations that may be population-specific and in turn have the potential to influence screening efforts or therapeutic discoveries.
Data availability
The summary statistics from each of the ancestry-specific and transethnic (combined ancestry) results can be found under the PAGE study on dbGaP here:
https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000356.v1.p1
Objective:
Several genome–wide association studies (GWAS) have demonstrated that common genetic variants contribute to obesity. However, studies of this complex trait have focused on ancestrally ...European populations, despite the high prevalence of obesity in some minority groups.
Design and Methods:
As part of the “Population Architecture using Genomics and Epidemiology (PAGE)” Consortium, we investigated the association between 13 GWAS‐identified single‐nucleotide polymorphisms (SNPs) and BMI and obesity in 69,775 subjects, including 6,149 American Indians, 15,415 African‐Americans, 2,438 East Asians, 7,346 Hispanics, 604 Pacific Islanders, and 37,823 European Americans. For the BMI‐increasing allele of each SNP, we calculated β coefficients using linear regression (for BMI) and risk estimates using logistic regression (for obesity defined as BMI ≥ 30) followed by fixed‐effects meta‐analysis to combine results across PAGE sites. Analyses stratified by racial/ethnic group assumed an additive genetic model and were adjusted for age, sex, and current smoking. We defined “replicating SNPs” (in European Americans) and “generalizing SNPs” (in other racial/ethnic groups) as those associated with an allele frequency‐specific increase in BMI.
Results:
By this definition, we replicated 9/13 SNP associations (5 out of 8 loci) in European Americans. We also generalized 8/13 SNP associations (5/8 loci) in East Asians, 7/13 (5/8 loci) in African Americans, 6/13 (4/8 loci) in Hispanics, 5/8 in Pacific Islanders (5/8 loci), and 5/9 (4/8 loci) in American Indians.
Conclusion:
Linkage disequilibrium patterns suggest that tagSNPs selected for European Americans may not adequately tag causal variants in other ancestry groups. Accordingly, fine‐mapping in large samples is needed to comprehensively explore these loci in diverse populations.
We have constructed de novo a high-resolution genetic map that includes the largest set, to our knowledge, of polymorphic markers (
N=14,759) for which genotype data are publicly available; that ...combines genotype data from both the Centre d'Etude du Polymorphisme Humain (CEPH) and deCODE pedigrees; that incorporates single-nucleotide polymorphisms; and that also incorporates sequence-based positional information. The position of all markers on our map is corroborated by both genomic sequence and recombination-based data. This specific combination of features maximizes marker inclusion, coverage, and resolution, making this map uniquely suitable as a comprehensive resource for determining genetic map information (order and distances) for any large set of polymorphic markers.
Multiple primary cancers account for approximately 16% of all incident cancers in the United States. Although genome-wide association studies (GWAS) have identified many common genetic variants ...associated with various cancer sites, no study has examined the association of these genetic variants with risk of multiple primary cancers (MPC).
As part of the National Human Genome Research Institute (NHGRI) Population Architecture using Genomics and Epidemiology (PAGE) study, we used data from the Multiethnic Cohort (MEC) and Women's Health Initiative (WHI). Incident MPC (IMPC) cases (n = 1,385) were defined as participants diagnosed with more than one incident cancer after cohort entry. Participants diagnosed with only one incident cancer after cohort entry with follow-up equal to or longer than IMPC cases served as controls (single-index cancer controls; n = 9,626). Fixed-effects meta-analyses of unconditional logistic regression analyses were used to evaluate the associations between 188 cancer risk variants and IMPC risk. To account for multiple comparisons, we used the false-positive report probability (FPRP) to determine statistical significance.
A nicotine dependence-associated and lung cancer variant, CHRNA3 rs578776 OR, 1.16; 95% confidence interval (CI), 1.05-1.26; P = 0.004, and two breast cancer variants, EMBP1 rs11249433 and TOX3 rs3803662 (OR, 1.16; 95% CI, 1.04-1.28; P = 0.005 and OR, 1.13; 95% CI, 1.03-1.23; P = 0.006), were significantly associated with risk of IMPC. The associations for rs578776 and rs11249433 remained (P < 0.05) after removing subjects who had lung or breast cancers, respectively (P ≤ 0.046). These associations did not show significant heterogeneity by smoking status (Pheterogeneity ≥ 0.53).
Our study has identified rs578776 and rs11249433 as risk variants for IMPC.
These findings may help to identify genetic regions associated with IMPC risk.
Genome-wide association studies (GWAS) have laid the foundation for investigations into the biology of complex traits, drug development and clinical guidelines. However, the majority of discovery ...efforts are based on data from populations of European ancestry
. In light of the differential genetic architecture that is known to exist between populations, bias in representation can exacerbate existing disease and healthcare disparities. Critical variants may be missed if they have a low frequency or are completely absent in European populations, especially as the field shifts its attention towards rare variants, which are more likely to be population-specific
. Additionally, effect sizes and their derived risk prediction scores derived in one population may not accurately extrapolate to other populations
. Here we demonstrate the value of diverse, multi-ethnic participants in large-scale genomic studies. The Population Architecture using Genomics and Epidemiology (PAGE) study conducted a GWAS of 26 clinical and behavioural phenotypes in 49,839 non-European individuals. Using strategies tailored for analysis of multi-ethnic and admixed populations, we describe a framework for analysing diverse populations, identify 27 novel loci and 38 secondary signals at known loci, as well as replicate 1,444 GWAS catalogue associations across these traits. Our data show evidence of effect-size heterogeneity across ancestries for published GWAS associations, substantial benefits for fine-mapping using diverse cohorts and insights into clinical implications. In the United States-where minority populations have a disproportionately higher burden of chronic conditions
-the lack of representation of diverse populations in genetic research will result in inequitable access to precision medicine for those with the highest burden of disease. We strongly advocate for continued, large genome-wide efforts in diverse populations to maximize genetic discovery and reduce health disparities.
Although the 1000 Genomes haplotypes are the most commonly used reference panel for imputation, medical sequencing projects are generating large alternate sets of sequenced samples. Imputation in ...African Americans using 3384 haplotypes from the Exome Sequencing Project, compared with 2184 haplotypes from 1000 Genomes Project, increased effective sample size by 8.3-11.4% for coding variants with minor allele frequency <1%. No loss of imputation quality was observed using a panel built from phenotypic extremes. We recommend using haplotypes from Exome Sequencing Project alone or concatenation of the two panels over quality score-based post-imputation selection or IMPUTE2's two-panel combination.
yunli@med.unc.edu.
Supplementary data are available at Bioinformatics online.
A key goal of whole-genome sequencing for studies of human genetics is to interrogate all forms of variation, including single-nucleotide variants, small insertion or deletion (indel) variants and ...structural variants. However, tools and resources for the study of structural variants have lagged behind those for smaller variants. Here we used a scalable pipeline
to map and characterize structural variants in 17,795 deeply sequenced human genomes. We publicly release site-frequency data to create the largest, to our knowledge, whole-genome-sequencing-based structural variant resource so far. On average, individuals carry 2.9 rare structural variants that alter coding regions; these variants affect the dosage or structure of 4.2 genes and account for 4.0-11.2% of rare high-impact coding alleles. Using a computational model, we estimate that structural variants account for 17.2% of rare alleles genome-wide, with predicted deleterious effects that are equivalent to loss-of-function coding alleles; approximately 90% of such structural variants are noncoding deletions (mean 19.1 per genome). We report 158,991 ultra-rare structural variants and show that 2% of individuals carry ultra-rare megabase-scale structural variants, nearly half of which are balanced or complex rearrangements. Finally, we infer the dosage sensitivity of genes and noncoding elements, and reveal trends that relate to element class and conservation. This work will help to guide the analysis and interpretation of structural variants in the era of whole-genome sequencing.
Evidence is limited as to whether heritable risk of obesity varies throughout adulthood. Among >34,000 European Americans, aged 18-100 years, from multiple U.S. studies in the Population Architecture ...using Genomics and Epidemiology (PAGE) Consortium, we examined evidence for heterogeneity in the associations of five established obesity risk variants (near FTO, GNPDA2, MTCH2, TMEM18, and NEGR1) with BMI across four distinct epochs of adulthood: 1) young adulthood (ages 18-25 years), adulthood (ages 26-49 years), middle-age adulthood (ages 50-69 years), and older adulthood (ages ≥70 years); or 2) by menopausal status in women and stratification by age 50 years in men. Summary-effect estimates from each meta-analysis were compared for heterogeneity across the life epochs. We found heterogeneity in the association of the FTO (rs8050136) variant with BMI across the four adulthood epochs (P = 0.0006), with larger effects in young adults relative to older adults (β SE = 1.17 0.45 vs. 0.09 0.09 kg/m², respectively, per A allele) and smaller intermediate effects. We found no evidence for heterogeneity in the association of GNPDA2, MTCH2, TMEM18, and NEGR1 with BMI across adulthood. Genetic predisposition to obesity may have greater effects on body weight in young compared with older adulthood for FTO, suggesting changes by age, generation, or secular trends. Future research should compare and contrast our findings with results using longitudinal data.