Large biobank-scale whole genome sequencing (WGS) studies are rapidly identifying a multitude of coding and non-coding variants. They provide an unprecedented resource for illuminating the genetic ...basis of human diseases. Variant functional annotations play a critical role in WGS analysis, result interpretation, and prioritization of disease- or trait-associated causal variants. Existing functional annotation databases have limited scope to perform online queries and functionally annotate the genotype data of large biobank-scale WGS studies. We develop the Functional Annotation of Variants Online Resources (FAVOR) to meet these pressing needs. FAVOR provides a comprehensive multi-faceted variant functional annotation online portal that summarizes and visualizes findings of all possible nine billion single nucleotide variants (SNVs) across the genome. It allows for rapid variant-, gene- and region-level queries of variant functional annotations. FAVOR integrates variant functional information from multiple sources to describe the functional characteristics of variants and facilitates prioritizing plausible causal variants influencing human phenotypes. Furthermore, we provide a scalable annotation tool, FAVORannotator, to functionally annotate large-scale WGS studies and efficiently store the genotype and their variant functional annotation data in a single file using the annotated Genomic Data Structure (aGDS) format, making downstream analysis more convenient. FAVOR and FAVORannotator are available at https://favor.genohub.org.
Although the relationship between APOE and Alzheimer's disease (AD) is well established in populations of European descent, the effects of APOE and ancestry on AD risk in diverse populations is not ...well understood.
Logistic mixed model regression and survival analyses were performed in a sample of 3067 Caribbean Hispanics and 3028 individuals of European descent to assess the effects of APOE genotype, local ancestry, and genome-wide ancestry on AD risk and age at onset.
Among the Caribbean Hispanics, individuals with African-derived ancestry at APOE had 39% lower odds of AD than individuals with European-derived APOE, after adjusting for APOE genotype, age, and genome-wide ancestry. While APOE E2 and E4 effects on AD risk and age at onset were significant in the Caribbean Hispanics, they were substantially attenuated compared with those in European ancestry individuals.
These results suggest that additional genetic variation in the APOE region influences AD risk beyond APOE E2/E3/E4.
The 8q24 genomic locus is tied to the origin of numerous cancers. We investigate its contribution to hereditary prostate cancer (HPC) in independent study populations of the Nashville Familial ...Prostate Cancer Study and International Consortium for Prostate Cancer Genetics (combined: 2,836 HPC cases, 2,206 controls of European ancestry). Here we report 433 variants concordantly associated with HPC in both study populations, accounting for 9% of heritability and modifying age of diagnosis as well as aggressiveness; 183 reach genome-wide significance. The variants comprehensively distinguish independent risk-altering haplotypes overlapping the 648 kb locus (three protective, and four risk (peak odds ratios: 1.5, 4, 5, and 22)). Sequence of the near-Mendelian haplotype reveals eleven causal mutation candidates. We introduce a linkage disequilibrium-based algorithm discerning eight independent sentinel variants, carrying considerable risk prediction ability (AUC = 0.625) for a single locus. These findings elucidate 8q24 locus structure and correlates for clinical prediction of prostate cancer risk.
Lung disease is the major cause of morbidity and mortality in persons with cystic fibrosis (pwCF). Variability in CF lung disease has substantial non-CFTR (CF transmembrane conductance regulator) ...genetic influence. Identification of genetic modifiers has prognostic and therapeutic importance.
Identify genetic modifier loci and genes/pathways associated with pulmonary disease severity.
Whole-genome sequencing data on 4,248 unique pwCF with pancreatic insufficiency and lung function measures were combined with imputed genotypes from an additional 3,592 patients with pancreatic insufficiency from the United States, Canada, and France. This report describes association of approximately 15.9 million SNPs using the quantitative Kulich normal residual mortality-adjusted (KNoRMA) lung disease phenotype in 7,840 pwCF using premodulator lung function data.
Testing included common and rare SNPs, transcriptome-wide association, gene-level, and pathway analyses. Pathway analyses identified novel associations with genes that have key roles in organ development, and we hypothesize that these genes may relate to dysanapsis and/or variability in lung repair. Results confirmed and extended previous genome-wide association study findings. These whole-genome sequencing data provide finely mapped genetic information to support mechanistic studies. No novel primary associations with common single variants or rare variants were found. Multilocus effects at chr5p13 (
) and chr11p13 (
) were identified. Variant effect size estimates at associated loci were consistently ordered across the cohorts, indicating possible age or birth cohort effects.
This premodulator genomic, transcriptomic, and pathway association study of 7,840 pwCF will facilitate mechanistic and postmodulator genetic studies and the development of novel therapeutics for CF lung disease.
Split hand/foot malformation (SHFM) is a rare limb abnormality with clefting of the fingers and/or toes. For many individuals, the genetic etiology is unknown. Through whole-exome and targeted ...sequencing, we detected three novel variants in a gene encoding a transcription factor, PRDM1, that arose de novo in families with SHFM or segregated with the phenotype. PRDM1 is required for limb development; however, its role is not well understood and it is unclear how the PRDM1 variants affect protein function. Using transient and stable overexpression rescue experiments in zebrafish, we show that the variants disrupt the proline/serine-rich and DNA-binding zinc finger domains, resulting in a dominant-negative effect. Through gene expression assays, RNA sequencing, and CUT&RUN in isolated pectoral fin cells, we demonstrate that Prdm1a directly binds to and regulates genes required for fin induction, outgrowth and anterior/posterior patterning, such as fgfr1a, dlx5a, dlx6a and smo. Taken together, these results improve our understanding of the role of PRDM1 in the limb gene regulatory network and identified novel PRDM1 variants that link to SHFM in humans.
INTRODUCTION
Alzheimer's disease (AD) is a complex disease influenced by genetics and environment. More than 75 susceptibility loci have been linked to late‐onset AD, but most of these loci were ...discovered in genome‐wide association studies (GWAS) exclusive to non‐Hispanic White individuals. There are wide disparities in AD risk across racially stratified groups, and while these disparities are not due to genetic differences, underrepresentation in genetic research can further exacerbate and contribute to their persistence. We investigated the racial/ethnic representation of participants in United States (US)‐based AD genetics and the statistical implications of current representation.
METHODS
We compared racial/ethnic data of participants from array and sequencing studies in US AD genetics databases, including National Institute on Aging Genetics of Alzheimer's Disease Data Storage Site (NIAGADS) and NIAGADS Data Sharing Service (dssNIAGADS), to AD and related dementia (ADRD) prevalence and mortality. We then simulated the statistical power of these datasets to identify risk variants from non‐White populations.
RESULTS
There is insufficient statistical power (probability <80%) to detect single nucleotide polymorphisms (SNPs) with low to moderate effect sizes (odds ratio OR<1.5) using array data from Black and Hispanic participants; studies of Asian participants are not powered to detect variants OR <= 2. Using available and projected sequencing data from Black and Hispanic participants, risk variants with OR = 1.2 are detectable at high allele frequencies. Sample sizes remain insufficiently powered to detect these variants in Asian populations.
DISCUSSION
AD genetics datasets are largely representative of US ADRD burden. However, there is a wide discrepancy between proportional representation and statistically meaningful representation. Most variation identified in GWAS of non‐Hispanic White individuals have low to moderate effects. Comparable risk variants in non‐White populations are not detectable given current sample sizes, which could lead to disparities in future studies and drug development. We urge AD genetics researchers and institutions to continue investing in recruiting diverse participants and use community‐based participatory research practices.
Individuals with cystic fibrosis (CF) develop complications of the gastrointestinal tract influenced by genetic variants outside of CFTR. Cystic fibrosis-related diabetes (CFRD) is a distinct form of ...diabetes with a variable age of onset that occurs frequently in individuals with CF, while meconium ileus (MI) is a severe neonatal intestinal obstruction affecting ∼20% of newborns with CF. CFRD and MI are slightly correlated traits with previous evidence of overlap in their genetic architectures. To better understand the genetic commonality between CFRD and MI, we used whole-genome-sequencing data from the CF Genome Project to perform genome-wide association. These analyses revealed variants at 11 loci (6 not previously identified) that associated with MI and at 12 loci (5 not previously identified) that associated with CFRD. Of these, variants at SLC26A9, CEBPB, and PRSS1 associated with both traits; variants at SLC26A9 and CEBPB increased risk for both traits, while variants at PRSS1, the higher-risk alleles for CFRD, conferred lower risk for MI. Furthermore, common and rare variants within the SLC26A9 locus associated with MI only or CFRD only. As expected, different loci modify risk of CFRD and MI; however, a subset exhibit pleiotropic effects indicating etiologic and mechanistic overlap between these two otherwise distinct complications of CF.
Genetic modifiers play a significant role in two independent complications of cystic fibrosis (CF): neonatal intestinal obstruction and diabetes. Whole-genome sequencing followed by common and rare variant association identified pleiotropic loci displaying concordant and/or discordant modification of each trait, revealing unexpected mechanistic overlap between distinct complications of CF.
Introduction
Recent studies suggest that both sex‐specific genetic risk factors and those shared between dementia and stroke are involved in dementia pathogenesis.
Methods
We performed both ...single‐variant and gene‐based genome‐wide association studies of >11,000 whole genome sequences from the Women's Health Initiative cohort to discover loci associated with dementia, with adjustment for age, ethnicity, stroke, and venous thromboembolism status. Evidence for prior evidence of association and differential gene expression in dementia‐related tissues and samples was gathered for each locus.
Results
Our multiethnic studies identified significant associations between variants within APOE, MYH11, FZD3, SORCS3, and GOLGA8B and risk of dementia. Ten genes implicated by these loci, including MYH11, FZD3, SORCS3, and GOLGA8B, were differentially expressed in the context of Alzheimer's disease.
Discussion
Our association of MYH11, FZD3, SORCS3, and GOLGA8B with dementia is supported by independent functional studies in human subjects, model systems, and associations with shared risk factors for stroke and dementia.
Genetic studies have primarily been conducted in European ancestry populations, identifying dozens of loci associated with late-onset Alzheimer's disease (AD). However, much of AD's heritability ...remains unexplained; as the prevalence of AD varies across populations, the genetic architecture of the disease may also vary by population with the presence of novel variants or loci.
We conducted genome-wide analyses of AD in a sample of 2565 Caribbean Hispanics to better understand the genetic contribution to AD in this population. Statistical analysis included both admixture mapping and association testing. Evidence for differential gene expression within regions of interest was collected from independent transcriptomic studies comparing AD cases and controls in samples with primarily European ancestry.
Our genome-wide association study of AD identified no loci reaching genome-wide significance. However, a genome-wide admixture mapping analysis that tests for association between a haplotype's ancestral origin and AD status detected a genome-wide significant association with chromosome 3q13.11 (103.7-107.7Mb, P = 8.76E-07), driven by a protective effect conferred by the Native American ancestry (OR = 0.58, 95%CI = 0.47-0.73). Within this region, two variants were significantly associated with AD after accounting for the number of independent tests (rs12494162, P = 2.33E-06; rs1731642, P = 6.36E-05). The significant admixture mapping signal is composed of 15 haplotype blocks spanning 5 protein-coding genes (ALCAM, BBX, CBLB, CCDC54, CD47) and four brain-derived topologically associated domains, and includes markers significantly associated with the expression of ALCAM, BBX, CBLB, and CD47 in the brain. ALCAM and BBX were also significantly differentially expressed in the brain between AD cases and controls with European ancestry.
These results provide multiethnic evidence for a relationship between AD and multiple genes at 3q13.11 and illustrate the utility of leveraging genetic ancestry diversity via admixture mapping for new insights into AD.
Genetic testing has become routine for many inherited conditions; however, little is known about the unique issues that arise when offering genetic testing for inherited forms of dementia. To better ...understand the patient perspective, we surveyed study participants about their experiences as they underwent genetic counseling and genetic testing for dementia. We recruited 50 pairs of subjects. Each pair was comprised of one person with cognitive impairment and a cognitively intact co‐participant. Study participants received pre‐ and post‐test genetic counseling and comprehensive genetic testing for dementia. During the study, participant pairs completed four surveys which asked about their experience. Testing began with a 38 gene dementia panel. Participants with negative panel results or variants of uncertain significance (VUS) were reflexed to exome sequencing (ES). Twenty‐nine participants (58%) reported that their primary motivation to join the study was for the benefit to their families. Fifty‐two percent of participants initially planned to use their test results to make health and wellness changes, but, six months after disclosure, only 31% had done so. Six months after result disclosure, approximately 90% of participant pairs accurately recalled their genetic test results. Overall satisfaction with testing was high, and decision regret was negligible. This observational study describes the experiences of study participants undergoing genetic counseling and genetic testing for dementia and found that most participant pairs accurately recalled their results up to six months following disclosure while also maintaining high levels of satisfaction without decision regret. These findings suggest that, in the context of genetic counseling, genetic testing can be effectively used in this population.