Evidence is limited as to whether heritable risk of obesity varies throughout adulthood. Among >34,000 European Americans, aged 18–100 years, from multiple U.S. studies in the Population Architecture ...using Genomics and Epidemiology (PAGE) Consortium, we examined evidence for heterogeneity in the associations of five established obesity risk variants (near FTO, GNPDA2, MTCH2, TMEM18, and NEGR1) with BMI across four distinct epochs of adulthood: 1) young adulthood (ages 18–25 years), adulthood (ages 26–49 years), middle-age adulthood (ages 50–69 years), and older adulthood (ages ≥70 years); or 2) by menopausal status in women and stratification by age 50 years in men. Summary-effect estimates from each meta-analysis were compared for heterogeneity across the life epochs. We found heterogeneity in the association of the FTO (rs8050136) variant with BMI across the four adulthood epochs (P = 0.0006), with larger effects in young adults relative to older adults (β SE = 1.17 0.45 vs. 0.09 0.09 kg/m2, respectively, per A allele) and smaller intermediate effects. We found no evidence for heterogeneity in the association of GNPDA2, MTCH2, TMEM18, and NEGR1 with BMI across adulthood. Genetic predisposition to obesity may have greater effects on body weight in young compared with older adulthood for FTO, suggesting changes by age, generation, or secular trends. Future research should compare and contrast our findings with results using longitudinal data.
Most association studies focus on disease risk, with less attention paid to disease progression or severity. These phenotypes require longitudinal data. This paper presents a new method for analyzing ...longitudinal data to map genes in both population-based and family-based studies. Using simulated systolic blood pressure measurements obtained from Genetic Analysis Workshop 18, we cluster the phenotype data into trajectory subgroups. We then use the Bayesian posterior probability of being in the high subgroup as a quantitative trait in an association analysis with genotype data. This method maintains high power (>80%) in locating genes known to affect the simulated phenotype for most specified significance levels (α). We believe that this method can be useful to aid in the discovery of genes that affect severity or progression of disease.
As with any new technology, next-generation sequencing (NGS) has potential advantages and potential challenges. One advantage is the identification of multiple causal variants for disease that might ...otherwise be missed by SNP-chip technology. One potential challenge is misclassification error (as with any emerging technology) and the issue of power loss due to multiple testing. Here, we develop an extension of the linear trend test for association that incorporates differential misclassification error and may be applied to any number of SNPs. We call the statistic the linear trend test allowing for error, applied to NGS, or LTT
ae,NGS. This statistic allows for differential misclassification. The observed data are phenotypes for unrelated cases and controls, coverage, and the number of putative causal variants for every individual at all SNPs. We simulate data considering multiple factors (disease mode of inheritance, genotype relative risk, causal variant frequency, sequence error rate in cases, sequence error rate in controls, number of loci, and others) and evaluate type I error rate and power for each vector of factor settings. We compare our results with two recently published NGS statistics. Also, we create a fictitious disease model based on downloaded 1000 Genomes data for 5 SNPs and 388 individuals, and apply our statistic to those data. We find that the LTT
ae,NGS maintains the correct type I error rate in all simulations (differential and non-differential error), while the other statistics show large inflation in type I error for lower coverage. Power for all three methods is approximately the same for all three statistics in the presence of non-differential error. Application of our statistic to the 1000 Genomes data suggests that, for the data downloaded, there is a 1.5% sequence misclassification rate over all SNPs. Finally, application of the multi-variant form of LTT
ae,NGS shows high power for a number of simulation settings, although it can have lower power than the corresponding single-variant simulation results, most probably due to our specification of multivariant SNP correlation values. In conclusion, our LTT
ae,NGS addresses two key challenges with NGS disease studies; first, it allows for differential misclassification when computing the statistic; and second, it addresses the multiple-testing issue in that there is a multi-variant form of the statistic that has only one degree of freedom, and provides a single p value, no matter how many loci.
The laboratory rat, Rattus novegicus, is a major model system for physiological and pathophysiological studies, and since 1966 more than 422,000 publications describe biological studies on the rat ...(NCBI/Medline). The rat is becoming an increasingly important genetic model for the study of specific diseases, as well as retaining its role as a major preclinical model system for pharmaceutical development. The initial genetic linkage map of the rat contained 432 genetic markers (Jacob et al. 1995) out of 1171 developed due to the relatively low polymorphism rate of the mapping cross used (SHR × BN) when compared to the interspecific crosses in the mouse. While the rat genome project continues to localize additional markers on the linkage map, and as of 11/97 more than 3,200 loci have been mapped. Current map construction is using two different crosses (SHRSP × BN and FHH × ACI) rather than the initial mapping cross. Consequently there is a need to provide integration among the different maps. We set out to develop an integrated map, as well as increase the number of markers on the rat genetic map.The crosses available for this analysis included the original mapping cross SHR × BN reciprocal F2 intercross (448 markers), a GH × BN intercross (205 markers), a SS/Mcw × BN intercross (235 markers), and a FHH/Eur × ACI/Hsd intercross (276 markers), which is also one of the new mapping crosses. Forty-six animals from each cross were genotyped with markers polymorphic for that cross. The maps appear to cover the vast majority of the rat genome. The availability of these additional markers should facilitate more complete whole genome scans in a greater number of strains and provide additional markers in specific genomic regions of interest.
: Human cognition in normal and disease states is both environmentally and genetically mediated. Except for measures of language-specific abilities, however, few cognitive measures have been ...associated with specific genes or chromosomal regions. We performed genome-wide linkage analysis of five neuropsychological tests in the Collaborative Study on Genetics of Alcoholism sample. The sample included 1579 individuals (53% female, 76% White Non-Hispanic) in 217 families. There were 390 markers with mean inter-marker distance of 9.6 cM. Performance on the Digit Symbol Substitution Test, a component of the Wechsler Adult Intelligence Scale-R, showed significant linkage to 14q11.2 and suggestive linkage to 14q 24.2. This test of sustained visual attention also involves visual-motor coordination and executive functions. Performance on the WAIS-R Digit Span Test of immediate memory and mental flexibility showed suggestive linkage to 11q 25. Although the validity of these results beyond populations with a susceptibility for alcohol dependence is unclear, these results are among the first linkage results for non-language components of cognition.
We performed a hypothesis-generating phenome-wide association study (PheWAS) to identify and characterize cross-phenotype associations, where one SNP is associated with two or more phenotypes, ...between thousands of genetic variants assayed on the Metabochip and hundreds of phenotypes in 5,897 African Americans as part of the Population Architecture using Genomics and Epidemiology (PAGE) I study. The PAGE I study was a National Human Genome Research Institute-funded collaboration of four study sites accessing diverse epidemiologic studies genotyped on the Metabochip, a custom genotyping chip that has dense coverage of regions in the genome previously associated with cardio-metabolic traits and outcomes in mostly European-descent populations. Here we focus on identifying novel phenome-genome relationships, where SNPs are associated with more than one phenotype. To do this, we performed a PheWAS, testing each SNP on the Metabochip for an association with up to 273 phenotypes in the participating PAGE I study sites. We identified 133 putative pleiotropic variants, defined as SNPs associated at an empirically derived p-value threshold of p<0.01 in two or more PAGE study sites for two or more phenotype classes. We further annotated these PheWAS-identified variants using publicly available functional data and local genetic ancestry. Amongst our novel findings is SPARC rs4958487, associated with increased glucose levels and hypertension. SPARC has been implicated in the pathogenesis of diabetes and is also known to have a potential role in fibrosis, a common consequence of multiple conditions including hypertension. The SPARC example and others highlight the potential that PheWAS approaches have in improving our understanding of complex disease architecture by identifying novel relationships between genetic variants and an array of common human phenotypes.
Lipid levels are important markers for the development of cardio-metabolic diseases. Although hundreds of associated loci have been identified through genetic association studies, the contribution of ...genetic factors to variation in lipids is not fully understood, particularly in U.S. minority groups. We performed genome-wide association analyses for four lipid traits in over 45,000 ancestrally diverse participants from the Population Architecture using Genomics and Epidemiology (PAGE) Study, followed by a meta-analysis with several European ancestry studies. We identified nine novel lipid loci, five of which showed evidence of replication in independent studies. Furthermore, we discovered one novel gene in a PrediXcan analysis, minority-specific independent signals at eight previously reported loci, and potential functional variants at two known loci through fine-mapping. Systematic examination of known lipid loci revealed smaller effect estimates in African American and Hispanic ancestry populations than those in Europeans, and better performance of polygenic risk scores based on minority-specific effect estimates. Our findings provide new insight into the genetic architecture of lipid traits and highlight the importance of conducting genetic studies in diverse populations in the era of precision medicine.
Dubowitz syndrome (DubS) is considered a recognizable syndrome characterized by a distinctive facial appearance and deficits in growth and development. There have been over 200 individuals reported ...with Dubowitz or a “Dubowitz‐like” condition, although no single gene has been implicated as responsible for its cause. We have performed exome (ES) or genome sequencing (GS) for 31 individuals clinically diagnosed with DubS. After genome‐wide sequencing, rare variant filtering and computational and Mendelian genomic analyses, a presumptive molecular diagnosis was made in 13/27 (48%) families. The molecular diagnoses included biallelic variants in SKIV2L, SLC35C1, BRCA1, NSUN2; de novo variants in ARID1B, ARID1A, CREBBP, POGZ, TAF1, HDAC8, and copy‐number variation at1p36.11(ARID1A), 8q22.2(VPS13B), Xp22, and Xq13(HDAC8). Variants of unknown significance in known disease genes, and also in genes of uncertain significance, were observed in 7/27 (26%) additional families. Only one gene, HDAC8, could explain the phenotype in more than one family (N = 2). All but two of the genomic diagnoses were for genes discovered, or for conditions recognized, since the introduction of next‐generation sequencing. Overall, the DubS‐like clinical phenotype is associated with extensive locus heterogeneity and the molecular diagnoses made are for emerging clinical conditions sharing characteristic features that overlap the DubS phenotype.
Most body mass index (BMI) genetic loci have been identified in studies of primarily European ancestries. The effect of these loci in other racial/ethnic groups is less clear. Thus, we aimed to ...characterize the generalizability of 170 established BMI variants, or their proxies, to diverse US populations and trans-ethnically fine-map 36 BMI loci using a sample of >102,000 adults of African, Hispanic/Latino, Asian, European and American Indian/Alaskan Native descent from the Population Architecture using Genomics and Epidemiology Study. We performed linear regression of the natural log of BMI (18.5–70 kg/m
2
) on the additive single nucleotide polymorphisms (SNPs) at BMI loci on the MetaboChip (Illumina, Inc.), adjusting for age, sex, population stratification, study site, or relatedness. We then performed fixed-effect meta-analyses and a Bayesian trans-ethnic meta-analysis to empirically cluster by allele frequency differences. Finally, we approximated conditional and joint associations to test for the presence of secondary signals. We noted directional consistency with the previously reported risk alleles beyond what would have been expected by chance (binomial
p
< 0.05). Nearly, a quarter of the previously described BMI index SNPs and 29 of 36 densely-genotyped BMI loci on the MetaboChip replicated/generalized in trans-ethnic analyses. We observed multiple signals at nine loci, including the description of seven loci with novel multiple signals. This study supports the generalization of most common genetic loci to diverse ancestral populations and emphasizes the importance of dense multiethnic genomic data in refining the functional variation at genetic loci of interest and describing several loci with multiple underlying genetic variants.
Lipid levels are important markers for the development of cardio-metabolic diseases. Although hundreds of associated loci have been identified through genetic association studies, the contribution of ...genetic factors to variation in lipids is not fully understood, particularly in U.S. minority groups. We performed genome-wide association analyses for four lipid traits in over 45,000 ancestrally diverse participants from the Population Architecture using Genomics and Epidemiology (PAGE) Study, followed by a meta-analysis with several European ancestry studies. We identified nine novel lipid loci, five of which showed evidence of replication in independent studies. Furthermore, we discovered one novel gene in a PrediXcan analysis, minority-specific independent signals at eight previously reported loci, and potential functional variants at two known loci through fine-mapping. Systematic examination of known lipid loci revealed smaller effect estimates in African American and Hispanic ancestry populations than those in Europeans, and better performance of polygenic risk scores based on minority-specific effect estimates. Our findings provide new insight into the genetic architecture of lipid traits and highlight the importance of conducting genetic studies in diverse populations in the era of precision medicine.