Based on epidemiologic and embryologic patterns, nonsyndromic orofacial clefts- the most common craniofacial birth defects in humans- are commonly categorized into cleft lip with or without cleft ...palate (CL/P) and cleft palate alone (CP), which are traditionally considered to be etiologically distinct. However, some evidence of shared genetic risk in IRF6, GRHL3 and ARHGAP29 regions exists; only FOXE1 has been recognized as significantly associated with both CL/P and CP in genome-wide association studies (GWAS). We used a new statistical approach, PLACO (pleiotropic analysis under composite null), on a combined multi-ethnic GWAS of 2,771 CL/P and 611 CP case-parent trios. At the genome-wide significance threshold of 5 x 10.sup.-8, PLACO identified 1 locus in 1q32.2 (IRF6) that appears to increase risk for one OFC subgroup but decrease risk for the other. At a suggestive significance threshold of 10.sup.-6, we found 5 more loci with compelling candidate genes having opposite effects on CL/P and CP: 1p36.13 (PAX7), 3q29 (DLG1), 4p13 (LIMCH1), 4q21.1 (SHROOM3) and 17q22 (NOG). Additionally, we replicated the recognized shared locus 9q22.33 (FOXE1), and identified 2 loci in 19p13.12 (RAB8A) and 20q12 (MAFB) that appear to influence risk of both CL/P and CP in the same direction. We found locus-specific effects may vary by racial/ethnic group at these regions of genetic overlap, and failed to find evidence of sex-specific differences. We confirmed shared etiology of the two OFC subtypes comprising CL/P, and additionally found suggestive evidence of differences in their pathogenesis at 2 loci of genetic overlap. Our novel findings include 6 new loci of genetic overlap between CL/P and CP; 3 new loci between pairwise OFC subtypes; and 4 loci not previously implicated in OFCs. Our in-silico validation showed PLACO is robust to subtype-specific effects, and can achieve massive power gains over existing approaches for identifying genetic overlap between disease subtypes. In summary, we found suggestive evidence for new genetic regions and confirmed some recognized OFC genes either exerting shared risk or with opposite effects on risk to OFC subtypes.
Although de novo mutations (DNMs) are known to increase an individual’s risk of congenital defects, DNMs have not been fully explored regarding orofacial clefts (OFCs), one of the most common human ...birth defects. Therefore, whole-genome sequencing of 756 child-parent trios of European, Colombian, and Taiwanese ancestry was performed to determine the contributions of coding DNMs to an individual’s OFC risk. Overall, we identified a significant excess of loss-of-function DNMs in genes highly expressed in craniofacial tissues, as well as genes associated with known autosomal dominant OFC syndromes. This analysis also revealed roles for zinc-finger homeobox domain and SOX2-interacting genes in OFC etiology.
We used a deeply sequenced dataset of 910 individuals, all of African descent, to construct a set of DNA sequences that is present in these individuals but missing from the reference human genome. We ...aligned 1.19 trillion reads from the 910 individuals to the reference genome (GRCh38), collected all reads that failed to align, and assembled these reads into contiguous sequences (contigs). We then compared all contigs to one another to identify a set of unique sequences representing regions of the African pan-genome missing from the reference genome. Our analysis revealed 296,485,284 bp in 125,715 distinct contigs present in the populations of African descent, demonstrating that the African pan-genome contains ~10% more DNA than the current human reference genome. Although the functional significance of nearly all of this sequence is unknown, 387 of the novel contigs fall within 315 distinct protein-coding genes, and the rest appear to be intergenic.
DNA methylation may play an important role in schizophrenia (SZ), either directly as a mechanism of pathogenesis or as a biomarker of risk.
To scan genome-wide DNA methylation data to identify ...differentially methylated CpGs between SZ cases and controls.
Epigenome-wide association study begun in 2008 using DNA methylation levels of 456 513 CpG loci measured on the Infinium HumanMethylation450 array (Illumina) in a consortium of case-control studies for initial discovery and in an independent replication set. Primary analyses used general linear regression, adjusting for age, sex, race/ethnicity, smoking, batch, and cell type heterogeneity. The discovery set contained 689 SZ cases and 645 controls (n = 1334), from 3 multisite consortia: the Consortium on the Genetics of Endophenotypes in Schizophrenia, the Project among African-Americans To Explore Risks for Schizophrenia, and the Multiplex Multigenerational Family Study of Schizophrenia. The replication set contained 247 SZ cases and 250 controls (n = 497) from the Genomic Psychiatry Cohort.
Identification of differentially methylated positions across the genome in SZ cases compared with controls.
Of the 689 case participants in the discovery set, 477 (69%) were men and 258 (37%) were non-African American; of the 645 controls, 273 (42%) were men and 419 (65%) were non-African American. In our replication set, cases/controls were 76% male and 100% non-African American. We identified SZ-associated methylation differences at 923 CpGs in the discovery set (false discovery rate, <0.2). Of these, 625 showed changes in the same direction including 172 with P < .05 in the replication set. Some replicated differentially methylated positions are located in a top-ranked SZ region from genome-wide association study analyses.
This analysis identified 172 replicated new associations with SZ after careful correction for cell type heterogeneity and other potential confounders. The overlap with previous genome-wide association study data can provide potential insights into the functional relevance of genetic signals for SZ.
Abstract
Telomere length (TL) attrition, epigenetic age acceleration, and mitochondrial DNA copy number (mtDNAcn) decline are established hallmarks of aging. Each has been individually associated ...with Alzheimer’s dementia, cognitive function, and pathologic Alzheimer’s disease (AD). Epigenetic age and mtDNAcn have been studied in brain tissue directly but prior work on TL in brain is limited to small sample sizes and most studies have examined leukocyte TL. Importantly, TL, epigenetic age clocks, and mtDNAcn have not been studied jointly in brain tissue from an AD cohort. We examined dorsolateral prefrontal cortex (DLPFC) tissue from N = 367 participants of the Religious Orders Study (ROS) or the Rush Memory and Aging Project (MAP). TL and mtDNAcn were estimated from whole genome sequencing (WGS) data and cortical clock age was computed on 347 CpG sites. We examined dementia, MCI, and level of and change in cognition, pathologic AD, and three quantitative AD traits, as well as measures of other neurodegenerative diseases and cerebrovascular diseases (CVD). We previously showed that mtDNAcn from DLPFC brain tissue was associated with clinical and pathologic features of AD. Here, we show that those associations are independent of TL. We found TL to be associated with β-amyloid levels (beta = − 0.15,
p
= 0.023), hippocampal sclerosis (OR = 0.56,
p
= 0.0015) and cerebral atherosclerosis (OR = 1.44,
p
= 0.0007). We found strong associations between mtDNAcn and clinical measures of AD. The strongest associations with pathologic measures of AD were with cortical clock and there were associations of mtDNAcn with global AD pathology and tau tangles. Of the other pathologic traits, mtDNAcn was associated with hippocampal sclerosis, macroscopic infarctions and CAA and cortical clock was associated with Lewy bodies. Multi-modal age acceleration, accelerated aging on both mtDNAcn and cortical clock, had greater effect size than a single measure alone. These findings highlight for the first time that age acceleration determined on multiple genomic measures, mtDNAcn and cortical clock may have a larger effect on AD/AD related disorders (ADRD) pathogenesis than single measures.
Using GitHub Classroom To Teach Statistics Fiksel, Jacob; Jager, Leah R.; Hardin, Johanna S. ...
Journal of statistics education,
01/2019, Volume:
27, Issue:
2
Journal Article
Peer reviewed
Open access
Git and GitHub are common tools for keeping track of multiple versions of data analytic content, which allow for more than one person to simultaneously work on a project. GitHub Classroom aims to ...provide a way for students to work on and submit their assignments via Git and GitHub, giving teachers an opportunity to facilitate the integration of these version control tools into their undergraduate statistics courses. In the Fall 2017 semester, we implemented GitHub Classroom in two educational settings-an introductory computational statistics lab and a more advanced computational statistics course. We found many educational benefits of implementing GitHub Classroom, such as easily providing coding feedback during assignments and making students more confident in their ability to collaborate and use version control tools for future data science work. To encourage and ease the transition into using GitHub Classroom, we provide free and publicly available resources-both for students to begin using Git/GitHub and for teachers to use GitHub Classroom for their own courses.
Integration of metabolomics with genetics may advance understanding of disease pathogenesis but has been underused in asthma genetic studies.
We sought to discover new genetic effects in asthma and ...to characterize the molecular consequences of asthma genetic risk through integration with the metabolome in a homogeneous population.
From fasting serum samples collected on 348 Tangier Island residents, we quantified 2612 compounds using untargeted metabolomics. Genotyping was performed using Illumina’s MEGA array imputed to the TOPMed reference panel. To prioritize metabolites for genome-wide association analysis, we performed a metabolome-wide association study with asthma, selecting asthma-associated metabolites with heritability q value less than 0.01 for genome-wide association analysis. We also tested the association between all metabolites and 8451 candidate asthma single nucleotide polymorphisms previously associated with asthma in the UK Biobank. We followed up significant associations by characterizing shared genetic signal for metabolites and asthma using colocalization analysis. For detailed Methods, please see this article’s Online Repository at www.jacionline.org.
A total of 60 metabolites were associated with asthma (P < .01), including 40 heritable metabolites tested in genome-wide association analysis. We observed a strong association peak for the endocannabinoid linoleoyl ethanolamide on chromosome 6 in VNN1 (P < 2.7 × 10−9). We found strong evidence (colocalization posterior probability >75%) for a shared causal variant between 3 metabolites and asthma, including the polyamine acisoga and variants in LPP, and derivative leukotriene B4 and intergenic variants in chr10p14.
We identified novel metabolite quantitative trait loci with asthma associations. Identification and characterization of these genetically driven metabolites may provide insight into the functional consequences of genetic risk factors for asthma.
Background
There is interest in deriving megakaryocytes (MKs) from pluripotent stem cells (iPSC) for biological studies. We previously found that genomic structural integrity and genotype concordance ...is maintained in iPSC‐derived MKs.
Objective
To establish a comprehensive dataset of genes and proteins expressed in iPSC‐derived MKs.
Methods
iPSCs were reprogrammed from peripheral blood mononuclear cells (MNCs) and MKs were derived from the iPSCs in 194 healthy European American and African American subjects. mRNA was isolated and gene expression measured by RNA sequencing. Protein expression was measured in 62 of the subjects using mass spectrometry.
Results and Conclusions
MKs expressed genes and proteins known to be important in MK and platelet function and demonstrated good agreement with previous studies in human MKs derived from CD34+ progenitor cells. The percent of cells expressing the MK markers CD41 and CD42a was consistent in biological replicates, but variable across subjects, suggesting that unidentified subject‐specific factors determine differentiation of MKs from iPSCs. Gene and protein sets important in platelet function were associated with increasing expression of CD41/42a, while those related to more basic cellular functions were associated with lower CD41/42a expression. There was differential gene expression by the sex and race (but not age) of the subject. Numerous genes and proteins were highly expressed in MKs but not known to play a role in MK or platelet function; these represent excellent candidates for future study of hematopoiesis, platelet formation, and/or platelet function.
Genome-wide association studies have identified common variants associated with platelet-related phenotypes, but because these variants are largely intronic or intergenic, their link to platelet ...biology is unclear. In 290 normal subjects from the GeneSTAR Research Study (110 African Americans AAs and 180 European Americans EAs), we generated whole-genome sequence data from whole blood and RNA sequence data from extracted nonribosomal RNA from 185 induced pluripotent stem cell-derived megakaryocyte (MK) cell lines (platelet precursor cells) and 290 blood platelet samples from these subjects. Using eigenMT software to select the peak single-nucleotide polymorphism (SNP) for each expressed gene, and meta-analyzing the results of AAs and EAs, we identify (q-value < 0.05) 946 cis-expression quantitative trait loci (eQTLs) in derived MKs and 1830 cis-eQTLs in blood platelets. Among the 57 eQTLs shared between the 2 tissues, the estimated directions of effect are very consistent (98.2% concordance). A high proportion of detected cis-eQTLs (74.9% in MKs and 84.3% in platelets) are unique to MKs and platelets compared with peak-associated SNP-expressed gene pairs of 48 other tissue types that are reported in version V7 of the Genotype-Tissue Expression Project. The locations of our identified eQTLs are significantly enriched for overlap with several annotation tracks highlighting genomic regions with specific functionality in MKs, including MK-specific DNAse hotspots, H3K27-acetylation marks, H3K4-methylation marks, enhancers, and superenhancers. These results offer insights into the regulatory signature of MKs and platelets, with significant overlap in genes expressed, eQTLs detected, and enrichment within known superenhancers relevant to platelet biology.