Background: DNA methylation of CpG sites on genetic loci has been linked to increased risk of asthma in children exposed to elevated ambient air pollutants (AAPs). Further identification of specific CpG sites and the pollutants that are associated with methylation of these CpG sites in immune cells could impact our understanding of asthma pathophysiology. In this study, we sought to identify some CpG sites in specific genes that could be associated with asthma regulation (Foxp3 and IL10) and to identify the different AAPs for which exposure prior to the blood draw is linked to methylation levels at these sites. We recruited subjects from Fresno, California, an area known for high levels of AAPs. Blood samples and responses to questionnaires were obtained (n = 188), and in a subset of subjects (n = 33), repeat samples were collected 2 years later. Average measures of AAPs were obtained for 1, 15, 30, 90, 180, and 365 days prior to each blood draw to estimate the short-term vs. long-term effects of the AAP exposures. Results: Asthma was significantly associated with higher differentially methylated regions (DMRs) of the Foxp3 promoter region (p = 0.030) and the IL10 intronic region (p = 0.026). Additionally, at the 90-day time period (90 days prior to the blood draw), Foxp3 methylation was positively associated with NO2, CO, and PM2.5 exposures (p = 0.001, p = 0.001, and p = 0.012, respectively). In the subset of subjects retested 2 years later (n = 33), a positive association between AAP exposure and methylation was sustained. There was also a negative correlation between the average Foxp3 methylation of the promoter region and activated Treg levels (p = 0.039) and a positive correlation between the average IL10 methylation of region 3 of intron 4 and IL10 cytokine expression (p = 0.030).
Conclusions: Short-term and long-term exposures to high levels of CO, NO2, and PM2.5 were associated with alterations in differentially methylated regions of Foxp3. IL10 methylation showed a similar trend. For any given individual, these changes tend to be sustained over time. In addition, asthma was associated with higher differentially methylated regions of Foxp3 and IL10. Keywords: Ambient air pollution, Immune system, Regulatory T cell, Epigenetics
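The exposure windows described above (averages over 1, 15, 30, 90, 180, and 365 days before each blood draw) can be sketched as follows. This is a minimal illustration, not the study's pipeline: the function name window_means, the date-to-reading input format, and the choice to count days strictly before the draw date are all assumptions.

```python
from datetime import date, timedelta

# Averaging windows (in days before the blood draw) named in the abstract.
WINDOWS = (1, 15, 30, 90, 180, 365)


def window_means(daily, draw_date, windows=WINDOWS):
    """Return {window: mean reading} over the `window` days before draw_date.

    `daily` maps a date to one pollutant reading (hypothetical input format).
    Days with no reading are skipped; a window with no readings yields None.
    Assumption: the draw date itself is excluded from every window.
    """
    means = {}
    for w in windows:
        values = [
            daily[draw_date - timedelta(days=k)]
            for k in range(1, w + 1)
            if draw_date - timedelta(days=k) in daily
        ]
        means[w] = sum(values) / len(values) if values else None
    return means


# Toy series: a constant 10.0 with one elevated reading the day before the draw.
draw = date(2010, 6, 1)
series = {draw - timedelta(days=k): 10.0 for k in range(1, 366)}
series[draw - timedelta(days=1)] = 40.0
exposure = window_means(series, draw)
```

In the toy series the single elevated reading dominates the 1-day average and is diluted in the longer windows, which is the contrast between short-term and long-term exposure that the windows are meant to capture.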
Efforts to identify gene variants associated with susceptibility to common diseases use three approaches: pedigree and affected sib-pair linkage studies and association studies of population samples. The different aims of these study designs reflect their derivation from biological versus epidemiological traditions. Similar principles regarding determination of the evidence levels required to consider the results statistically significant apply to both linkage and association studies, however. Such determination requires explicit attention to the prior probability of particular findings, as well as appropriate correction for multiple comparisons. For most common diseases, increasing the sample size in a study is a crucial step in achieving statistically significant genetic mapping results. Recent studies suggest that the technology and statistical methodology will soon be available to make well-powered studies feasible using any of these approaches.
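The role of prior probability and multiple-comparison correction can be made concrete with a small calculation. The sketch below applies Bayes' rule to the chance that a nominally significant result reflects a true effect, and uses a Bonferroni threshold as one possible correction. All numbers are illustrative assumptions, not values from the source, and power is held fixed across thresholds for simplicity even though a stricter threshold would normally lower it.

```python
def posterior_true_positive(prior, alpha, power):
    """Probability that a 'significant' result reflects a true effect (Bayes' rule)."""
    true_hits = power * prior            # true effects that reach significance
    false_hits = alpha * (1.0 - prior)   # null effects that reach significance
    return true_hits / (true_hits + false_hits)


def bonferroni_alpha(family_alpha, n_tests):
    """Per-test threshold that keeps the family-wise error rate at family_alpha."""
    return family_alpha / n_tests


# Illustrative numbers only (not from the source): 1 in 10,000 tested variants
# is truly associated, and the study has 80% power at either threshold.
prior = 1e-4
nominal = posterior_true_positive(prior, alpha=0.05, power=0.8)
strict = posterior_true_positive(
    prior, alpha=bonferroni_alpha(0.05, 10_000), power=0.8
)
```

Under these assumed numbers, a result significant at the nominal 0.05 level is almost certainly a false positive, while a result passing the corrected threshold is much more likely to be real. This is why the evidence level has to be set with both the prior and the number of tests in mind.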
Schizophrenia is a severe psychiatric disease with complex etiology, affecting approximately 1% of the general population. Most genetics studies so far have focused on disease association with common genetic variation, such as single-nucleotide polymorphisms (SNPs), but it has recently become apparent that large-scale genomic copy-number variants (CNVs) are involved in disease development as well. To assess the role of rare CNVs in schizophrenia, we screened 54 patients with deficit schizophrenia using Affymetrix's GeneChip 250K SNP arrays. We identified 90 CNVs in total, 77 of which have been reported previously in unaffected control cohorts. Among the genes disrupted by the remaining rare CNVs are MYT1L, CTNND2, NRXN1, and ASTN2, genes that play an important role in neuronal functioning but, except for NRXN1, have not been associated with schizophrenia before. We studied the occurrence of CNVs at these four loci in an additional cohort of 752 patients and 706 normal controls from The Netherlands. We identified eight additional CNVs, of which the four that affect coding sequences were found only in the patient cohort. Our study supports a role for rare CNVs in schizophrenia susceptibility and identifies at least three candidate genes for this complex disorder.
Structural variation in the complement 4 gene (C4) confers genetic risk for schizophrenia. This variation includes increased C4A copy number, which predicts increased C4A mRNA expression. C4-anaphylatoxin (C4-ana) is a C4 protein fragment released upon C4 protein activation that has the potential to change the blood-brain barrier (BBB). We hypothesized that elevated plasma levels of C4-ana occur in individuals with schizophrenia (iSCZ). Blood was collected from 15 iSCZ with illness duration < 5 years and from 14 healthy controls (HC). Plasma C4-ana was measured by radioimmunoassay. Other complement activation products, C3-ana, C5-ana, and terminal complement complex (TCC), were also measured. Digital-droplet PCR was used to determine C4 gene structural variation state. Recombinant C4-ana was added to primary brain endothelial cells (BEC) and permeability was measured in vitro. C4-ana concentration was elevated in plasma from iSCZ compared to HC (mean = 654 ± 16 ng/mL vs. 557 ± 94 ng/mL, respectively; p = 0.01). The patients also carried more copies of the C4AL gene and demonstrated a positive correlation between plasma C4-ana concentrations and C4A gene copy number. Furthermore, C4-ana increased the permeability of a monolayer of BEC in vitro. Our findings are consistent with a specific role for C4A protein in schizophrenia and raise the possibility that its activation product, C4-ana, increases BBB permeability. Exploratory analyses suggest the novel hypothesis that the relationship between C4-ana levels and C4A gene copy number could also be altered in iSCZ, suggesting an interaction with unknown genetic and/or environmental risk factors.
High-dimensional data sets generated by high-throughput technologies, such as DNA microarrays, are often the outputs of complex networked systems driven by hidden regulatory signals. Traditional statistical methods for computing low-dimensional or hidden representations of these data sets, such as principal component analysis and independent component analysis, ignore the underlying network structures and provide decompositions based purely on a priori statistical constraints on the computed component signals. The resulting decomposition thus provides a phenomenological model for the observed data and does not necessarily contain physically or biologically meaningful signals. Here, we develop a method, called network component analysis, for uncovering hidden regulatory signals from outputs of networked systems, when only a partial knowledge of the underlying network topology is available. The a priori network structure information is first tested for compliance with a set of identifiability criteria. For networks that satisfy the criteria, the signals from the regulatory nodes and their strengths of influence on each output node can be faithfully reconstructed. This method is first validated experimentally by using the absorbance spectra of a network of various hemoglobin species. The method is then applied to microarray data generated from the yeast Saccharomyces cerevisiae, and the activities of various transcription factors during the cell cycle are reconstructed by using recently discovered connectivity information for the underlying transcriptional regulatory networks.
Down Syndrome cell adhesion molecule (Dscam) genes encode neuronal cell recognition proteins of the immunoglobulin superfamily. In Drosophila, Dscam1 generates 19,008 different ectodomains by alternative splicing of three exon clusters, each encoding half or a complete variable immunoglobulin domain. Identical isoforms bind to each other, but rarely to isoforms differing at any one of the variable immunoglobulin domains. Binding between isoforms on opposing membranes promotes repulsion. Isoform diversity provides the molecular basis for neurite self-avoidance. Self-avoidance refers to the tendency of branches from the same neuron (self-branches) to selectively avoid one another. To ensure that repulsion is restricted to self-branches, different neurons express different sets of isoforms in a biased stochastic fashion. Genetic studies demonstrated that Dscam1 diversity has a profound role in wiring the fly brain. Here we show how many isoforms are required to provide an identification system that prevents non-self branches from inappropriately recognizing each other. Using homologous recombination, we generated mutant animals encoding 12, 24, 576 and 1,152 potential isoforms. Mutant animals with deletions encoding 4,752 and 14,256 isoforms were also analysed. Branching phenotypes were assessed in three classes of neurons. Branching patterns improved as the potential number of isoforms increased, and this was independent of the identity of the isoforms. Although branching defects in animals with 1,152 potential isoforms remained substantial, animals with 4,752 isoforms were indistinguishable from wild-type controls. Mathematical modelling studies were consistent with the experimental results that thousands of isoforms are necessary to ensure acquisition of unique Dscam1 identities in many neurons.
We conclude that thousands of isoforms are essential to provide neurons with a robust discrimination mechanism to distinguish between self and non-self during self-avoidance.
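The intuition behind the isoform-number result can be illustrated with a simple probability model. The sketch below is not the mathematical model used in the study. It assumes that each neuron expresses a fixed number of distinct isoforms drawn uniformly at random from the pool (the source describes a biased stochastic choice), and that two neurons inappropriately recognize each other when they share at least a threshold number of isoforms. The values of 10 expressed isoforms and a threshold of 3 are hypothetical.

```python
from math import comb


def prob_shared_at_least(n_isoforms, k_expressed, m_shared):
    """P(two neurons share >= m_shared isoforms) under a hypergeometric model.

    Assumes each neuron expresses k_expressed distinct isoforms drawn
    uniformly at random from n_isoforms, independently of other neurons.
    """
    if k_expressed > n_isoforms:
        raise ValueError("a neuron cannot express more isoforms than exist")
    total = comb(n_isoforms, k_expressed)
    # Count draws for the second neuron that overlap the first in exactly j isoforms.
    hits = sum(
        comb(k_expressed, j) * comb(n_isoforms - k_expressed, k_expressed - j)
        for j in range(m_shared, k_expressed + 1)
    )
    return hits / total


# Pool sizes are the isoform counts named in the abstract; k and m are hypothetical.
POOLS = (12, 24, 576, 1152, 4752, 14256, 19008)
overlap_risk = {n: prob_shared_at_least(n, k_expressed=10, m_shared=3) for n in POOLS}
```

Under these assumptions the chance that two neurons share enough isoforms to be mistaken for self falls steeply as the pool grows, which matches the qualitative trend reported above. The absolute values depend entirely on the assumed k and m.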
Abnormalities in sleep and circadian rhythms are central features of bipolar disorder (BP), often persisting between episodes. We report here, to our knowledge, the first systematic analysis of circadian rhythm activity in pedigrees segregating severe BP (BP-I). By analyzing actigraphy data obtained from members of 26 Costa Rican and Colombian pedigrees (136 euthymic [i.e., interepisode] BP-I individuals and 422 non–BP-I relatives), we delineated 73 phenotypes, of which 49 demonstrated significant heritability and 13 showed significant trait-like association with BP-I. All BP-I–associated traits related to activity level, with BP-I individuals consistently demonstrating lower activity levels than their non–BP-I relatives. We analyzed all 49 heritable phenotypes using genetic linkage analysis, with special emphasis on phenotypes judged to have the strongest impact on the biology underlying BP. We identified a locus for interdaily stability of activity, at a threshold exceeding genome-wide significance, on chromosome 12pter, a region that also showed pleiotropic linkage to two additional activity phenotypes.
Abstract: We consider problems where many, somewhat redundant, hypotheses are tested and we are interested in reporting the most precise rejections, with false discovery rate (FDR) control. This is the case, for example, when researchers are interested both in individual hypotheses as well as group hypotheses corresponding to intersections of sets of the original hypotheses, at several resolution levels. A concrete application is in genome-wide association studies, where, depending on the signal strengths, it might be possible to resolve the influence of individual genetic variants on a phenotype with greater or lower precision. To adapt to the unknown signal strength, analyses are conducted at multiple resolutions and researchers are most interested in the more precise discoveries. Assuring FDR control on the reported findings with these adaptive searches is, however, often impossible. To design a multiple comparison procedure that allows for an adaptive choice of resolution with FDR control, we leverage e-values and linear programming. We adapt this approach to problems where knockoffs and group knockoffs have been successfully applied to test conditional independence hypotheses. We demonstrate its efficacy by analysing data from the UK Biobank.
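As background for how e-values can give FDR control, the sketch below implements the standard e-BH procedure for a single, fixed set of hypotheses. It is not the multi-resolution, linear-programming method described in this abstract, only the simpler building block that such methods extend. The function name e_bh and the example e-values are illustrative assumptions.

```python
def e_bh(e_values, alpha):
    """Return the sorted indices rejected by the e-BH procedure at level alpha.

    With m hypotheses and e-values sorted from largest to smallest, find the
    largest k such that the k-th largest e-value is at least m / (alpha * k),
    then reject the k hypotheses with the largest e-values.
    """
    m = len(e_values)
    order = sorted(range(m), key=lambda i: e_values[i], reverse=True)
    k_star = 0
    for k, idx in enumerate(order, start=1):
        if e_values[idx] >= m / (alpha * k):
            k_star = k  # keep the largest k that passes (step-up rule)
    return sorted(order[:k_star])


# Hypothetical e-values for four hypotheses, tested at FDR level 0.1.
rejected = e_bh([100.0, 50.0, 1.0, 0.5], alpha=0.1)
```

Here the thresholds for k = 1, 2, 3, 4 are 40, 20, 13.3 and 10, so the two largest e-values pass and hypotheses 0 and 1 are rejected. Because this guarantee does not depend on how the e-values are correlated, e-values are a convenient currency for combining overlapping individual and group hypotheses.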
Bipolar disorder is a highly heritable illness, associated with alterations of brain structure. As such, identification of genes influencing inter-individual differences in brain morphology may help elucidate the underlying pathophysiology of bipolar disorder (BP). To identify quantitative trait loci (QTL) that contribute to phenotypic variance of brain structure, structural neuroimages were acquired from family members (n = 527) of extended pedigrees heavily loaded for bipolar disorder ascertained from genetically isolated populations in Latin America. Genome-wide linkage and association analyses were conducted on the subset of heritable brain traits that showed significant evidence of association with bipolar disorder (n = 24) to map QTL influencing regional measures of brain volume and cortical thickness. Two chromosomal regions showed significant evidence of linkage: a QTL on chromosome 1p influencing corpus callosum volume and a region on chromosome 7p linked to cortical volume. Association analysis within the two QTLs identified three SNPs correlated with the brain measures.
Motivation: In systems like Escherichia coli, the abundance of sequence information, gene expression array studies and small scale experiments allows one to reconstruct the regulatory network and to quantify the effects of transcription factors on gene expression. However, this goal can only be achieved if all information sources are used in concert. Results: Our method integrates literature information, DNA sequences and expression arrays. A set of relevant transcription factors is defined on the basis of literature. Sequence data are used to identify potential target genes and the results are used to define a prior distribution on the topology of the regulatory network. A Bayesian hidden component model for the expression array data allows us to identify which of the potential binding sites are actually used by the regulatory proteins in the studied cell conditions, the strength of their control, and their activation profile in a series of experiments. We apply our methodology to 35 expression studies in E. coli with convincing results. Availability: Supplementary information: Supplementary material is available at Bioinformatics online. Contact: csabatti@mednet.ucla.edu