Schizophrenia is a heritable brain illness with unknown pathogenic mechanisms. Schizophrenia's strongest genetic association at a population level involves variation in the major histocompatibility ...complex (MHC) locus, but the genes and molecular mechanisms accounting for this have been challenging to identify. Here we show that this association arises in part from many structurally diverse alleles of the complement component 4 (C4) genes. We found that these alleles generated widely varying levels of C4A and C4B expression in the brain, with each common C4 allele associating with schizophrenia in proportion to its tendency to generate greater expression of C4A. Human C4 protein localized to neuronal synapses, dendrites, axons, and cell bodies. In mice, C4 mediated synapse elimination during postnatal development. These results implicate excessive complement activity in the development of schizophrenia and may help explain the reduced numbers of synapses in the brains of individuals with schizophrenia.
By analyzing the exomes of 12,332 unrelated Swedish individuals, including 4,877 individuals affected with schizophrenia, in ways informed by exome sequences from 45,376 other individuals, we ...identified 244,246 coding-sequence and splice-site ultra-rare variants (URVs) that were unique to individual Swedes. We found that gene-disruptive and putatively protein-damaging URVs (but not synonymous URVs) were more abundant among individuals with schizophrenia than among controls (P = 1.3 × 10
). This elevation of protein-compromising URVs was several times larger than an analogously elevated rate for de novo mutations, suggesting that most rare-variant effects on schizophrenia risk are inherited. Among individuals with schizophrenia, the elevated frequency of protein-compromising URVs was concentrated in brain-expressed genes, particularly in neuronally expressed genes; most of this elevation arose from large sets of genes whose RNAs have been found to interact with synaptically localized proteins. Our results suggest that synaptic dysfunction may mediate a large fraction of strong, individually rare genetic influences on schizophrenia risk.
The genetic architecture of schizophrenia is complex, involving risk alleles ranging from common alleles of weak effect to rare alleles of large effect, the best exemplar of the latter being large ...copy number variants (CNVs). It is currently unknown whether pathophysiology in those with defined rare mutations overlaps with that in other individuals with the disorder who do not share the same rare mutation. Under an extreme heterogeneity model, carriers of specific high-penetrance mutations form distinct subgroups. In contrast, under a polygenic threshold model, high-penetrance rare allele carriers possess many risk factors, of which the rare allele is the only one, albeit an important, factor. Under the latter model, cases with rare mutations can be expected to share some common risk alleles, and therefore pathophysiological mechanisms, with cases without the same mutation. Here we show that, compared with controls, individuals with schizophrenia who have known pathogenic CNVs carry an excess burden of common risk alleles (P=2.25 × 10(-17)) defined from a genome-wide association study largely based on individuals without known CNVs. Our finding is not consistent with an extreme heterogeneity model for CNV carriers, but does offer support for the polygenic threshold model of schizophrenia. That this is so provides support for the notion that studies aiming to model the effects of rare variation may uncover pathophysiological mechanisms of relevance to those with the disorder more widely.
Appreciating the contribution of human genome copy-number variation (CNV) to clinical phenotypes is one of the compelling genetics challenges of the coming years. It is increasingly possible to ...pursue such investigations as an extension of genome-wide association studies (GWAS), enabled by innovations in the design and analysis of SNP (single nucleotide polymorphism) arrays and by progress in determining the genomic locations and population-genetic properties of the CNVs that segregate in the human population. Extensions of GWAS to CNV have already resulted in discoveries of both de novo and inherited CNV that are associated with risk of common disease. This review will discuss new approaches, recent findings and the analytical challenges involved in expanding GWAS to appreciate the contribution of CNV to human phenotypes.
Schizophrenia (SCZ) is a highly heritable neuropsychiatric disorder of complex genetic etiology. Previous genome-wide surveys have revealed a greater burden of large, rare copy number variations ...(CNVs) in SCZ cases and identified multiple rare recurrent CNVs that increase risk of SCZ although with incomplete penetrance and pleiotropic effects. Identification of additional recurrent CNVs and biological pathways enriched for SCZ CNVs requires greater sample sizes. We conducted a genome-wide survey for CNVs associated with SCZ using a Swedish national sample (4719 cases and 5917 controls). High-confidence CNV calls were generated using genotyping array intensity data, and their effect on risk of SCZ was measured. Our data confirm increased burden of large, rare CNVs in SCZ cases as well as significant associations for recurrent 16p11.2 duplications, 22q11.2 deletions and 3q29 deletions. We report a novel association for 17q12 duplications (odds ratio=4.16, P=0.018), previously associated with autism and mental retardation but not SCZ. Intriguingly, gene set association analyses implicate biological pathways previously associated with SCZ through common variation and exome sequencing (calcium channel signaling and binding partners of the fragile X mental retardation protein). We found significantly increased burden of the largest CNVs (>500 kb) in genes present in the postsynaptic density, in genomic regions implicated via SCZ genome-wide association studies and in gene products localized to mitochondria and cytoplasm. Our findings suggest that multiple lines of genomic inquiry--genome-wide screens for CNVs, common variation and exonic variation--are converging on similar sets of pathways and/or genes.
Sequencing of gene-coding regions (the exome) is increasingly used for studying human disease, for which copy-number variants (CNVs) are a critical genetic component. However, detecting copy number ...from exome sequencing is challenging because of the noncontiguous nature of the captured exons. This is compounded by the complex relationship between read depth and copy number; this results from biases in targeted genomic hybridization, sequence factors such as GC content, and batching of samples during collection and sequencing. We present a statistical tool (exome hidden Markov model XHMM) that uses principal-component analysis (PCA) to normalize exome read depth and a hidden Markov model (HMM) to discover exon-resolution CNV and genotype variation across samples. We evaluate performance on 90 schizophrenia trios and 1,017 case-control samples. XHMM detects a median of two rare (<1%) CNVs per individual (one deletion and one duplication) and has 79% sensitivity to similarly rare CNVs overlapping three or more exons discovered with microarrays. With sensitivity similar to state-of-the-art methods, XHMM achieves higher specificity by assigning quality metrics to the CNV calls to filter out bad ones, as well as to statistically genotype the discovered CNV in all individuals, yielding a trio call set with Mendelian-inheritance properties highly consistent with expectation. We also show that XHMM breakpoint quality scores enable researchers to explicitly search for novel classes of structural variation. For example, we apply XHMM to extract those CNVs that are highly likely to disrupt (delete or duplicate) only a portion of a gene.
Large (>100 kb), rare (<1% in the population) copy number variants (CNVs) have been shown to confer risk for schizophrenia (SZ), but the findings for bipolar disorder (BD) are less clear. In a new BD ...sample from the United Kingdom (n=2591), we have examined the occurrence of CNVs and compared this with previously reported samples of 6882 SZ and 8842 control subjects. When combined with previous data, we find evidence for a contribution to BD for three SZ-associated CNV loci: duplications at 1q21.1 (P=0.022), deletions at 3q29 (P=0.03) and duplications at 16p11.2 (P=2.3 × 10(-4)). The latter survives multiple-testing correction for the number of recurrent large CNV loci in the genome. Genes in 20 regions (total of 55 genes) were enriched for rare exonic CNVs among BD cases, but none of these survives correction for multiple testing. Finally, our data provide strong support for the hypothesis of a lesser contribution of very large (>500 kb) CNVs in BD compared with SZ, most notably for deletions >1 Mb (P=9 × 10(-4)).
Thrombosis is a major cause of morbidity and mortality in Philadelphia chromosome-negative myeloproliferative neoplasms (MPNs), clonal disorders of hematopoiesis characterized by activated Janus ...kinase (JAK)-signal transducer and activator of transcription (STAT) signaling. Neutrophil extracellular trap (NET) formation, a component of innate immunity, has been linked to thrombosis. We demonstrate that neutrophils from patients with MPNs are primed for NET formation, an effect blunted by pharmacological inhibition of JAK signaling. Mice with conditional knock-in of
, the most common molecular driver of MPN, have an increased propensity for NET formation and thrombosis. Inhibition of JAK-STAT signaling with the clinically available JAK2 inhibitor ruxolitinib abrogated NET formation and reduced thrombosis in a deep vein stenosis murine model. We further show that expression of PAD4, a protein required for NET formation, is increased in
-expressing neutrophils and that PAD4 is required for
-driven NET formation and thrombosis in vivo. Finally, in a population study of more than 10,000 individuals without a known myeloid disorder,
-positive clonal hematopoiesis was associated with an increased incidence of thrombosis. In aggregate, our results link
expression to NET formation and thrombosis and suggest that JAK2 inhibition may reduce thrombosis in MPNs through cell-intrinsic effects on neutrophil function.
Disruptive, damaging ultra-rare variants in highly constrained genes are enriched in individuals with neurodevelopmental disorders. In the general population, this class of variants was associated ...with a decrease in years of education (YOE). This effect was stronger among highly brain-expressed genes and explained more YOE variance than pathogenic copy number variation but less than common variants. Disruptive, damaging ultra-rare variants in highly constrained genes influence the determinants of YOE in the general population.
Studies of copy-number variation and linkage disequilibrium (LD) have typically excluded complex regions of the genome that are rich in duplications and prone to rearrangement. In an attempt to ...assess the heritability and LD of copy-number polymorphisms (CNPs) in duplication-rich regions of the genome, we profiled copy-number variation in 130 putative “rearrangement hotspot regions” among 269 individuals of European, Yoruba, Chinese, and Japanese ancestry analyzed by the International HapMap Consortium. Eighty-four hotspot regions, corresponding to 257 bacterial artificial chromosome (BAC) probes, showed evidence of copy-number differences. Despite a predisposing genetic architecture, no polymorphism was ever observed in the remaining 46 “rearrangement hotspots,” and we suggest these represent excellent candidate sites for pathogenic rearrangements. We used a combination of BAC-based and high-density customized oligonucleotide arrays to resolve the molecular basis of structural rearrangements. For common variants (frequency >10%), we observed a distinct bias against copy-number losses, suggesting that deletions are subject to purifying selection. Heritability estimates did not differ significantly from 1.0 among the majority (30 of 34) of loci analyzed, consistent with normal Mendelian inheritance. Some of the CNPs in duplication-rich regions showed strong LD with nearby single-nucleotide polymorphisms (SNPs) and were observed to segregate on ancestral SNP haplotypes. However, LD with the best available SNP markers was weaker than has been reported for deletion polymorphisms in less complex regions of the genome. These observations may be accounted for by a low density of SNP data in duplicated regions, challenges in mapping and typing the CNPs, and the possibility that CNPs in these regions have rearranged on multiple haplotype backgrounds. Our results underscore the need for complete maps of genetic variation in duplication-rich regions of the genome.