We report genome sequences of 17 inbred strains of laboratory mice and identify almost ten times more variants than previously known. We use these genomes to explore the phylogenetic history of the ...laboratory mouse and to examine the functional consequences of allele-specific variation on transcript abundance, revealing that at least 12% of transcripts show a significant tissue-specific expression bias. By identifying candidate functional variants at 718 quantitative trait loci we show that the molecular nature of functional variants and their position relative to genes vary according to the effect size of the locus. These sequences provide a starting point for a new era in the functional analysis of a key model organism.
Structural variation is widespread in mammalian genomes and is an important cause of disease, but just how abundant and important structural variants (SVs) are in shaping phenotypic variation remains ...unclear. Without knowing how many SVs there are, and how they arise, it is difficult to discover what they do. Combining experimental with automated analyses, we identified 711,920 SVs at 281,243 sites in the genomes of thirteen classical and four wild-derived inbred mouse strains. The majority of SVs are less than 1 kilobase in size and 98% are deletions or insertions. The breakpoints of 160,000 SVs were mapped to base pair resolution, allowing us to infer that insertion of retrotransposons causes more than half of SVs. Yet, despite their prevalence, SVs are less likely than other sequence variants to cause gene expression or quantitative phenotypic variation. We identified 24 SVs that disrupt coding exons, acting as rare variants of large effect on gene function. One-third of the genes so affected have immunological functions.
Spinal muscular atrophy (SMA) is a leading genetic cause of infant mortality. The advent of approved treatments for this devastating condition has significantly changed SMA patients' life expectancy ...and quality of life. Nevertheless, these are not without limitations, and research efforts are underway to develop new approaches for improved and long-lasting benefits for patients. Protein arginine methyltransferases (PRMTs) are emerging as druggable epigenetic targets, with several small-molecule PRMT inhibitors already in clinical trials. From a screen of epigenetic molecules, we have identified MS023, a potent and selective type I PRMT inhibitor able to promote SMN2 exon 7 inclusion in preclinical SMA models. Treatment of SMA mice with MS023 results in amelioration of the disease phenotype, with strong synergistic amplification of the positive effect when delivered in combination with the antisense oligonucleotide nusinersen. Moreover, transcriptomic analysis revealed that MS023 treatment has minimal off-target effects, and the added benefit is mainly due to targeting neuroinflammation. Our study warrants further clinical investigation of PRMT inhibition both as a stand-alone and add-on therapy for SMA.
The discovery of genetic variants influencing sleep patterns can shed light on the physiological processes underlying sleep. As part of a large clinical sequencing project, WGS500, we sequenced a ...family in which the two male children had severe developmental delay and a dramatically disturbed sleep-wake cycle, with very long wake and sleep durations, reaching up to 106-h awake and 48-h asleep. The most likely causal variant identified was a novel missense variant in the X-linked GRIA3 gene, which has been implicated in intellectual disability. GRIA3 encodes GluA3, a subunit of AMPA-type ionotropic glutamate receptors (AMPARs). The mutation (A653T) falls within the highly conserved transmembrane domain of the ion channel gate, immediately adjacent to the analogous residue in the Grid2 (glutamate receptor) gene, which is mutated in the mouse neurobehavioral mutant, Lurcher. In vitro, the GRIA3(A653T) mutation stabilizes the channel in a closed conformation, in contrast to Lurcher. We introduced the orthologous mutation into a mouse strain by CRISPR-Cas9 mutagenesis and found that hemizygous mutants displayed significant differences in the structure of their activity and sleep compared to wild-type littermates. Typically, mice are polyphasic, exhibiting multiple sleep bouts of sleep several minutes long within a 24-h period. The Gria3A653T mouse showed significantly fewer brief bouts of activity and sleep than the wild-types. Furthermore, Gria3A653T mice showed enhanced period lengthening under constant light compared to wild-type mice, suggesting an increased sensitivity to light. Our results suggest a role for GluA3 channel activity in the regulation of sleep behavior in both mice and humans.
Genome-wide association studies using commercially available outbred mice can detect genes involved in phenotypes of biomedical interest. Useful populations need high-frequency alleles to ensure high ...power to detect quantitative trait loci (QTLs), low linkage disequilibrium between markers to obtain accurate mapping resolution, and an absence of population structure to prevent false positive associations. We surveyed 66 colonies for inbreeding, genetic diversity, and linkage disequilibrium, and we demonstrate that some have haplotype blocks of less than 100 Kb, enabling gene-level mapping resolution. The same alleles contribute to variation in different colonies, so that when mapping progress stalls in one, another can be used in its stead. Colonies are genetically diverse: 45% of the total genetic variation is attributable to differences between colonies. However, quantitative differences in allele frequencies, rather than the existence of private alleles, are responsible for these population differences. The colonies derive from a limited pool of ancestral haplotypes resembling those found in inbred strains: over 95% of sequence variants segregating in outbred populations are found in inbred strains. Consequently it is possible to impute the sequence of any mouse from a dense SNP map combined with inbred strain sequence data, which opens up the possibility of cataloguing and testing all variants for association, a situation that has so far eluded studies in completely outbred populations. We demonstrate the colonies' potential by identifying a deletion in the promoter of H2-Ea as the molecular change that strongly contributes to setting the ratio of CD4+ and CD8+ lymphocytes.
Extracellular small RNAs (sRNAs), including microRNAs (miRNAs), are promising biomarkers for diseases such as Duchenne muscular dystrophy (DMD), although their biological relevance is largely ...unknown. To investigate the relationship between intracellular and extracellular sRNA levels on a global scale, we performed sRNA sequencing in four muscle types and serum from wild-type, dystrophic mdx, and mdx mice in which dystrophin protein expression was restored by exon skipping. Differentially abundant sRNAs were identified in serum (mapping to miRNA, small nuclear RNA snRNA, and PIWI-interacting RNA piRNA loci). One novel candidate biomarker, miR-483, was increased in both mdx serum and muscle, and also elevated in DMD patient sera. Dystrophin restoration induced global shifts in miRNA (including miR-483) and snRNA-fragment abundance toward wild-type levels. Specific serum piRNA-like sRNAs also responded to exon skipping therapy. Absolute miRNA expression in muscle was positively correlated with abundance in the circulation, although multiple highly expressed miRNAs in muscle were not elevated in mdx serum, suggesting that both passive and selective release mechanisms contribute to serum miRNA levels. In conclusion, this study has revealed new insights into the sRNA biology of dystrophin deficiency and identified novel DMD biomarkers.
Background
Duchenne muscular dystrophy (DMD) is a fatal muscle‐wasting disorder caused by genetic loss of dystrophin protein. Extracellular microRNAs (ex‐miRNAs) are putative, minimally invasive ...biomarkers of DMD. Specific ex‐miRNAs (e.g. miR‐1, miR‐133a, miR‐206, and miR‐483) are highly up‐regulated in the serum of DMD patients and dystrophic animal models and are restored to wild‐type levels following exon skipping‐mediated dystrophin rescue in mdx mice. As such, ex‐miRNAs are promising pharmacodynamic biomarkers of exon skipping efficacy. Here, we aimed to determine the degree to which ex‐miRNA levels reflect the underlying level of dystrophin protein expression in dystrophic muscle.
Methods
Candidate ex‐miRNA biomarker levels were investigated in mdx mice in which dystrophin was restored with peptide‐PMO (PPMO) exon skipping conjugates and in mdx‐XistΔhs mice that express variable amounts of dystrophin from birth as a consequence of skewed X‐chromosome inactivation. miRNA profiling was performed in mdx‐XistΔhs mice using the FirePlex methodology and key results validated by small RNA TaqMan RT‐qPCR. The muscles from each animal model were further characterized by dystrophin western blot and immunofluorescence staining.
Results
The restoration of ex‐myomiR abundance observed following PPMO treatment was not recapitulated in the high dystrophin‐expressing mdx‐XistΔhs group, despite these animals expressing similar amounts of total dystrophin protein (~37% of wild‐type levels). Instead, ex‐miRNAs were present at high levels in mdx‐XistΔhs mice regardless of dystrophin expression. PPMO‐treated muscles exhibited a uniform pattern of dystrophin localization and were devoid of regenerating fibres, whereas mdx‐XistΔhs muscles showed non‐homogeneous dystrophin staining and sporadic regenerating foci.
Conclusions
Uniform dystrophin expression is required to prevent ex‐miRNA release, stabilize myofiber turnover, and attenuate pathology in dystrophic muscle.
Array comparative genomic hybridization (aCGH) to detect copy number variants (CNVs) in mammalian genomes has led to a growing awareness of the potential importance of this category of sequence ...variation as a cause of phenotypic variation. Yet there are large discrepancies between studies, so that the extent of the genome affected by CNVs is unknown. We combined molecular and aCGH analyses of CNVs in inbred mouse strains to investigate this question.
Using a 2.1 million probe array we identified 1,477 deletions and 499 gains in 7 inbred mouse strains. Molecular characterization indicated that approximately one third of the CNVs detected by the array were false positives and we estimate the false negative rate to be more than 50%. We show that low concordance between studies is largely due to the molecular nature of CNVs, many of which consist of a series of smaller deletions and gains interspersed by regions where the DNA copy number is normal.
Our results indicate that CNVs detected by arrays may be the coincidental co-localization of smaller CNVs, whose presence is more likely to perturb an aCGH hybridization profile than the effect of an isolated, small, copy number alteration. Our findings help explain the hitherto unexplored discrepancies between array-based studies of copy number variation in the mouse genome.
Background Exploiting synteny between mouse and human disease loci has been proposed as a cost-effective method for the identification of human susceptibility genes. Here we explore its utility in an ...analysis of a human personality trait, neuroticism, which can be modeled in mice by tests of emotionality. We investigated a mouse emotionality locus on chromosome 1 that contains no annotated genes but abuts four regulators of G protein signaling, one of which ( rgs 2) has been previously identified as a quantitative trait gene for emotionality. This locus is syntenic with a human region that has been consistently implicated in the genetic aetiology of neuroticism. Methods The functional candidacy of 29 murine sequence variants was tested by a combination of gel shift and transient transfection assays. Murine sequences that contained functional variants and exhibited significant cross-species conservation were prioritized for investigation in humans. Genetic association with neuroticism was tested in 1869 high and 2032 low unrelated individuals scored for neuroticism, selected from the extremes of 88,141 people from southwest England. Results Fifteen sequence variants contributed to variation in the expression of rgs18 , the gene lying at the edge of the quantitative trait loci (QTL) interval. There was no evidence of association between neuroticism and single nucleotide polymorphisms (SNPs) lying in the human regions homologous to those of mouse functional variants. One SNP, rs6428058, in a region of sequence conservation 644 kb upstream of RGS18, showed significant association ( p = .000631). Conclusions It is unlikely that a single variant is responsible for the mouse emotionality locus on chromosome 1. This level of underlying genetic complexity means that although cross-species QTL concordance may be invaluable for the identification of human disease loci, it is unlikely to be as informative in the identification of human disease-causing variants.
Two bottlenecks impeding the genetic analysis of complex traits in rodents are access to mapping populations able to deliver gene-level mapping resolution and the need for population-specific ...genotyping arrays and haplotype reference panels. Here we combine low-coverage (0.15×) sequencing with a new method to impute the ancestral haplotype space in 1,887 commercially available outbred mice. We mapped 156 unique quantitative trait loci for 92 phenotypes at a 5% false discovery rate. Gene-level mapping resolution was achieved at about one-fifth of the loci, implicating Unc13c and Pgc1a at loci for the quality of sleep, Adarb2 for home cage activity, Rtkn2 for intensity of reaction to startle, Bmp2 for wound healing, Il15 and Id2 for several T cell measures and Prkca for bone mineral content. These findings have implications for diverse areas of mammalian biology and demonstrate how genome-wide association studies can be extended via low-coverage sequencing to species with highly recombinant outbred populations.