Short telomere syndromes manifest as familial idiopathic pulmonary fibrosis; they are the most common premature aging disorders. We used genome-wide linkage to identify heterozygous loss of function ...of
, a zinc-knuckle containing protein, as a cause of autosomal dominant pulmonary fibrosis. ZCCHC8 associated with
and was required for telomerase function. In ZCCHC8 knockout cells and in mutation carriers, genomically extended telomerase RNA (
) accumulated at the expense of mature
, consistent with a role for ZCCHC8 in mediating
3' end targeting to the nuclear RNA exosome. We generated
-null mice and found that heterozygotes, similar to human mutation carriers, had
insufficiency but an otherwise preserved transcriptome. In contrast,
mice developed progressive and fatal neurodevelopmental pathology with features of a ciliopathy. The
brain transcriptome was highly dysregulated, showing accumulation and 3' end misprocessing of other low-abundance RNAs, including those encoding cilia components as well as the intronless replication-dependent histones. Our data identify a novel cause of human short telomere syndromes-familial pulmonary fibrosis and uncover nuclear exosome targeting as an essential 3' end maturation mechanism that vertebrate
shares with replication-dependent histones.
Lung disease is the major cause of morbidity and mortality in persons with cystic fibrosis (pwCF). Variability in CF lung disease has substantial non-CFTR (CF transmembrane conductance regulator) ...genetic influence. Identification of genetic modifiers has prognostic and therapeutic importance.
Identify genetic modifier loci and genes/pathways associated with pulmonary disease severity.
Whole-genome sequencing data on 4,248 unique pwCF with pancreatic insufficiency and lung function measures were combined with imputed genotypes from an additional 3,592 patients with pancreatic insufficiency from the United States, Canada, and France. This report describes association of approximately 15.9 million SNPs using the quantitative Kulich normal residual mortality-adjusted (KNoRMA) lung disease phenotype in 7,840 pwCF using premodulator lung function data.
Testing included common and rare SNPs, transcriptome-wide association, gene-level, and pathway analyses. Pathway analyses identified novel associations with genes that have key roles in organ development, and we hypothesize that these genes may relate to dysanapsis and/or variability in lung repair. Results confirmed and extended previous genome-wide association study findings. These whole-genome sequencing data provide finely mapped genetic information to support mechanistic studies. No novel primary associations with common single variants or rare variants were found. Multilocus effects at chr5p13 (
) and chr11p13 (
) were identified. Variant effect size estimates at associated loci were consistently ordered across the cohorts, indicating possible age or birth cohort effects.
This premodulator genomic, transcriptomic, and pathway association study of 7,840 pwCF will facilitate mechanistic and postmodulator genetic studies and the development of novel therapeutics for CF lung disease.
Five genes have been identified that contribute to Mendelian forms of Parkinson disease (PD); however, mutations have been found in fewer than 5% of patients, suggesting that additional genes ...contribute to disease risk. Unlike previous studies that focused primarily on sporadic PD, we have performed the first genomewide association study (GWAS) in familial PD. Genotyping was performed with the Illumina HumanCNV370Duo array in 857 familial PD cases and 867 controls. A logistic model was employed to test for association under additive and recessive modes of inheritance after adjusting for gender and age. No result met genomewide significance based on a conservative Bonferroni correction. The strongest association result was with SNPs in the GAK/DGKQ region on chromosome 4 (additive model: p = 3.4 x 10⁻⁶; OR = 1.69). Consistent evidence of association was also observed to the chromosomal regions containing SNCA (additive model: p = 5.5 x 10⁻⁵; OR = 1.35) and MAPT (recessive model: p = 2.0 x 10⁻⁵; OR = 0.56). Both of these genes have been implicated previously in PD susceptibility; however, neither was identified in previous GWAS studies of PD. Meta-analysis was performed using data from a previous case-control GWAS, and yielded improved p values for several regions, including GAK/DGKQ (additive model: p = 2.5 x 10⁻⁷) and the MAPT region (recessive model: p = 9.8 x 10⁻⁶; additive model: p = 4.8 x 10⁻⁵). These data suggest the identification of new susceptibility alleles for PD in the GAK/DGKQ region, and also provide further support for the role of SNCA and MAPT in PD susceptibility.
We estimate and partition genetic variation for height, body mass index (BMI), von Willebrand factor and QT interval (QTi) using 586,898 SNPs genotyped on 11,586 unrelated individuals. We estimate ...that ∼45%, ∼17%, ∼25% and ∼21% of the variance in height, BMI, von Willebrand factor and QTi, respectively, can be explained by all autosomal SNPs and a further ∼0.5-1% can be explained by X chromosome SNPs. We show that the variance explained by each chromosome is proportional to its length, and that SNPs in or near genes explain more variation than SNPs between genes. We propose a new approach to estimate variation due to cryptic relatedness and population stratification. Our results provide further evidence that a substantial proportion of heritability is captured by common SNPs, that height, BMI and QTi are highly polygenic traits, and that the additive variation explained by a part of the genome is approximately proportional to the total length of DNA contained within genes therein.
Megacystis-microcolon-intestinal hypoperistalsis syndrome (MMIHS) is a severe disease characterized by functional obstruction in the urinary and gastrointestinal tract. The molecular basis of this ...condition started to be defined recently, and the genes related to the syndrome (ACTG2-heterozygous variant in sporadic cases; and MYH11 (myosin heavy chain 11), LMOD1 (leiomodin 1) and MYLK (myosin light chain (MLC) kinase)-autosomal recessive inheritance), encode proteins involved in the smooth muscle contraction, supporting a myopathic basis for the disease. In the present article, we described a family with two affected siblings with MMIHS born to consanguineous parents and the molecular investigation performed to define the genetic etiology. Previous whole exome sequencing of the affected child and parents did not identify a candidate gene for the disease in this family, but now we present a reanalysis of the data that led to the identification of a homozygous deletion encompassing the last exon of MYL9 (myosin regulatory light chain 9) in the affected individual. MYL9 gene encodes a regulatory myosin MLC and the phosphorylation of this protein is a crucial step in the contraction process of smooth muscle cell. Despite the absence of human or animal phenotype related to MYL9, a cause-effect relationship between MYL9 and the MMIHS seems biologically plausible. The present study reveals a strong candidate gene for autosomal recessive forms of MMIHS, expanding the molecular basis of this disease and reinforces the myopathic basis of this condition.
Cystic fibrosis (CF) is a severe genetic disorder that can cause multiple comorbidities affecting the lungs, the pancreas, the luminal digestive system and beyond. In our previous genome-wide ...association studies (GWAS), we genotyped approximately 8,000 CF samples using a mixture of different genotyping platforms. More recently, the Cystic Fibrosis Genome Project (CFGP) performed deep (approximately 30×) whole genome sequencing (WGS) of 5,095 samples to better understand the genetic mechanisms underlying clinical heterogeneity among patients with CF. For mixtures of GWAS array and WGS data, genotype imputation has proven effective in increasing effective sample size. Therefore, we first performed imputation for the approximately 8,000 CF samples with GWAS array genotype using the Trans-Omics for Precision Medicine (TOPMed) freeze 8 reference panel. Our results demonstrate that TOPMed can provide high-quality imputation for patients with CF, boosting genomic coverage from approximately 0.3–4.2 million genotyped markers to approximately 11–43 million well-imputed markers, and significantly improving polygenic risk score (PRS) prediction accuracy. Furthermore, we built a CF-specific CFGP reference panel based on WGS data of patients with CF. We demonstrate that despite having approximately 3% the sample size of TOPMed, our CFGP reference panel can still outperform TOPMed when imputing some CF disease-causing variants, likely owing to allele and haplotype differences between patients with CF and general populations. We anticipate our imputed data for 4,656 samples without WGS data will benefit our subsequent genetic association studies, and the CFGP reference panel built from CF WGS samples will benefit other investigators studying CF.
We assessed the performance of two reference panels for genotype imputation among patients with cystic fibrosis: the TOPMed freeze 8 panel (n = 97,256) and a CF-specific reference panel (n = 2,850) we constructed from whole genome sequencing data from the Cystic Fibrosis Genome Project.
In March 2013 the American College of Medical Genetics and Genomics published a list of 56 genes with the recommendation that pathogenic and likely pathogenic variants detected incidentally by ...clinical sequencing be reported to patients. As an initial step in determining the practical consequences of this recommendation in the research setting, we searched for variants in these genes in 232 whole-exome sequences from the Baylor-Hopkins Center for Mendelian Genomics.
We identified rare, nonsynonymous, and splicing single-nucleotide variants and insertions/deletions and assessed variant classification using the Human Gene Mutation, Emory, and ClinVar databases. We analyzed the burden of mutation in each of the 56 genes and determined which variants should be reported to patients.
Our filtering resulted in 249 distinct variants, with a mean of 1.69 variants per individual. Half of these were novel missense mutations not classified by any of the three reference databases. Of 101 variants listed in the Human Gene Mutation Database, 48 were also in ClinVar and 3 were also in Emory; half of these shared variants were classified discordantly between databases. Some genes consistently had greater variation than others. In total, 0.86% of individuals had a reportable incidental variant.
These observations demonstrate some current challenges of assessing phenotypic consequences of incidental variants for counseling patients.
Identifying genetic variants that influence human height will advance our understanding of skeletal growth and development. Several rare genetic variants have been convincingly and reproducibly ...associated with height in mendelian syndromes, and common variants in the transcription factor gene HMGA2 are associated with variation in height in the general population. Here we report genome-wide association analyses, using genotyped and imputed markers, of 6,669 individuals from Finland and Sardinia, and follow-up analyses in an additional 28,801 individuals. We show that common variants in the osteoarthritis-associated locus GDF5-UQCC contribute to variation in height with an estimated additive effect of 0.44 cm (overall P < 10−15). Our results indicate that there may be a link between the genetic basis of height and osteoarthritis, potentially mediated through alterations in bone growth and development.