The deamination of adenosine to inosine at the wobble position of tRNA is an essential post-transcriptional RNA modification required for wobble decoding in bacteria and eukaryotes. In humans, the ...wobble inosine modification is catalyzed by the heterodimeric ADAT2/3 complex. Here, we describe novel pathogenic ADAT3 variants impairing adenosine deaminase activity through a distinct mechanism that can be corrected through expression of the heterodimeric ADAT2 subunit. The variants were identified in a family in which all three siblings exhibit intellectual disability linked to biallelic variants in the
locus. The biallelic ADAT3 variants result in a missense variant converting alanine to valine at a conserved residue or the introduction of a premature stop codon in the deaminase domain. Fibroblast cells derived from two ID-affected individuals exhibit a reduction in tRNA wobble inosine levels and severely diminished adenosine tRNA deaminase activity. Notably, the ADAT3 variants exhibit impaired interaction with the ADAT2 subunit and alterations in ADAT2-dependent nuclear localization. Based upon these findings, we find that tRNA adenosine deaminase activity and wobble inosine modification can be rescued in patient cells by overexpression of the ADAT2 catalytic subunit. These results uncover a key role for the inactive ADAT3 deaminase domain in proper assembly with ADAT2 and demonstrate that ADAT2/3 nuclear import is required for maintaining proper levels of the wobble inosine modification in tRNA.
DNA copy number variations (CNVs) are a significant and ubiquitous source of inherited human genetic variation. However, the importance of CNVs to cancer susceptibility and tumor progression has not ...yet been explored. Li-Fraumeni syndrome (LFS) is an autosomal dominantly inherited disorder characterized by a strikingly increased risk of early-onset breast cancer, sarcomas, brain tumors and other neoplasms in individuals harboring germline TP53 mutations. Known genetic determinants of LFS do not fully explain the variable clinical phenotype in affected family members. As part of a wider study of CNVs and cancer, we conducted a genome-wide profile of germline CNVs in LFS families. Here, by examining DNA from a large healthy population and an LFS cohort using high-density oligonucleotide arrays, we show that the number of CNVs per genome is well conserved in the healthy population, but strikingly enriched in these cancer-prone individuals. We found a highly significant increase in CNVs among carriers of germline TP53 mutations with a familial cancer history. Furthermore, we identified a remarkable number of genomic regions in which known cancer-related genes coincide with CNVs, in both LFS families and healthy individuals. Germline CNVs may provide a foundation that enables the more dramatic chromosomal changes characteristic of TP53-related tumors to be established. Our results suggest that screening families predisposed to cancer for CNVs may identify individuals with an abnormally high number of these events.
Several genomes have now been sequenced, with millions of genetic variants annotated. While significant progress has been made in mapping single nucleotide polymorphisms (SNPs) and small (<10 bp) ...insertion/deletions (indels), the annotation of larger structural variants has been less comprehensive. It is still unclear to what extent a typical genome differs from the reference assembly, and the analysis of the genomes sequenced to date have shown varying results for copy number variation (CNV) and inversions.
We have combined computational re-analysis of existing whole genome sequence data with novel microarray-based analysis, and detect 12,178 structural variants covering 40.6 Mb that were not reported in the initial sequencing of the first published personal genome. We estimate a total non-SNP variation content of 48.8 Mb in a single genome. Our results indicate that this genome differs from the consensus reference sequence by approximately 1.2% when considering indels/CNVs, 0.1% by SNPs and approximately 0.3% by inversions. The structural variants impact 4,867 genes, and >24% of structural variants would not be imputed by SNP-association.
Our results indicate that a large number of structural variants have been unreported in the individual genomes published to date. This significant extent and complexity of structural variants, as well as the growing recognition of their medical relevance, necessitate they be actively studied in health-related analyses of personal genomes. The new catalogue of structural variants generated for this genome provides a crucial resource for future comparison studies.
There are 3 major sweat-producing glands present in skin; eccrine, apocrine, and apoeccrine glands. Due to the high rate of secretion, eccrine sweating is a vital regulator of body temperature in ...response to thermal stress in humans; therefore, an inability to sweat (anhidrosis) results in heat intolerance that may cause impaired consciousness and death. Here, we have reported 5 members of a consanguineous family with generalized, isolated anhidrosis, but morphologically normal eccrine sweat glands. Whole-genome analysis identified the presence of a homozygous missense mutation in ITPR2, which encodes the type 2 inositol 1,4,5-trisphosphate receptor (InsP3R2), that was present in all affected family members. We determined that the mutation is localized within the pore forming region of InsP3R2 and abrogates Ca2+ release from the endoplasmic reticulum, which suggests that intracellular Ca2+ release by InsP3R2 in clear cells of the sweat glands is important for eccrine sweat production. Itpr2-/- mice exhibited a marked reduction in sweat secretion, and evaluation of sweat glands from Itpr2-/- animals revealed a decrease in Ca2+ response compared with controls. Together, our data indicate that loss of InsP3R2-mediated Ca2+ release causes isolated anhidrosis in humans and suggest that specific InsP3R inhibitors have the potential to reduce sweat production in hyperhidrosis.
Multiple genetic studies have linked copy number variation (CNV) in different genes to body mass index (BMI) and obesity. A CNV on chromosome 10q11.22 has been associated with body weight. This CNV ...region spans NPY4R, the gene encoding the pancreatic polypeptide receptor Y4, which has been described as a satiety-stimulating receptor. We have investigated CNV of the NPY4R gene and analysed its relationship to BMI, waist circumference and self-reported dietary intake from 558 individuals (216 men and 342 women) representing a wide BMI range. The copy number for NPY4R ranged from 2 to 8 copies (average 4.6±0.8). Rather than the expected negative correlation, we observed a positive correlation between NPY4R copy number and BMI as well as waist circumference (r = 0.267, p = 2.65×10-7 and r = 0.256, p = 8×10-7, respectively). Each additional copy of NPY4R correlated with 2.6 kg/m2 increase in BMI and 5.67 cm increase in waist circumference (p = 3.3×10-7 and p = 1×10-6, respectively) for women. For men, there was no statistically significant correlation between CNV and BMI. Our results suggest that NPY4R genetic variation influences body weight in women, but the exact role of this receptor appears to be more complex than previously proposed.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
The near completeness of human chromosome sequences is facilitating accurate characterization and assessment of all classes of genomic variation. Particularly, using the DNA reference sequence as a ...guide, genome scanning technologies, such as microarray-based comparative genomic hybridization (array CGH) and genome-wide single nucleotide polymorphism (SNP) platforms, have now enabled the detection of a previously unrecognized degree of larger-sized (non-SNP) variability in all genomes. This heterogeneity can include copy number variations (CNVs), inversions, insertions, deletions and other complex rearrangements, most of which are not detected by standard cytogenetics or DNA sequencing. Although these genomic alterations (collectively termed structural variants or polymorphisms) have been described previously, mainly through locus-specific studies, they are now known to be more global in occurrence. Moreover, as just one example, CNVs can contain entire genes and their number can correlate with the level of gene expression. It is also plausible that structural variants may commonly influence nearby genes through chromosomal positional or domain effects. Here, we discuss what is known of the prevalence of structural variants in the human genome and how they might influence phenotype, including the continuum of etiologic events underlying monogenic to complex diseases. Particularly, we highlight the newest studies and some classic examples of how structural variants might have adverse genetic consequences. We also discuss why analysis of structural variants should become a vital step in any genetic study going forward. All these progresses have set the stage for a golden era of combined microscopic and sub-microscopic (cytogenomic)-based research of chromosomes leading to a more complete understanding of the human genome.
Long-read single molecule sequencing is increasingly used in human genomics research, as it allows to accurately detect large-scale DNA rearrangements such as structural variations (SVs) at high ...resolution. However, few studies have evaluated the performance of different single molecule sequencing platforms for SV detection in human samples. Here we performed Oxford Nanopore Technologies (ONT) whole-genome sequencing of two Swedish human samples (average 32× coverage) and compared the results to previously generated Pacific Biosciences (PacBio) data for the same individuals (average 66× coverage). Our analysis inferred an average of 17k and 23k SVs from the ONT and PacBio data, respectively, with a majority of them overlapping with an available multi-platform SV dataset. When comparing the SV calls in the two Swedish individuals, we find a higher concordance between ONT and PacBio SVs detected in the same individual as compared to SVs detected by the same technology in different individuals. Downsampling of PacBio reads, performed to obtain similar coverage levels for all datasets, resulted in 17k SVs per individual and improved overlap with the ONT SVs. Our results suggest that ONT and PacBio have a similar performance for SV detection in human whole genome sequencing data, and that both technologies are feasible for population-scale studies.
Oesophageal atresia (OA) is a life-threatening developmental defect characterized by a lost continuity between the upper and lower oesophagus. The most common form is a distal connection between the ...trachea and the oesophagus, i.e. a tracheoesophageal fistula (TEF). The condition may be part of a syndrome or occurs as an isolated feature. The recurrence risk in affected families is increased compared to the population-based incidence suggesting contributing genetic factors.
To gain insight into gene variants and genes associated with isolated OA we conducted whole genome sequencing on samples from three families with recurrent cases affected by congenital and isolated TEF.
We identified a combination of single nucleotide variants (SNVs), splice site variants (SSV) and structural variants (SV) annotated to altogether 100 coding genes in the six affected individuals.
This study highlights rare SVs among candidate gene variants in our individuals with OA and provides a gene framework for further investigations of genetic factors behind this malformation.
Autism spectrum disorder (ASD) is a heterogeneous neuropsychiatric disorder with a complex genetic background. Analysis of altered molecular processes in ASD patients requires linear and nonlinear ...methods that provide interpretable solutions. Interpretable machine learning provides legible models that allow explaining biological mechanisms and support analysis of clinical subgroups. In this work, we investigated several case-control studies of gene expression measurements of ASD individuals. We constructed a rule-based learning model from three independent datasets that we further visualized as a nonlinear gene-gene co-predictive network. To find dissimilarities between ASD subtypes, we scrutinized a topological structure of the network and estimated a centrality distance. Our analysis revealed that autism is the most severe subtype of ASD, while pervasive developmental disorder-not otherwise specified and Asperger syndrome are closely related and milder ASD subtypes. Furthermore, we analyzed the most important ASD-related features that were described in terms of gene co-predictors. Among others, we found a strong co-predictive mechanism between
and
, which may suggest a co-regulation between these genes. The present study demonstrates the potential of applying interpretable machine learning in bioinformatics analyses. Although the proposed methodology was designed for transcriptomics data, it can be applied to other omics disciplines.
The aim of this data paper is to describe a collection of 33 genomic, transcriptomic and epigenomic sequencing datasets of the B-cell acute lymphoblastic leukemia (ALL) cell line REH. REH is one of ...the most frequently used cell lines for functional studies of pediatric ALL, and these data provide a multi-faceted characterization of its molecular features. The datasets described herein, generated with short- and long-read sequencing technologies, can both provide insights into the complex aberrant karyotype of REH, and be used as reference datasets for sequencing data quality assessment or for methods development.