Ethnic-specific differences in minor allele frequency impact variant categorization for genetic screening of nonsyndromic hearing loss (NSHL) and other genetic disorders. We sought to evaluate all ...previously reported pathogenic NSHL variants in the context of a large number of controls from ethnically distinct populations sequenced with orthogonal massively parallel sequencing methods. We used HGMD, ClinVar, and dbSNP to generate a comprehensive list of reported pathogenic NSHL variants and re-evaluated these variants in the context of 8,595 individuals from 12 populations and 6 ethnically distinct major human evolutionary phylogenetic groups from three sources (Exome Variant Server, 1000 Genomes project, and a control set of individuals created for this study, the OtoDB). Of the 2,197 reported pathogenic deafness variants, 325 (14.8%) were present in at least one of the 8,595 controls, indicating a minor allele frequency (MAF) >0.00006. MAFs ranged as high as 0.72, a level incompatible with pathogenicity for a fully penetrant disease like NSHL. Based on these data, we established MAF thresholds of 0.005 for autosomal-recessive variants (excluding specific variants in GJB2) and 0.0005 for autosomal-dominant variants. Using these thresholds, we recategorized 93 (4.2%) of reported pathogenic variants as benign. Our data show that evaluation of reported pathogenic deafness variants using variant MAFs from multiple distinct ethnicities and sequenced by orthogonal methods provides a powerful filter for determining pathogenicity. The proposed MAF thresholds will facilitate clinical interpretation of variants identified in genetic testing for NSHL. All data are publicly available to facilitate interpretation of genetic variants causing deafness.
Nucleosome organization is critical for gene regulation. In living cells this organization is determined by multiple factors, including the action of chromatin remodellers, competition with ...site-specific DNA-binding proteins, and the DNA sequence preferences of the nucleosomes themselves. However, it has been difficult to estimate the relative importance of each of these mechanisms in vivo, because in vivo nucleosome maps reflect the combined action of all influencing factors. Here we determine the importance of nucleosome DNA sequence preferences experimentally by measuring the genome-wide occupancy of nucleosomes assembled on purified yeast genomic DNA. The resulting map, in which nucleosome occupancy is governed only by the intrinsic sequence preferences of nucleosomes, is similar to in vivo nucleosome maps generated in three different growth conditions. In vitro, nucleosome depletion is evident at many transcription factor binding sites and around gene start and end sites, indicating that nucleosome depletion at these sites in vivo is partly encoded in the genome. We confirm these results with a micrococcal nuclease-independent experiment that measures the relative affinity of nucleosomes for ∼40,000 double-stranded 150-base-pair oligonucleotides. Using our in vitro data, we devise a computational model of nucleosome sequence preferences that is significantly correlated with in vivo nucleosome occupancy in Caenorhabditis elegans. Our results indicate that the intrinsic DNA sequence preferences of nucleosomes have a central role in determining the organization of nucleosomes in vivo.
Celotno besedilo
Dostopno za:
DOBA, IJS, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Misfolded ER proteins are retrotranslocated into the cytosol for degradation via the ubiquitin-proteasome system. The human cytomegalovirus protein US11 exploits this ER-associated protein ...degradation (ERAD) pathway to downregulate HLA class I molecules in virus-infected cells, thereby evading elimination by cytotoxic T-lymphocytes. US11-mediated degradation of HLA class I has been instrumental in the identification of key components of mammalian ERAD, including Derlin-1, p97, VIMP and SEL1L. Despite this, the process governing retrotranslocation of the substrate is still poorly understood. Here using a high-coverage genome-wide shRNA library, we identify the uncharacterized protein TMEM129 and the ubiquitin-conjugating E2 enzyme UBE2J2 to be essential for US11-mediated HLA class I downregulation. TMEM129 is an unconventional C4C4-type RING finger E3 ubiquitin ligase that resides within a complex containing various other ERAD components, including Derlin-1, Derlin-2, VIMP and p97, indicating that TMEM129 is an integral part of the ER-resident dislocation complex mediating US11-induced HLA class I degradation.
Immune responses targeting self-proteins (autoantigens) can lead to a variety of autoimmune diseases. Identification of these antigens is important for both diagnostic and therapeutic reasons. ...However, current approaches to characterize autoantigens have, in most cases, met only with limited success. Here we present a synthetic representation of the complete human proteome, the T7 peptidome phage display library (T7-Pep), and demonstrate its application to autoantigen discovery. T7-Pep is composed of >413,000 36-residue, overlapping peptides that cover all open reading frames in the human genome, and can be analyzed using high-throughput DNA sequencing. We developed a phage immunoprecipitation sequencing (PhIP-Seq) methodology to identify known and previously unreported autoantibodies contained in the spinal fluid of three individuals with paraneoplastic neurological syndromes. We also show how T7-Pep can be used more generally to identify peptide-protein interactions, suggesting the broader utility of our approach for proteomic research.
Current DNA methylation assays are limited in the flexibility and efficiency of characterizing a large number of genomic targets. We report a method to specifically capture an arbitrary subset of ...genomic targets for single-molecule bisulfite sequencing for digital quantification of DNA methylation at single-nucleotide resolution. A set of ~30,000 padlock probes was designed to assess methylation of ~66,000 CpG sites within 2,020 CpG islands on human chromosome 12, chromosome 20, and 34 selected regions. To investigate epigenetic differences associated with dedifferentiation, we compared methylation in three human fibroblast lines and eight human pluripotent stem cell lines. Chromosome-wide methylation patterns were similar among all lines studied, but cytosine methylation was slightly more prevalent in the pluripotent cells than in the fibroblasts. Induced pluripotent stem (iPS) cells appeared to display more methylation than embryonic stem cells. We found 288 regions methylated differently in fibroblasts and pluripotent cells. This targeted approach should be particularly useful for analyzing DNA methylation in large genomes.
High-throughput sequencing of targeted genomic loci in large populations is an effective approach for evaluating the contribution of rare variants to disease risk. We evaluated the feasibility of ...using in-solution hybridization-based target capture on pooled DNA samples to enable cost-efficient population sequencing studies. For this, we performed pooled sequencing of 100 HapMap samples across ∼ 600 kb of DNA sequence using the Illumina GAIIx. Using our accurate variant calling method for pooled sequence data, we were able to not only identify single nucleotide variants with a low false discovery rate (<1%) but also accurately detect short insertion/deletion variants. In addition, with sufficient coverage per individual in each pool (30-fold) we detected 97.2% of the total variants and 93.6% of variants below 5% in frequency. Finally, allele frequencies for single nucleotide variants (SNVs) estimated from the pooled data and the HapMap genotype data were tightly correlated (correlation coefficient > = 0.995).
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
We developed a digital RNA allelotyping method for quantitatively interrogating allele-specific gene expression. This method involves ultra-deep sequencing of padlock-captured single-nucleotide ...polymorphisms (SNPs) from the transcriptome. We characterized four cell lines established from two human subjects in the Personal Genome Project. Approximately 11-22% of the heterozygous mRNA-associated SNPs showed allele-specific expression in each cell line and 4.3-8.5% were tissue-specific, suggesting the presence of tissue-specific cis regulation. When we applied allelotyping to two pairs of sibling human embryonic stem cell lines, the sibling lines were more similar in allele-specific expression than were the genetically unrelated lines. We found that the variation of allelic ratios in gene expression among different cell lines was primarily explained by genetic variations, much more so than by specific tissue types or growth conditions. Comparison of expressed SNPs on the sense and antisense transcripts suggested that allelic ratios are primarily determined by cis-regulatory mechanisms on the sense transcripts.
A new generation of technologies is poised to reduce DNA sequencing costs by several orders of magnitude. But our ability to fully leverage the power of these technologies is crippled by the absence ...of suitable 'front-end' methods for isolating complex subsets of a mammalian genome at a scale that matches the throughput at which these platforms will routinely operate. We show that targeting oligonucleotides released from programmable microarrays can be used to capture and amplify approximately 10,000 human exons in a single multiplex reaction. Additionally, we show integration of this protocol with ultra-high-throughput sequencing for targeted variation discovery. Although the multiplex capture reaction is highly specific, we found that nonuniform capture is a key issue that will need to be resolved by additional optimization. We anticipate that highly multiplexed methods for targeted amplification will enable the comprehensive resequencing of human exons at a fraction of the cost of whole-genome resequencing.
To exploit fully the potential of current sequencing technologies for population-based studies, one must enrich for loci from the human genome. Here we evaluate the hybridization-based approach by ...using oligonucleotide capture probes in solution to enrich for approximately 3.9 Mb of sequence target. We demonstrate that the tiling probe frequency is important for generating sequence data with high uniform coverage of targets. We obtained 93% sensitivity to detect SNPs, with a calling accuracy greater than 99%.
Non-syndromic hearing loss (NSHL) is the most common sensory impairment in humans. Until recently its extreme genetic heterogeneity precluded comprehensive genetic testing. Using a platform that ...couples targeted genomic enrichment (TGE) and massively parallel sequencing (MPS) to sequence all exons of all genes implicated in NSHL, we tested 100 persons with presumed genetic NSHL and in so doing established sequencing requirements for maximum sensitivity and defined MPS quality score metrics that obviate Sanger validation of variants.
We examined DNA from 100 sequentially collected probands with presumed genetic NSHL without exclusions due to inheritance, previous genetic testing, or type of hearing loss. We performed TGE using post-capture multiplexing in variable pool sizes followed by Illumina sequencing. We developed a local Galaxy installation on a high performance computing cluster for bioinformatics analysis.
To obtain maximum variant sensitivity with this platform 3.2-6.3 million total mapped sequencing reads per sample were required. Quality score analysis showed that Sanger validation was not required for 95% of variants. Our overall diagnostic rate was 42%, but this varied by clinical features from 0% for persons with asymmetric hearing loss to 56% for persons with bilateral autosomal recessive NSHL.
These findings will direct the use of TGE and MPS strategies for genetic diagnosis for NSHL. Our diagnostic rate highlights the need for further research on genetic deafness focused on novel gene identification and an improved understanding of the role of non-exonic mutations. The unsolved families we have identified provide a valuable resource to address these areas.