ABSTRACT
Autosomal dominant nonsyndromic hearing loss (ADNSHL) is a common and often progressive sensory deficit. ADNSHL displays a high degree of genetic heterogeneity and varying rates of ...progression. Accurate, comprehensive, and cost‐effective genetic testing facilitates genetic counseling and provides valuable prognostic information to affected individuals. In this article, we describe the algorithm underlying AudioGene, a software system employing machine‐learning techniques that utilizes phenotypic information derived from audiograms to predict the genetic cause of hearing loss in persons segregating ADNSHL. Our data show that AudioGene has an accuracy of 68% in predicting the causative gene within its top three predictions, as compared with 44% for a majority classifier. We also show that AudioGene remains effective for audiograms with high levels of clinical measurement noise. We identify audiometric outliers for each genetic locus and hypothesize that outliers may reflect modifying genetic effects. As personalized genomic medicine becomes more common, AudioGene will be increasingly useful as a phenotypic filter to assess pathogenicity of variants identified by massively parallel sequencing.
This article describes an accurate method for the diagnosis of mutations in genes, with pseudogene copies, at the single cell level for preimplantation genetic diagnostic purposes. The method is sensitive enough to detect germline mosaicism in single gametes.
Dinoflagellates are important marine primary producers and grazers and cause toxic "red tides". These taxa are characterized by many unique features such as immense genomes, the absence of ...nucleosomes, and photosynthetic organelles (plastids) that have been gained and lost multiple times. We generated EST sequences from non-normalized and normalized cDNA libraries from a culture of the toxic species Alexandrium tamarense to elucidate dinoflagellate evolution. Previous analyses of these data have clarified plastid origin and here we study the gene content, annotate the ESTs, and analyze the genes that are putatively involved in DNA packaging.
Approximately 20% of the 6,723 unique (11,171 total 3'-reads) ESTs data could be annotated using Blast searches against GenBank. Several putative dinoflagellate-specific mRNAs were identified, including one novel plastid protein. Dinoflagellate genes, similar to other eukaryotes, have a high GC-content that is reflected in the amino acid codon usage. Highly represented transcripts include histone-like (HLP) and luciferin binding proteins and several genes occur in families that encode nearly identical proteins. We also identified rare transcripts encoding a predicted protein highly similar to histone H2A.X. We speculate this histone may be retained for its role in DNA double-strand break repair.
This is the most extensive collection to date of ESTs from a toxic dinoflagellate. These data will be instrumental to future research to understand the unique and complex cell biology of these organisms and for potentially identifying the genes involved in toxin production.
Dinoflagellate algae are important primary producers and of significant ecological and economic impact because of their ability to form “red tides”1. They are also models for evolutionary research ...because of an unparalleled ability to capture photosynthetic organelles (plastids) through endosymbiosis 2. The nature and extent of the plastid genome in the dominant perdinin-containing dinoflagellates remain, however, two of the most intriguing issues in plastid evolution. The plastid genome in these taxa is reduced to single-gene minicircles 3, 4 encoding an incomplete (until now 15) set of plastid proteins. The location of the remaining photosynthetic genes is unknown. We generated a data set of 6,480 unique expressed sequence tags (ESTs) from the toxic dinoflagellate Alexandrium tamarense (for details, see the Experimental Procedures in the Supplemental Data) to find the missing plastid genes and to understand the impact of endosymbiosis on genome evolution. Here we identify 48 of the non-minicircle-encoded photosynthetic genes in the nuclear genome of A. tamarense, accounting for the majority of the photosystem. Fifteen genes that are always found on the plastid genome of other algae and plants have been transferred to the nucleus in A. tamarense. The plastid-targeted genes have red and green algal origins. These results highlight the unique position of dinoflagellates as the champions of plastid gene transfer to the nucleus among photosynthetic eukaryotes.
Computational simulation of biomolecules can provide important insights into protein design, protein-ligand binding interactions, and ab initio biomolecular folding, among other applications. ...Accurate treatment of the solvent environment is essential in such applications, but the use of explicit solvents can add considerable cost. Implicit treatment of solvent effects using a dielectric continuum model is an attractive alternative to explicit solvation since it is able to describe solvation effects without the inclusion of solvent degrees of freedom. Previously, we described the development and parameterization of implicit solvent models for small molecules. Here, we extend the parameterization of the generalized Kirkwood (GK) implicit solvent model for use with biomolecules described by the AMOEBA force field via the addition of corrections to the calculation of effective radii that account for interstitial spaces that arise within biomolecules. These include element-specific pairwise descreening scale factors, a short-range neck contribution to describe the solvent-excluded space between pairs of nearby atoms, and finally tanh-based rescaling of the overall descreening integral. We then apply the AMOEBA/GK implicit solvent to a set of ten proteins and achieve an average coordinate root mean square deviation for the experimental structures of 2.0 Å across 500 ns simulations. Overall, the continued development of implicit solvent models will help facilitate the simulation of biomolecules on mechanistically relevant timescales.
Complex biological processes require coordinated function of many genes. One evolutionary solution to the problem of coordinately expressing functionally related genes in bacteria and nematodes is ...organization of genes in operons. Surprisingly, eukaryotic operons are considered rare outside the nematode lineage. In Drosophila melanogaster, we found lounge lizard (llz), which encodes a degenerin/ENaC cation channel, cotranscribed with CheB42a, a nonhomologous gene of unknown function residing <100 bp upstream. These two genes were transcribed from a single promoter as one primary transcript and were processed posttranscriptionally to generate individual mRNAs. The mechanism did not involve alternative splicing, and it differed from the trans splicing used in nematode operons. Both genes were expressed in the same tissues, and previous work suggested that both may be involved in courtship behavior. A bioinformatic approach identified numerous additional loci as potential Drosophila operons. These data reveal eukaryotic operon-like transcription of functionally related genes in DROSOPHILA: The results also suggest that operon-based transcription may be more common in eukaryotes than previously appreciated.
Autosomal dominant non-syndromic hearing loss (ADNSHL) displays gene-specific progression of hearing loss, which is amenable to sequential audioprofiling. We sought to refine the natural history of ...ADNSHL by examining audiometric data in 5-year increments. 2175 audiograms were included from four genetic causes of ADNSHL—
KCNQ4
(DFNA2),
GSDME
(DFNA5),
WFS1
(DFNA6/14/38), and
COCH
(DFNA9). Annual threshold deterioration (ATD) was calculated for each gene: for the speech-frequency pure tone average, the ATD, respectively, was 0.72 dB/year, 0.94 dB/year, 0.53 dB/year, and 1.41 dB/year, with the largest drops occurring from ages 45–50 (0.89 dB/year;
KCNQ4
), 5–10 (1.42 dB/year;
GSDME
), 40–45 (0.83 dB/year;
WFS1
), and 50–55 (2.09 dB/year;
COCH
). 5-year interval analysis of audiograms reveals the gene specific natural history of
KCNQ4
,
GSDME
,
WFS1
and
COCH
-related progressive hearing loss. Identifying ages at which hearing loss is most rapid informs clinical care and patient expectations. Natural history data are also essential to define outcomes of clinical trials that test novel therapies designed to correct or ameliorate these genetic forms of hearing loss.
Chondrosarcomas are malignant cartilage tumors that do not respond to traditional chemotherapy or radiation. The 5-year survival rate of histologic grade III chondrosarcoma is less than 30%. An ...animal model of chondrosarcoma has been established--namely, the Swarm Rat Chondrosarcoma (SRC)--and shown to resemble the human disease. Previous studies with this model revealed that tumor microenvironment could significantly influence chondrosarcoma malignancy.
To examine the effect of the microenvironment, SRC tumors were initiated at different transplantation sites. Pyrosequencing assays were utilized to assess the DNA methylation of the tumors, and SAGE libraries were constructed and sequenced to determine the gene expression profiles of the tumors. Based on the gene expression analysis, subsequent functional assays were designed to determine the relevancy of the specific genes in the development and progression of the SRC.
The site of transplantation had a significant impact on the epigenetic and gene expression profiles of SRC tumors. Our analyses revealed that SRC tumors were hypomethylated compared to control tissue, and that tumors at each transplantation site had a unique expression profile. Subsequent functional analysis of differentially expressed genes, albeit preliminary, provided some insight into the role that thymosin-β4, c-fos, and CTGF may play in chondrosarcoma development and progression.
This report describes the first global molecular characterization of the SRC model, and it demonstrates that the tumor microenvironment can induce epigenetic alterations and changes in gene expression in the SRC tumors. We documented changes in gene expression that accompany changes in tumor phenotype, and these gene expression changes provide insight into the pathways that may play a role in the development and progression of chondrosarcoma. Furthermore, specific functional analysis indicates that thymosin-β4 may have a role in chondrosarcoma metastasis.
Hearing loss is the leading sensory deficit, affecting ~ 5% of the population. It exhibits remarkable heterogeneity across 223 genes with 6328 pathogenic missense variants, making deafness-specific ...expertise a prerequisite for ascribing phenotypic consequences to genetic variants. Deafness-implicated variants are curated in the Deafness Variation Database (DVD) after classification by a genetic hearing loss expert panel and thorough informatics pipeline. However, seventy percent of the 128,167 missense variants in the DVD are “variants of uncertain significance” (VUS) due to insufficient evidence for classification. Here, we use the deep learning protein prediction algorithm, AlphaFold2, to curate structures for all DVD genes. We refine these structures with global optimization and the AMOEBA force field and use DDGun3D to predict folding free energy differences (∆∆G
Fold
) for all DVD missense variants. We find that 5772 VUSs have a large, destabilizing ∆∆G
Fold
that is consistent with pathogenic variants. When also filtered for CADD scores (> 25.7), we determine 3456 VUSs are likely pathogenic at a probability of 99.0%. Of the 224 genes in the DVD, 166 genes (74%) exhibit one or more missense variants predicted to cause a pathogenic change in protein folding stability. The VUSs prioritized here affect 119 patients (~ 3% of cases) sequenced by the OtoSCOPE targeted panel. Approximately half of these patients previously received an inconclusive report, and reclassification of these VUSs as pathogenic provides a new genetic diagnosis for six patients.
Ethnic-specific differences in minor allele frequency impact variant categorization for genetic screening of nonsyndromic hearing loss (NSHL) and other genetic disorders. We sought to evaluate all ...previously reported pathogenic NSHL variants in the context of a large number of controls from ethnically distinct populations sequenced with orthogonal massively parallel sequencing methods. We used HGMD, ClinVar, and dbSNP to generate a comprehensive list of reported pathogenic NSHL variants and re-evaluated these variants in the context of 8,595 individuals from 12 populations and 6 ethnically distinct major human evolutionary phylogenetic groups from three sources (Exome Variant Server, 1000 Genomes project, and a control set of individuals created for this study, the OtoDB). Of the 2,197 reported pathogenic deafness variants, 325 (14.8%) were present in at least one of the 8,595 controls, indicating a minor allele frequency (MAF) >0.00006. MAFs ranged as high as 0.72, a level incompatible with pathogenicity for a fully penetrant disease like NSHL. Based on these data, we established MAF thresholds of 0.005 for autosomal-recessive variants (excluding specific variants in GJB2) and 0.0005 for autosomal-dominant variants. Using these thresholds, we recategorized 93 (4.2%) of reported pathogenic variants as benign. Our data show that evaluation of reported pathogenic deafness variants using variant MAFs from multiple distinct ethnicities and sequenced by orthogonal methods provides a powerful filter for determining pathogenicity. The proposed MAF thresholds will facilitate clinical interpretation of variants identified in genetic testing for NSHL. All data are publicly available to facilitate interpretation of genetic variants causing deafness.