The novel coronavirus severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is the cause of COVID-19. The main receptor of SARS-CoV-2, angiotensin I converting enzyme 2 (ACE2), is now ...undergoing extensive scrutiny to understand the routes of transmission and sensitivity in different species. Here, we utilized a unique dataset of ACE2 sequences from 410 vertebrate species, including 252 mammals, to study the conservation of ACE2 and its potential to be used as a receptor by SARS-CoV-2. We designed a five-category binding score based on the conservation properties of 25 amino acids important for the binding between ACE2 and the SARS-CoV-2 spike protein. Only mammals fell into the medium to very high categories and only catarrhine primates into the very high category, suggesting that they are at high risk for SARS-CoV-2 infection. We employed a protein structural analysis to qualitatively assess whether amino acid changes at variable residues would be likely to disrupt ACE2/SARS-CoV-2 spike protein binding and found the number of predicted unfavorable changes significantly correlated with the binding score. Extending this analysis to human population data, we found only rare (frequency <0.001) variants in 10/25 binding sites. In addition, we found significant signals of selection and accelerated evolution in the ACE2 coding sequence across all mammals, and specific to the bat lineage. Our results, if confirmed by additional experimental data, may lead to the identification of intermediate host species for SARS-CoV-2, guide the selection of animal models of COVID-19, and assist the conservation of animals both in native habitats and in human care.
Marine stickleback fish have colonized and adapted to thousands of streams and lakes formed since the last ice age, providing an exceptional opportunity to characterize genomic mechanisms underlying ...repeated ecological adaptation in nature. Here we develop a high-quality reference genome assembly for threespine sticklebacks. By sequencing the genomes of twenty additional individuals from a global set of marine and freshwater populations, we identify a genome-wide set of loci that are consistently associated with marine-freshwater divergence. Our results indicate that reuse of globally shared standing genetic variation, including chromosomal inversions, has an important role in repeated evolution of distinct marine and freshwater sticklebacks, and in the maintenance of divergent ecotypes during early stages of reproductive isolation. Both coding and regulatory changes occur in the set of loci underlying marine-freshwater evolution, but regulatory changes appear to predominate in this well known example of repeated adaptive evolution in nature.
Angiosarcoma is a highly aggressive cancer of blood vessel-forming cells with few effective treatment options and high patient mortality. It is both rare and heterogenous, making large, well-powered ...genomic studies nearly impossible. Dogs commonly suffer from a similar cancer, called hemangiosarcoma, with breeds like the golden retriever carrying heritable genetic factors that put them at high risk. If the clinical similarity of canine hemangiosarcoma and human angiosarcoma reflects shared genomic etiology, dogs could be a critically needed model for advancing angiosarcoma research. We assessed the genomic landscape of canine hemangiosarcoma via whole-exome sequencing (47 golden retriever hemangiosarcomas) and RNA sequencing (74 hemangiosarcomas from multiple breeds). Somatic coding mutations occurred most frequently in the tumor suppressor
(59.6% of cases) as well as two genes in the PI3K pathway: the oncogene
(29.8%) and its regulatory subunit
(8.5%). The predominant mutational signature was the age-associated deamination of cytosine to thymine. As reported in human angiosarcoma,
was recurrently deleted and
, and
recurrently gained. We compared the canine data to human data recently released by The Angiosarcoma Project, and found many of the same genes and pathways significantly enriched for somatic mutations, particularly in breast and visceral angiosarcomas. Canine hemangiosarcoma closely models the genomic landscape of human angiosarcoma of the breast and viscera, and is a powerful tool for investigating the pathogenesis of this devastating disease. IMPLICATIONS: We characterize the genomic landscape of canine hemangiosarcoma and demonstrate its similarity to human angiosarcoma.
Lymphoma is the most common hematological malignancy in developed countries. Outcome is strongly determined by molecular subtype, reflecting a need for new and improved treatment options. Dogs ...spontaneously develop lymphoma, and the predisposition of certain breeds indicates genetic risk factors. Using the dog breed structure, we selected three lymphoma predisposed breeds developing primarily T-cell (boxer), primarily B-cell (cocker spaniel), and with equal distribution of B- and T-cell lymphoma (golden retriever), respectively. We investigated the somatic mutations in B- and T-cell lymphomas from these breeds by exome sequencing of tumor and normal pairs. Strong similarities were evident between B-cell lymphomas from golden retrievers and cocker spaniels, with recurrent mutations in TRAF3-MAP3K14 (28% of all cases), FBXW7 (25%), and POT1 (17%). The FBXW7 mutations recurrently occur in a specific codon; the corresponding codon is recurrently mutated in human cancer. In contrast, T-cell lymphomas from the predisposed breeds, boxers and golden retrievers, show little overlap in their mutation pattern, sharing only one of their 15 most recurrently mutated genes. Boxers, which develop aggressive T-cell lymphomas, are typically mutated in the PTEN-mTOR pathway. T-cell lymphomas in golden retrievers are often less aggressive, and their tumors typically showed mutations in genes involved in cellular metabolism. We identify genes with known involvement in human lymphoma and leukemia, genes implicated in other human cancers, as well as novel genes that could allow new therapeutic options.
Obsessive-compulsive disorder is a severe psychiatric disorder linked to abnormalities in glutamate signaling and the cortico-striatal circuit. We sequenced coding and regulatory elements for 608 ...genes potentially involved in obsessive-compulsive disorder in human, dog, and mouse. Using a new method that prioritizes likely functional variants, we compared 592 cases to 560 controls and found four strongly associated genes, validated in a larger cohort. NRXN1 and HTR2A are enriched for coding variants altering postsynaptic protein-binding domains. CTTNBP2 (synapse maintenance) and REEP3 (vesicle trafficking) are enriched for regulatory variants, of which at least six (35%) alter transcription factor-DNA binding in neuroblastoma cells. NRXN1 achieves genome-wide significance (p = 6.37 × 10
) when we include 33,370 population-matched controls. Our findings suggest synaptic adhesion as a key component in compulsive behaviors, and show that targeted sequencing plus functional annotation can identify potentially causative variants, even when genomic data are limited.Obsessive-compulsive disorder (OCD) is a neuropsychiatric disorder with symptoms including intrusive thoughts and time-consuming repetitive behaviors. Here Noh and colleagues identify genes enriched for functional variants associated with increased risk of OCD.
The domestic ferret (Mustela putorius furo) is an important animal model for multiple human respiratory diseases. It is considered the 'gold standard' for modeling human influenza virus infection and ...transmission. Here we describe the 2.41 Gb draft genome assembly of the domestic ferret, constituting 2.28 Gb of sequence plus gaps. We annotated 19,910 protein-coding genes on this assembly using RNA-seq data from 21 ferret tissues. We characterized the ferret host response to two influenza virus infections by RNA-seq analysis of 42 ferret samples from influenza time-course data and showed distinct signatures in ferret trachea and lung tissues specific to 1918 or 2009 human pandemic influenza virus infections. Using microarray data from 16 ferret samples reflecting cystic fibrosis disease progression, we showed that transcriptional changes in the CFTR-knockout ferret lung reflect pathways of early disease that cannot be readily studied in human infants with cystic fibrosis disease.
Sporadic angiosarcomas are aggressive vascular sarcomas whose rarity and genomic complexity present significant obstacles in deciphering the pathogenic significance of individual genetic alterations. ...Numerous fusion genes have been identified across multiple types of cancers, but their existence and significance remain unclear in sporadic angiosarcomas. In this study, we leveraged RNA-sequencing data from 13 human angiosarcomas and 76 spontaneous canine hemangiosarcomas to identify fusion genes associated with spontaneous vascular malignancies. Ten novel protein-coding fusion genes, including
and
, were identified in seven of the 13 human tumors, with two tumors showing mutations of
.
and
mutations were found in angiosarcomas without fusions or
mutations. We found 15 novel protein-coding fusion genes including
, and
in 11 of the 76 canine hemangiosarcomas; these fusion genes were seen exclusively in tumors of the angiogenic molecular subtype that contained recurrent mutations in
, and
. In particular, fusion genes and mutations of
cooccurred in tumors with higher frequency than expected by random chance, and they enriched gene signatures predicting activation of angiogenic pathways. Comparative transcriptomic analysis of human angiosarcomas and canine hemangiosarcomas identified shared molecular signatures associated with activation of PI3K/AKT/mTOR pathways. Our data suggest that genome instability induced by
mutations might create a predisposition for fusion events that may contribute to tumor progression by promoting selection and/or enhancing fitness through activation of convergent angiogenic pathways in this vascular malignancy. IMPLICATIONS: This study shows that, while drive events of malignant vasoformative tumors of humans and dogs include diverse mutations and stochastic rearrangements that create novel fusion genes, convergent transcriptional programs govern the highly conserved morphologic organization and biological behavior of these tumors in both species.
The domestic dog, Canis familiaris, is a valuable model for studying human diseases. The publication of the latest Canine genome build and annotation, CanFam3.1 provides an opportunity to enhance our ...understanding of gene regulation across tissues in the dog model system. In this study, we used the latest dog genome assembly and small RNA sequencing data from 9 different dog tissues to predict novel miRNAs in the dog genome, as well as to annotate conserved miRNAs from the miRBase database that were missing from the current dog annotation. We used both miRCat and miRDeep2 algorithms to computationally predict miRNA loci. The resulting, putative hairpin sequences were analysed in order to discard false positives, based on predicted secondary structures and patterns of small RNA read alignments. Results were further divided into high and low confidence miRNAs, using the same criteria. We generated tissue specific expression profiles for the resulting set of 811 loci: 720 conserved miRNAs, (207 of which had not been previously annotated in the dog genome) and 91 novel miRNA loci. Comparative analyses revealed 8 putative homologues of some novel miRNA in ferret, and one in microbat. All miRNAs were also classified into the genic and intergenic categories, based on the Ensembl RefSeq gene annotation for CanFam3.1. This additionally allowed us to identify four previously undescribed MiRtrons among our total set of miRNAs. We additionally annotated piRNAs, using proTRAC on the same input data. We thus identified 263 putative clusters, most of which (211 clusters) were found to be expressed in testis. Our results represent an important improvement of the dog genome annotation, paving the way to further research on the evolution of gene regulation, as well as on the contribution of post-transcriptional regulation to pathological conditions.
Cichlid fishes have evolved tremendous morphological and behavioral diversity in the waters of East Africa. Within each of the Great Lakes Tanganyika, Malawi, and Victoria, the phenomena of ...hybridization and retention of ancestral polymorphism explain allele sharing across species. Here, we explore the sharing of single nucleotide polymorphisms (SNPs) between the major East African cichlid assemblages. A set of approximately 200 genic and nongenic SNPs was ascertained in five Lake Malawi species and genotyped in a diverse collection of ~160 species from across Africa. We observed segregating polymorphism outside of the Malawi lineage for more than 50% of these loci; this holds similarly for genic versus nongenic SNPs, as well as for SNPs at putative CpG versus non-CpG sites. Bayesian and principal component analyses of genetic structure in the data demonstrate that the Lake Malawi endemic flock is not monophyletic and that river species have likely contributed significantly to Malawi genomes. Coalescent simulations support the hypothesis that river cichlids have transported polymorphism between lake assemblages. We observed strong genetic differentiation between Malawi lineages for approximately 8% of loci, with contributions from both genic and nongenic SNPs. Notably, more than half of these outlier loci between Malawi groups are polymorphic outside of the lake. Cichlid fishes have evolved diversity in Lake Malawi as new mutations combined with standing genetic variation shared across East Africa.
Canine degenerative myelopathy (DM) is a naturally occurring neurodegenerative disease with similarities to some forms of amyotrophic lateral sclerosis (ALS). Most dogs that develop DM are homozygous ...for a common superoxide dismutase 1 gene (SOD1) mutation. However, not all dogs homozygous for this mutation develop disease. We performed a genome-wide association analysis in the Pembroke Welsh Corgi (PWC) breed comparing DM-affected and -unaffected dogs homozygous for the SOD1 mutation. The analysis revealed a modifier locus on canine chromosome 25. A haplotype within the SP110 nuclear body protein (SP110) was present in 40% of affected compared with 4% of unaffected dogs (P = 1.5 × 10−5), and was associated with increased probability of developing DM (P = 4.8 × 10−6) and earlier onset of disease (P = 1.7 × 10−5). SP110 is a nuclear body protein involved in the regulation of gene transcription. Our findings suggest that variations in SP110-mediated gene transcription may underlie, at least in part, the variability in risk for developing DM among PWCs that are homozygous for the disease-related SOD1 mutation. Further studies are warranted to clarify the effect of this modifier across dog breeds.