The domestic cat (Felis catus) numbers over 94 million in the USA alone, occupies households as a companion animal, and, like humans, suffers from cancer and common and rare diseases. However, ...genome-wide sequence variant information is limited for this species. To empower trait analyses, a new cat genome reference assembly was developed from PacBio long sequence reads that significantly improve sequence representation and assembly contiguity. The whole genome sequences of 54 domestic cats were aligned to the reference to identify single nucleotide variants (SNVs) and structural variants (SVs). Across all cats, 16 SNVs predicted to have deleterious impacts and in a singleton state were identified as high priority candidates for causative mutations. One candidate was a stop gain in the tumor suppressor FBXW7. The SNV is found in cats segregating for feline mediastinal lymphoma and is a candidate for inherited cancer susceptibility. SV analysis revealed a complex deletion coupled with a nearby potential duplication event that was shared privately across three unrelated cats with dwarfism and is found within a known dwarfism associated region on cat chromosome B1. This SV interrupted UDP-glucose 6-dehydrogenase (UGDH), a gene involved in the biosynthesis of glycosaminoglycans. Importantly, UGDH has not yet been associated with human dwarfism and should be screened in undiagnosed patients. The new high-quality cat genome reference and the compilation of sequence variation demonstrate the importance of these resources when searching for disease causative alleles in the domestic cat and for identification of feline biomedical models.
Current sequencing methods produce large amounts of data, but genome assemblies based on these data are often woefully incomplete. These incomplete and error-filled assemblies result in many ...annotation errors, especially in the number of genes present in a genome. In this paper we investigate the magnitude of the problem, both in terms of total gene number and the number of copies of genes in specific families. To do this, we compare multiple draft assemblies against higher-quality versions of the same genomes, using several new assemblies of the chicken genome based on both traditional and next-generation sequencing technologies, as well as published draft assemblies of chimpanzee. We find that upwards of 40% of all gene families are inferred to have the wrong number of genes in draft assemblies, and that these incorrect assemblies both add and subtract genes. Using simulated genome assemblies of Drosophila melanogaster, we find that the major cause of increased gene numbers in draft genomes is the fragmentation of genes onto multiple individual contigs. Finally, we demonstrate the usefulness of RNA-Seq in improving the gene annotation of draft assemblies, largely by connecting genes that have been fragmented in the assembly process.
A major challenge in biology is to understand the genetic basis of adaptation. One compelling idea is that groups of tightly linked genes (i.e., “supergenes” 1, 2) facilitate adaptation in suites of ...traits that determine fitness. Despite their likely importance, little is known about how alternate supergene alleles arise and become differentiated, nor their ultimate fate within species. Herein we address these questions by investigating the evolutionary history of a supergene in white-throated sparrows, Zonotrichia albicollis. This species comprises two morphs, tan and white, that differ in pigmentation and components of social behavior 3–5. Morph is determined by alternative alleles at a balanced >100-Mb inversion-based supergene, providing a unique system for studying gene-behavior relationships. Using over two decades of field data, we document near-perfect disassortative mating among morphs, as well as the fitness consequences of rare assortative mating. We use de novo whole-genome sequencing coupled with population- and phylogenomic data to show that alternate supergene alleles are highly divergent at over 1,000 genes and that these alleles originated prior to the split of Z. albicollis from its sister species and may be polymorphic in Z. albicollis due to a past hybridization event. We provide evidence that the “white” allele may be degrading, similar to neo-Y/W sex chromosomes. We further show that the “tan” allele has surprisingly low levels of genetic diversity yet does not show several canonical signatures of recurrent positive selection. We discuss these results in the context of the origin, molecular evolution, and possible fate of this remarkable polymorphism.
Display omitted
Display omitted
•A balanced supergene determines disassortative mating between bird morphs•One supergene allele appears to be degrading similar to neo-Y/W chromosomes•Alternate alleles are ancient and likely became polymorphic via hybridization
Supergenes are linked clusters of genes that contribute to adaptive variation. The origin and fate of polymorphic supergenes within species are not well understood. To advance the understanding of supergenes, Tuttle et al. examine the origin, evolutionary history, and possible fate of a fitness-related supergene in the white-throated sparrow.
We sequenced the MSY (male-specific region of the Y chromosome) of the C57BL/6J strain of the laboratory mouse Mus musculus. In contrast to theories that Y chromosomes are heterochromatic and gene ...poor, the mouse MSY is 99.9% euchromatic and contains about 700 protein-coding genes. Only 2% of the MSY derives from the ancestral autosomes that gave rise to the mammalian sex chromosomes. Instead, all but 45 of the MSY's genes belong to three acquired, massively amplified gene families that have no homologs on primate MSYs but do have acquired, amplified homologs on the mouse X chromosome. The complete mouse MSY sequence brings to light dramatic forces in sex chromosome evolution: lineage-specific convergent acquisition and amplification of X-Y gene families, possibly fueled by antagonism between acquired X-Y homologs. The mouse MSY sequence presents opportunities for experimental studies of a sex-specific chromosome in its entirety, in a genetically tractable model organism.
In order to provide a comprehensive resource for human structural variants (SVs), we generated long-read sequence data and analyzed SVs for fifteen human genomes. We sequence resolved 99,604 ...insertions, deletions, and inversions including 2,238 (1.6 Mbp) that are shared among all discovery genomes with an additional 13,053 (6.9 Mbp) present in the majority, indicating minor alleles or errors in the reference. Genotyping in 440 additional genomes confirms the most common SVs in unique euchromatin are now sequence resolved. We report a ninefold SV bias toward the last 5 Mbp of human chromosomes with nearly 55% of all VNTRs (variable number of tandem repeats) mapping to this portion of the genome. We identify SVs affecting coding and noncoding regulatory loci improving annotation and interpretation of functional variation. These data provide the framework to construct a canonical human reference and a resource for developing advanced representations capable of capturing allelic diversity.
Display omitted
•We sequence resolve and annotate 99,604 common human structural variants•55% of VNTRs map to the end of chromosomes and correlate with double-strand breaks•Alternate alleles facilitate accurate genotyping with short reads and new associations•We patch the reference and add diversity needed for developing a pan human genome
Long-read sequencing allows generation of a large catalog of human structural variants and the development of an algorithm for genotyping SVs from short-read data, clarifying the spectrum and importance of structural variation in the human genome.
The human X and Y chromosomes evolved from an ordinary pair of autosomes, but millions of years ago genetic decay ravaged the Y chromosome, and only three per cent of its ancestral genes survived. We ...reconstructed the evolution of the Y chromosome across eight mammals to identify biases in gene content and the selective pressures that preserved the surviving ancestral genes. Our findings indicate that survival was nonrandom, and in two cases, convergent across placental and marsupial mammals. We conclude that the gene content of the Y chromosome became specialized through selection to maintain the ancestral dosage of homologous X-Y gene pairs that function as broadly expressed regulators of transcription, translation and protein stability. We propose that beyond its roles in testis determination and spermatogenesis, the Y chromosome is essential for male viability, and has unappreciated roles in Turner's syndrome and in phenotypic differences between the sexes in health and disease.
After birds diverged from mammals, different ancestral autosomes evolved into sex chromosomes in each lineage. In birds, females are ZW and males are ZZ, but in mammals females are XX and males are ...XY. We sequenced the chicken W chromosome, compared its gene content with our reconstruction of the ancestral autosomes, and followed the evolutionary trajectory of ancestral W-linked genes across birds. Avian W chromosomes evolved in parallel with mammalian Y chromosomes, preserving ancestral genes through selection to maintain the dosage of broadly expressed regulators of key cellular processes. We propose that, like the human Y chromosome, the chicken W chromosome is essential for embryonic viability of the heterogametic sex. Unlike other sequenced sex chromosomes, the chicken W chromosome did not acquire and amplify genes specifically expressed in reproductive tissues. We speculate that the pressures that drive the acquisition of reproduction-related genes on sex chromosomes may be specific to the male germ line.
The eastern oyster, Crassostrea virginica, provides important ecological and economical services, making it the target of restoration projects and supporting a significant fishery/aquaculture ...industry with landings valued at more than $100 million in 2012 in the United States of America. Due to the impact of infectious diseases on wild, restored, and cultured populations, the eastern oyster has been the focus of studies on host–pathogen interactions and immunity, as well as the target of selective breeding efforts for disease resistant oyster lines. Despite these efforts, relatively little is known about the genetic basis of resistance to diseases or environmental stress, not only in eastern oyster, but also in other molluscan species of commercial interest worldwide. In order to develop tools and resources to assist in the elucidation of the genomic basis of traits of commercial, biological, and ecological interest in oysters, a team of genome and bioinformatics experts, in collaboration with the oyster research community, is sequencing, assembling, and annotating the first reference genome for the eastern oyster and producing an exhaustive transcriptome from a variety of oyster developmental stages and tissues in response to a diverse set of environmentally-relevant stimuli. These transcriptomes and reference genome for the eastern oyster, added to the already available genome and transcriptomes for the Pacific oyster (Crassostrea gigas) and other bivalve species, will be an essential resource for the discovery of candidate genes and markers associated with traits of commercial, biological, and ecologic importance in bivalve molluscs, including those related to host–pathogen interactions and immunity.
•The genome of the eastern oyster, Crassostreae virginica, will be sequenced through a community effort.•Transcriptomes from a diverse set of tissues and developmental stages in response to a variety of stimuli will be produced.•We call to members of the research community to become involved in this project through sharing of data and tool evaluation.•Tools produced through this project will facilitate the study of host-pathogen interactions and immunity in bivalves.
We compared the human and mouse X chromosomes to systematically test Ohno's law, which states that the gene content of X chromosomes is conserved across placental mammals. First, we improved the ...accuracy of the human X-chromosome reference sequence through single-haplotype sequencing of ampliconic regions. The new sequence closed gaps in the reference sequence, corrected previously misassembled regions and identified new palindromic amplicons. Our subsequent analysis led us to conclude that the evolution of human and mouse X chromosomes was bimodal. In accord with Ohno's law, 94-95% of X-linked single-copy genes are shared by humans and mice; most are expressed in both sexes. Notably, most X-ampliconic genes are exceptions to Ohno's law: only 31% of human and 22% of mouse X-ampliconic genes had orthologs in the other species. X-ampliconic genes are expressed predominantly in testicular germ cells, and many were independently acquired since divergence from the common ancestor of humans and mice, specializing portions of their X chromosomes for sperm production.
Significance We present highlights of the first complete domestic cat reference genome, to our knowledge. We provide evolutionary assessments of the feline protein-coding genome, population genetic ...discoveries surrounding domestication, and a resource of domestic cat genetic variants. These analyses span broadly, from carnivore adaptations for hunting behavior to comparative odorant and chemical detection abilities between cats and dogs. We describe how segregating genetic variation in pigmentation phenotypes has reached fixation within a single breed, and also highlight the genomic differences between domestic cats and wildcats. Specifically, the signatures of selection in the domestic cat genome are linked to genes associated with gene knockout models affecting memory, fear-conditioning behavior, and stimulus-reward learning, and potentially point to the processes by which cats became domesticated.
Little is known about the genetic changes that distinguish domestic cat populations from their wild progenitors. Here we describe a high-quality domestic cat reference genome assembly and comparative inferences made with other cat breeds, wildcats, and other mammals. Based upon these comparisons, we identified positively selected genes enriched for genes involved in lipid metabolism that underpin adaptations to a hypercarnivorous diet. We also found positive selection signals within genes underlying sensory processes, especially those affecting vision and hearing in the carnivore lineage. We observed an evolutionary tradeoff between functional olfactory and vomeronasal receptor gene repertoires in the cat and dog genomes, with an expansion of the feline chemosensory system for detecting pheromones at the expense of odorant detection. Genomic regions harboring signatures of natural selection that distinguish domestic cats from their wild congeners are enriched in neural crest-related genes associated with behavior and reward in mouse models, as predicted by the domestication syndrome hypothesis. Our description of a previously unidentified allele for the gloving pigmentation pattern found in the Birman breed supports the hypothesis that cat breeds experienced strong selection on specific mutations drawn from random bred populations. Collectively, these findings provide insight into how the process of domestication altered the ancestral wildcat genome and build a resource for future disease mapping and phylogenomic studies across all members of the Felidae.