Inferring a realistic demographic model from genetic data is an important challenge to gain insights into the historical events during the speciation process and to detect molecular signatures of ...selection along genomes. Recent advances in divergence population genetics have reported that speciation in face of gene flow occurred more frequently than theoretically expected, but the approaches used did not account for genome-wide heterogeneity (GWH) in introgression rates. Here, we investigate the impact of GWH on the inference of divergence with gene flow between two cryptic species of the marine model Ciona intestinalis by analyzing polymorphism and divergence patterns in 852 protein-coding sequence loci. These morphologically similar entities are highly diverged molecular-wise, but evidence of hybridization has been reported in both laboratory and field studies. We compare various speciation models and test for GWH under the approximate Bayesian computation framework. Our results demonstrate the presence of significant extents of gene flow resulting from a recent secondary contact after >3 My of divergence in isolation. The inferred rates of introgression are relatively low, highly variable across loci and mostly unidirectional, which is consistent with the idea that numerous genetic incompatibilities have accumulated over time throughout the genomes of these highly diverged species. A genomic map of the level of gene flow identified two hotspots of introgression, that is, large genome regions of unidirectional introgression. This study clarifies the history and degree of isolation of two cryptic and partially sympatric model species and provides a methodological framework to investigate GWH at various stages of speciation process.
The evolution of cetaceans, fromtheir early transition to an aquatic lifestyle to their subsequent diversification, has been the subject of numerous studies. However, although the higher-level ...relationships among cetacean families have been largely settled, several aspects of the systematics within these groups remain unresolved. Problematic clades include the oceanic dolphins (37 spp.), which have experienced a recent rapid radiation, and the beaked whales (22 spp.), which have not been investigated in detail using nuclear loci. The combined application of high-throughput sequencing with techniques that target specific genomic sequences provide a powerful means of rapidly generating large volumes of orthologous sequence data for use in phylogenomic studies. To elucidate the phylogenetic relationships within the Cetacea,we combined sequence capture with Illumina sequencing to generate data for ∼3200 protein-coding genes for 68 cetacean species and their close relatives including the pygmy hippopotamus. By combining data from >38,000 exons with existing sequences from 11 cetaceans and seven outgroup taxa, we produced the first comprehensive comparative genomic data set for cetaceans, spanning 6,527,596 aligned base pairs (bp) and 89 taxa. Phylogenetic trees reconstructed with maximum likelihood and Bayesian inference of concatenated loci, as well as with coalescence analyses of individual gene trees, produced mostly concordant and well-supported trees. Our results completely resolve the relationships among beaked whales as well as the contentious relationships among oceanic dolphins, especially the problematic subfamily Delphinidae. We carried out Bayesian estimation of species divergence times using MCMCTree andcompared ourcomplete data set to a subset of clocklike genes. Analyses using the complete data set consistently showed less variance in divergence times than the reduced data set. In addition, integration of new fossils (e.g., Mystacodon selenensis) indicates that the diversification of Crown Cetacea began before the Late Eocene and the divergence of Crown Delphinidae as early as theMiddle Miocene.
Celotno besedilo
Dostopno za:
BFBNIB, DOBA, IZUM, KILJ, NMLJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Tunicates are the closest relatives of vertebrates and are widely used as models to study the evolutionary developmental biology of chordates. Their phylogeny, however, remains poorly understood, and ...to date, only the 18S rRNA nuclear gene and mitogenomes have been used to delineate the major groups of tunicates. To resolve their evolutionary relationships and provide a first estimate of their divergence times, we used a transcriptomic approach to build a phylogenomic dataset including all major tunicate lineages, consisting of 258 evolutionarily conserved orthologous genes from representative species.
Phylogenetic analyses using site-heterogeneous CAT mixture models of amino acid sequence evolution resulted in a strongly supported tree topology resolving the relationships among four major tunicate clades: (1) Appendicularia, (2) Thaliacea + Phlebobranchia + Aplousobranchia, (3) Molgulidae, and (4) Styelidae + Pyuridae. Notably, the morphologically derived Thaliacea are confirmed as the sister group of the clade uniting Phlebobranchia + Aplousobranchia within which the precise position of the model ascidian genus Ciona remains uncertain. Relaxed molecular clock analyses accommodating the accelerated evolutionary rate of tunicates reveal ancient diversification (~ 450-350 million years ago) among the major groups and allow one to compare their evolutionary age with respect to the major vertebrate model lineages.
Our study represents the most comprehensive phylogenomic dataset for the main tunicate lineages. It offers a reference phylogenetic framework and first tentative timescale for tunicates, allowing a direct comparison with vertebrate model species in comparative genomics and evolutionary developmental biology studies.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Phylogenomics has revealed the existence of fast-evolving animal phyla in which the amino acid substitution rate, averaged across many proteins, is consistently higher than in other lineages. The ...reasons for such differences in proteome-wide evolutionary rates are still unknown, largely because only a handful of species offer within-species genomic data from which molecular evolutionary processes can be deduced. In this study, we use next-generation sequencing technologies and individual whole-transcriptome sequencing to gather extensive polymorphism sequence data sets from Ciona intestinalis. Ciona is probably the best-characterized member of the fast-evolving Urochordata group (tunicates), which was recently identified as the sister group of the slow-evolving vertebrates. We introduce and validate a maximum-likelihood framework for single-nucleotide polymorphism and genotype calling, based on high-throughput short-read typing. We report that the C. intestinalis proteome is characterized by a high level of within-species diversity, efficient purifying selection, and a substantial percentage of adaptive amino acid substitutions. We conclude that the increased rate of amino acid sequence evolution in tunicates, when compared with vertebrates, is the consequence of both a 2-6 times higher per-year mutation rate and prevalent adaptive evolution.
Next Generation Sequencing technologies (NGS) are rapidly invading many evolutionary and ecological fields, such as phylogenomics, molecular evolution, population genomics and molecular ecology. ...Among the potential targets of NGS is transcriptome sequencing, a fast and relatively cheap way to generate massive amounts of coding sequence data, offering promising perspectives for the analysis of molecular diversity in the wild. A number of molecular ecology research groups therefore may switch from DNA‐based to RNA‐based typing in the near future. Sample preparation from natural populations, however, requires specific care and protocols when RNA is the target. Furthermore, NGS sequencing of transcriptome requires high amount of good‐quality RNA. Here we present the results of RNA extraction experiments from various samples of 39 animal species caught in the wild. We compared tissue preparation and storage conditions, evaluated and improved standard RNA extraction protocols, and achieved RNA yield and quality suitable for NGS in all cases. We derive general guidelines for the production of ready‐to‐sequence RNA in nonmodel animals sampled in the field.
Tunicates have been recently revealed to be the closest living relatives of vertebrates. Yet, with more than 2500 described species, details of their evolutionary history are still obscure. From a ...molecular point of view, tunicate phylogenetic relationships have been mostly studied based on analyses of 18S rRNA sequences, which indicate several major clades at odds with the traditional class-level arrangements. Nonetheless, substantial uncertainty remains about the phylogenetic relationships and taxonomic status of key groups such as the Aplousobranchia, Appendicularia, and Thaliacea.
Thirty new complete 18S rRNA sequences were acquired from previously unsampled tunicate species, with special focus on groups presenting high evolutionary rate. The updated 18S rRNA dataset has been aligned with respect to the constraint on homology imposed by the rRNA secondary structure. A probabilistic framework of phylogenetic reconstruction was adopted to accommodate the particular evolutionary dynamics of this ribosomal marker. Detailed Bayesian analyses were conducted under the non-parametric CAT mixture model accounting for site-specific heterogeneity of the evolutionary process, and under RNA-specific doublet models accommodating the occurrence of compensatory substitutions in stem regions. Our results support the division of tunicates into three major clades: 1) Phlebobranchia + Thaliacea + Aplousobranchia, 2) Appendicularia, and 3) Stolidobranchia, but the position of Appendicularia could not be firmly resolved. Our study additionally reveals that most Aplousobranchia evolve at extremely high rates involving changes in secondary structure of their 18S rRNA, with the exception of the family Clavelinidae, which appears to be slowly evolving. This extreme rate heterogeneity precluded resolving with certainty the exact phylogenetic placement of Aplousobranchia. Finally, the best fitting secondary-structure and CAT-mixture models suggest a sister-group relationship between Salpida and Pyrosomatida within Thaliacea.
An updated phylogenetic framework for tunicates is provided based on phylogenetic analyses using the most realistic evolutionary models currently available for ribosomal molecules and an unprecedented taxonomic sampling. Detailed analyses of the 18S rRNA gene allowed a clear definition of the major tunicate groups and revealed contrasting evolutionary dynamics among major lineages. The resolving power of this gene nevertheless appears limited within the clades composed of Phlebobranchia + Thaliacea + Aplousobranchia and Pyuridae + Styelidae, which were delineated as spots of low resolution. These limitations underline the need to develop new nuclear markers in order to further resolve the phylogeny of this keystone group in chordate evolution.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Gene loss and gain during genome evolution are thought to play important roles in adaptive phenotypic diversification. Among mammals, bats possess the smallest genomes and have evolved the unique ...abilities of powered flight and laryngeal echolocation. To investigate whether gene family evolution has contributed to the genome downsizing and phenotypic diversification in this group, we performed comparative evolutionary analyses of complete proteome data for eight bat species, including echolocating and non-echolocating forms, together with the proteomes of 12 other laurasiatherian mammals. Our analyses revealed extensive gene loss in the most recent ancestor of bats, and also of carnivores (both >1,000 genes), although this gene contraction did not appear to correlate with the reduction in genome size in bats. Comparisons of highly dynamic families suggested that expansion and contraction affected genes with similar functions (immunity, response to stimulus) in all laurasiatherian lineages. However, the magnitude and direction of these changes varied greatly among groups. In particular, our results showed contraction of the Olfactory Receptor (OR) gene repertoire in the last common ancestor of all bats, as well as that of the echolocating species studied. In contrast, non-echolocating fruit bats showed evidence of expansion in ORs, supporting a "trade-off" between sensory modalities.
Altered dystrophin expression was found in some tumors and recent studies identified a developmental onset of Duchenne muscular dystrophy (DMD). Given that embryogenesis and carcinogenesis share many ...mechanisms, we analyzed a broad spectrum of tumors to establish whether dystrophin alteration evokes related outcomes. Transcriptomic, proteomic, and mutation datasets from fifty tumor tissues and matching controls (10,894 samples) and 140 corresponding tumor cell lines were analyzed. Interestingly, dystrophin transcripts and protein expression were found widespread across healthy tissues and at housekeeping gene levels. In 80% of tumors,
expression was reduced due to transcriptional downregulation and not somatic mutations. The full-length transcript encoding Dp427 was decreased in 68% of tumors, while Dp71 variants showed variability of expression. Notably, low expression of dystrophins was associated with a more advanced stage, older age of onset, and reduced survival across different tumors. Hierarchical clustering analysis of
transcripts distinguished malignant from control tissues. Transcriptomes of primary tumors and tumor cell lines with low
expression showed enrichment of specific pathways in the differentially expressed genes. Pathways consistently identified: ECM-receptor interaction, calcium signaling, and PI3K-Akt are also altered in DMD muscle. Therefore, the importance of this largest known gene extends beyond its roles identified in DMD, and certainly into oncology.
RNA methylation plays an important role in functional regulation of RNAs, and has thus attracted an increasing interest in biology and drug discovery. Here, we collected and collated transcriptomic, ...proteomic, structural and physical interaction data from the Harmonizome database, and applied supervised machine learning to predict novel genes associated with RNA methylation pathways in human. We selected five types of classifiers, which we trained and evaluated using cross-validation on multiple training sets. The best models reached 88% accuracy based on cross-validation, and an average 91% accuracy on the test set. Using protein-protein interaction data, we propose six molecular sub-networks linking model predictions to previously known RNA methylation genes, with roles in mRNA methylation, tRNA processing, rRNA processing, but also protein and chromatin modifications. Our study exemplifies how access to large omics datasets joined by machine learning methods can be used to predict gene function.