Species delimitation is one of the most contested areas in modern biology, with widespread disagreement about almost every aspect of the definition and implementation of the “species” label. While this debate is intellectually stimulating, it also has real implications for conservation, where its impacts on taxonomic inflation or inertia can mean that specific populations receive adequate conservation measures or are ignored. Recently, the rise of next generation sequencing and phylogenomics has revolutionised phylogenetic understanding of many organismal groups but has simultaneously highlighted the porosity of genomes in terms of admixture across previously delineated species barriers. The extraordinary power of genomic data is increasingly being used to delineate species, and several publications in this domain have recently attracted significant attention and criticism. Here we revisit the question of species delimitation, but from a genomic context. We ask how and whether the large amounts of data provided by genomic methods can resolve the longstanding discussion on the validity and application of phylogenetic and allied species concepts, and how some recent examples can inform this debate. We argue that conserving adaptive potential is a priority for conservation, and no single species concept currently does that adequately on its own. Genomic data holds the potential to add unprecedented detail, but frequently falls short of this potential.
Abstract
Phylogenomic data provide valuable opportunities for studying evolutionary rates and timescales. These analyses require theoretical and statistical tools based on molecular clocks. We present ClockstaRX, a flexible platform for exploring and testing evolutionary rate signals in phylogenomic data. Here, information about evolutionary rates in branches across gene trees is placed in Euclidean space, allowing data transformation, visualization, and hypothesis testing. ClockstaRX implements formal tests for identifying groups of loci and branches that make a large contribution to patterns of rate variation. This information can then be used to test for drivers of genomic evolutionary rates or to inform models for molecular dating. Drawing on the results of a simulation study, we recommend forms of data exploration and filtering that might be useful prior to molecular-clock analyses.
The South Caucasus, situated between the Black and Caspian Seas, geographically links Europe with the Near East and has served as a crossroad for human migrations for many millennia 1–7. Despite a vast archaeological record showing distinct cultural turnovers, the demographic events that shaped the human populations of this region are not known 8, 9. To shed light on the maternal genetic history of the region, we analyzed the complete mitochondrial genomes of 52 ancient skeletons from present-day Armenia and Artsakh spanning 7,800 years and combined this dataset with 206 mitochondrial genomes of modern Armenians. We also included previously published data of seven neighboring populations (n = 482). Coalescence-based analyses suggest that the population size in this region rapidly increased after the Last Glacial Maximum ca. 18 kya. We find that the lowest genetic distance in this dataset is between modern Armenians and the ancient individuals, as also reflected in both network analyses and discriminant analysis of principal components. We used approximate Bayesian computation to test five different demographic scenarios explaining the formation of the modern Armenian gene pool. Despite well-documented cultural shifts in the South Caucasus across this time period, our results strongly favor a genetic continuity model in the maternal gene pool. This has implications for interpreting prehistoric migration dynamics and cultural shifts in this part of the world.
• We analyzed 52 full mitochondrial genomes from ancient humans in the South Caucasus
• The results show a high level of maternal genetic continuity in this region
• Cultural shifts across eight millennia have not changed the maternal gene pool
Margaryan et al. analyze whole mitochondrial genomes of 206 modern and 52 ancient individuals that represent various cultural groups from the South Caucasus spanning eight millennia. The results clearly indicate genetic continuity of the human maternal gene pool since Neolithic times despite well-documented cultural shifts in the South Caucasus.
Chinese indicine cattle harbor a much higher genetic diversity compared with other domestic cattle, but their genome architecture remains uninvestigated. Using PacBio HiFi sequencing data from 10 Chinese indicine cattle across southern China, we assembled 20 high-quality partially phased genomes and integrated them into a multiassembly graph containing 148.5 Mb (5.6%) of novel sequence. We identified 156,009 high-confidence nonredundant structural variants (SVs) and 206 SV hotspots spanning ∼195 Mb of gene-rich sequence. We detected 34,249 archaic introgressed fragments in Chinese indicine cattle covering 1.93 Gb (73.3%) of the genome. We inferred an average of 3.8%, 3.2%, 1.4%, and 0.5% of introgressed sequence originating, respectively, from banteng-like, kouprey-like, gayal-like, and gaur-like Bos species, as well as 0.6% of unknown origin. Introgression from multiple donors might have contributed to the genetic diversity of Chinese indicine cattle. Altogether, this study highlights the contribution of interspecies introgression to the genomic architecture of an important livestock population and shows how exotic genomic elements can contribute to the genetic variation available for selection.
Genotyping‐by‐sequencing methods such as RADseq are popular for generating genomic and population‐scale data sets from a diverse range of organisms. These organisms often lack a usable reference genome, restricting users to RADseq‐specific software for processing. However, such software comes with limitations compared to generic next generation sequencing (NGS) toolkits. Here, we describe and test a simple pipeline for reference‐free RADseq data processing that blends de novo elements from STACKS with the full suite of state‐of‐the‐art NGS tools. Specifically, we use the de novo RADseq assembly employed by STACKS to create a catalogue of RAD loci that serves as a reference for read mapping, variant calling and site filters. Using RADseq data from 28 zebra sequenced to ~8x depth‐of‐coverage, we evaluate our approach by comparing the site frequency spectra (SFS) to those from alternative pipelines. Most pipelines yielded similar SFS at 8x depth, but only a genotype likelihood based pipeline performed similarly at low sequencing depth (2–4x). We compared the RADseq SFS with medium‐depth (~13x) shotgun sequencing of eight overlapping samples, revealing that the RADseq SFS was persistently slightly skewed towards rare and invariant alleles. Using simulations and human data, we confirm that this is expected when there is allelic dropout (AD) in the RADseq data. AD in the RADseq data caused a heterozygosity deficit of ~16%, which dropped to ~5% after filtering AD. Hence, AD was the most important source of bias in our RADseq data.
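The link between allelic dropout, a skewed SFS, and a heterozygosity deficit can be illustrated with a minimal Python sketch. It is not the pipeline described above: the toy genotype matrix, the function names, and the dropout rule (heterozygotes at an affected locus are miscalled as homozygous for the non-derived allele) are all assumptions made for illustration only.

```python
# Toy illustration only: genotypes are derived-allele counts (0, 1 or 2)
# in a sites x individuals matrix. All names and data are hypothetical.

def site_frequency_spectrum(genotypes):
    """Unfolded SFS: entry k counts the sites with k derived alleles."""
    n_chromosomes = 2 * len(genotypes[0])
    sfs = [0] * (n_chromosomes + 1)
    for site in genotypes:
        sfs[sum(site)] += 1
    return sfs

def observed_heterozygosity(genotypes):
    """Fraction of all genotype calls that are heterozygous (coded 1)."""
    calls = [g for site in genotypes for g in site]
    return sum(1 for g in calls if g == 1) / len(calls)

def apply_allelic_dropout(genotypes, affected_sites):
    """Assumed simplification: at affected sites the derived allele is not
    sequenced in heterozygotes, so they are called as genotype 0."""
    affected = set(affected_sites)
    return [
        [0 if (i in affected and g == 1) else g for g in site]
        for i, site in enumerate(genotypes)
    ]

def heterozygosity_deficit(true_genotypes, observed_genotypes):
    """Relative loss of heterozygosity in the observed calls."""
    h_true = observed_heterozygosity(true_genotypes)
    h_obs = observed_heterozygosity(observed_genotypes)
    return 1 - h_obs / h_true

# Four sites, four diploid individuals (hypothetical data).
TOY_GENOTYPES = [
    [0, 1, 0, 0],
    [1, 1, 0, 2],
    [0, 0, 0, 0],
    [2, 1, 1, 0],
]
TOY_WITH_DROPOUT = apply_allelic_dropout(TOY_GENOTYPES, [3])

if __name__ == "__main__":
    print("SFS without dropout:", site_frequency_spectrum(TOY_GENOTYPES))
    print("SFS with dropout:   ", site_frequency_spectrum(TOY_WITH_DROPOUT))
    print("Heterozygosity deficit:",
          heterozygosity_deficit(TOY_GENOTYPES, TOY_WITH_DROPOUT))
```

In this toy case the affected site moves from the four-copy class to the two-copy class of the SFS, and the share of heterozygous calls falls. This mirrors the direction of the bias reported above, not its magnitude.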
Abstract
African wild pigs have a contentious evolutionary and biogeographic history. Until recently, desert warthog (Phacochoerus aethiopicus) and common warthog (P. africanus) were considered a single species. Molecular evidence surprisingly suggested they diverged at least 4.4 million years ago, and possibly outside of Africa. We sequenced the first whole genomes of four desert warthogs and 35 common warthogs from throughout their range. We show that these two species diverged much later than previously estimated, 400,000–1,700,000 years ago depending on assumptions of gene flow. This brings the divergence estimate into agreement with the paleontological record. We found that the common warthog originated in western Africa and subsequently colonized eastern and southern Africa. During this range expansion, the common warthog interbred with the desert warthog, presumably in eastern Africa, underlining this region’s importance in African biogeography. We found that immune system–related genes may have adaptively introgressed into common warthogs, indicating that resistance to novel diseases was one of the most potent drivers of evolution as common warthogs expanded their range. Hence, we solve some of the key controversies surrounding warthog evolution and reveal a complex evolutionary history involving range expansion, introgression, and adaptation to new diseases.
Strong genetic structure has prompted discussion regarding giraffe taxonomy,1,2,3 including a suggestion to split the giraffe into four species: Northern (Giraffa c. camelopardalis), Reticulated (G. c. reticulata), Masai (G. c. tippelskirchi), and Southern giraffes (G. c. giraffa).4,5,6 However, their evolutionary history is not yet fully resolved, as previous studies used a simple bifurcating model and did not explore the presence or extent of gene flow between lineages. We therefore inferred a model that incorporates various evolutionary processes to assess the drivers of contemporary giraffe diversity. We analyzed whole-genome sequencing data from 90 wild giraffes from 29 localities across their current distribution. The most basal divergence was dated to 280 kya. Genetic differentiation, FST, among major lineages ranged between 0.28 and 0.62, and we found significant levels of ancient gene flow between them. In particular, several analyses suggested that the Reticulated lineage evolved through admixture, with almost equal contribution from the Northern lineage and an ancestral lineage related to Masai and Southern giraffes. These new results highlight a scenario of strong differentiation despite gene flow, providing further context for the interpretation of giraffe diversity and the process of speciation in general. They also illustrate that conservation measures need to target various lineages and sublineages and that separate management strategies are needed to conserve giraffe diversity effectively. Given local extinctions and recent dramatic declines in many giraffe populations, this improved understanding of giraffe evolutionary history is relevant for conservation interventions, including reintroductions and reinforcements of existing populations.
• Giraffes show exceptional genetic structure despite lack of physical barriers
• Giraffes have a complex evolutionary history, with high levels of gene flow
• Reticulated giraffes are a hybrid lineage
• For effective conservation, diversity within giraffes needs to be taken into account
Giraffes consist of four major lineages, which show strong divergence despite being geographically close. Following analyses of whole genomes from 90 wild giraffes from throughout their range, Bertola et al. show that the evolutionary history of giraffes is complex and marked by major gene flow, in particular affecting Reticulated giraffes.
As sequencing technologies become more affordable, it is now realistic to propose studying the evolutionary history of virtually any organism on a genomic scale. However, when dealing with non-model organisms it is not always easy to choose the best approach given a specific biological question, a limited budget, and challenging sample material. Furthermore, although recent advances in technology offer unprecedented opportunities for research in non-model organisms, they also demand unprecedented awareness from the researcher regarding the assumptions and limitations of each method.
In this review we present an overview of the current sequencing technologies and the methods used in typical high-throughput data analysis pipelines. Subsequently, we contextualize high-throughput DNA sequencing technologies within their applications in non-model organism biology. We include tips regarding managing unconventional sample material, comparative and population genetic approaches that do not require fully assembled genomes, and advice on how to deal with low depth sequencing data.
The origin of the elephant on the island of Borneo remains elusive. Research has suggested two alternative hypotheses: the Bornean elephant stems either from a recent introduction in the 17th century or from an ancient colonization several hundred thousand years ago. Lack of elephant fossils has been interpreted as evidence for a very recent introduction, whereas mtDNA divergence from other Asian elephants has been argued to favor an ancient colonization. We investigated the demographic history of Bornean elephants using full-likelihood and approximate Bayesian computation analyses. Our results are at odds with both the recent and ancient colonization hypotheses, and favour a third intermediate scenario. We find that genetic data favour a scenario in which Bornean elephants experienced a bottleneck during the last glacial period, possibly as a consequence of the colonization of Borneo, and from which they have slowly recovered since. Altogether the data support a natural colonization of Borneo by elephants at a time when sea levels were much lower and large terrestrial mammals could colonise from the Sunda shelf. Our results are important not only in understanding the unique history of the colonization of Borneo by elephants, but also for their long-term conservation.
The riverine barrier model suggests that rivers play a significant role in separating widespread organisms into isolated populations. In this study, we used a comparative approach to investigate the phylogeography of 6 didelphid marsupial species in central Brazil. Specifically, we evaluate the role of the mid-Araguaia River in differentiating populations and estimate divergence time among lineages to assess the timing of differentiation of these species, using mitochondrial DNA sequence data. The 6 didelphid marsupials revealed different intraspecific genetic patterns and structure. The 3 larger and more generalist species, Didelphis albiventris, Didelphis marsupialis, and Philander opossum, showed connectivity across the Araguaia River. In contrast, the genetic structure of the 3 smaller and specialist species, Gracilinanus agilis, Marmosa (Marmosa) murina, and Marmosa (Micoureus) demerarae, was shaped by the mid-Araguaia. Moreover, the split of eastern and western bank populations of the 2 latter species is consistent with the age of Araguaia River sediment formation. We hypothesize that the role of the Araguaia as a riverine barrier is linked to the level of ecological specialization among the 6 didelphid species and differences in their ability to cross rivers or disperse through the associated habitat types.