Patterns of genetic diversity between populations are often used to detect loci under selection in genome scans. Indeed, loci involved in local adaptations should show high F(ST) values, whereas loci ...under balancing selection should rather show low F(ST) values. Most tests of selection based on F(ST) use a null distribution generated under a simple island model of population differentiation. Although this model has been shown to be robust, many species have a more complex genetic structure, with some populations sharing a recent ancestry or due to the presence of barriers to gene flow between different parts of a species range. In this paper, we propose the use of a hierarchical island model, in which demes exchange more migrants within groups than between groups, to generate the joint distribution of genetic diversity within and between populations. We show that tests not accounting for a hierarchical structure, when it exists, do generate a large excess of false positive loci, whereas the hierarchical island model is robust to uncertainties about the exact number of groups and demes per group in the system. Our approach also explicitly takes into account the mutational process, and does not just rely on allele frequencies, which is important for short tandem repeat (STR) data. An application to human and stickleback STR data sets reveals a much lower number of significant loci than previously obtained under a non-hierarchical model. The elimination of false positive loci from genome scans should allow us to better determine on which specific class of genes selection is operating.
We investigate the effect of spatial range expansions on the evolution of fitness when beneficial and deleterious mutations cosegregate. We perform individual‐based simulations of 1D and 2D range ...expansions and complement them with analytical approximations for the evolution of mean fitness at the edge of the expansion. We find that deleterious mutations accumulate steadily on the wave front during range expansions, thus creating an expansion load. Reduced fitness due to the expansion load is not restricted to the wave front, but occurs over a large proportion of newly colonized habitats. The expansion load can persist and represent a major fraction of the total mutation load for thousands of generations after the expansion. The phenomenon of expansion load may explain growing evidence that populations that have recently expanded, including humans, show an excess of deleterious mutations. To test the predictions of our model, we analyse functional genetic diversity in humans and find patterns that are consistent with our model.
We present a new approach for defining groups of populations that are geographically homogeneous and maximally differentiated from each other. As a by‐product, it also leads to the identification of ...genetic barriers between these groups. The method is based on a simulated annealing procedure that aims to maximize the proportion of total genetic variance due to differences between groups of populations (spatial analysis of molecular variance; samova). Monte Carlo simulations were used to study the performance of our approach and, for comparison, the behaviour of the Monmonier algorithm, a procedure commonly used to identify zones of sharp genetic changes in a geographical area. Simulations showed that the samova algorithm indeed finds maximally differentiated groups, which do not always correspond to the simulated group structure in the presence of isolation by distance, especially when data from a single locus are available. In this case, the Monmonier algorithm seems slightly better at finding predefined genetic barriers, but can often lead to the definition of groups of populations not differentiated genetically. The samova algorithm was then applied to a set of European roe deer populations examined for their mitochondrial DNA (mtDNA) HVRI diversity. The inferred genetic structure seemed to confirm the hypothesis that some Italian populations were recently reintroduced from a Balkanic stock, as well as the differentiation of groups of populations possibly due to the postglacial recolonization of Europe or the action of a specific barrier to gene flow.
We investigate the effect of habitat fragmentation on the genetic diversity of a species experiencing a range expansion. These two evolutionary processes have not been studied yet, at the same time, ...owing to the difficulties of deriving analytic results for non-equilibrium models. Here we provide a description of their interaction by using extensive spatial and temporal coalescent simulations and we suggest guidelines for a proper genetic sampling to detect fragmentation. To model habitat fragmentation, we simulated a two-dimensional lattice of demes partitioned into groups (patches) by adding barriers to dispersal. After letting a population expand on this grid, we sampled lineages from the lattice at several scales and studied their coalescent history. We find that in order to detect fragmentation, one needs to extensively sample at a local level rather than at a landscape level. This is because the gene genealogy of a scattered sample is less sensitive to the presence of genetic barriers. Considering the effect of temporal changes of fragmentation intensities, we find that at least 10, but often >100, generations are needed to affect local genetic diversity and population structure. This result explains why recent habitat fragmentation does not always lead to detectable signatures in the genetic structure of populations. Finally, as expected, long-distance dispersal increases local genetic diversity and decreases levels of population differentiation, efficiently counteracting the effects of fragmentation.
Due to an almost complete absence of fossil record, the evolutionary history of chimpanzees has only been studied recently on the basis of genetic data. Although the general topology of the ...chimpanzee phylogeny is well established, uncertainties remain concerning the size of current and past populations, the occurrence of bottlenecks or population expansions, or about divergence times and migrations rates between subspecies. Here, we present a novel attempt at globally inferring the detailed evolution of the Pan genus based on approximate Bayesian computation, an approach preferentially applied to complex models where the likelihood cannot be computed analytically. Based on two microsatellite and DNA sequence data sets and adjusting simulated data for local levels of inbreeding and patterns of missing data, we find support for several new features of chimpanzee evolution as compared with previous studies based on smaller data sets and simpler evolutionary models. We find that the central chimpanzees are certainly the oldest population of all P. troglodytes subspecies and that the other two P. t. subspecies diverged from the central chimpanzees by founder events. We also find an older divergence time (1.6 million years My) between common chimpanzee and Bonobos than previous studies (0.9-1.3 My), but this divergence appears to have been very progressive with the maintenance of relatively high levels of gene flow between the ancestral chimpanzee population and the Bonobos. Finally, we could also confirm the existence of strong unidirectional gene flow from the western into the central chimpanzee. These results show that interesting and innovative features of chimpanzee history emerge when considering their whole evolutionary history in a single analysis, rather than relying on simpler models involving several comparisons of pairs of populations.
Gene set enrichment approaches have been increasingly successful in finding signals of recent polygenic selection in the human genome. In this study, we aim at detecting biological pathways affected ...by positive selection in more ancient human evolutionary history. Focusing on four branches of the primate tree that lead to modern humans, we tested all available protein coding gene trees of the Primates clade for signals of adaptation in these branches, using the likelihood-based branch site test of positive selection. The results of these locus-specific tests were then used as input for a gene set enrichment test, where whole pathways are globally scored for a signal of positive selection, instead of focusing only on outlier "significant" genes. We identified signals of positive selection in several pathways that are mainly involved in immune response, sensory perception, metabolism, and energy production. These pathway-level results are highly significant, even though there is no functional enrichment when only focusing on top scoring genes. Interestingly, several gene sets are found significant at multiple levels in the phylogeny, but different genes are responsible for the selection signal in the different branches. This suggests that the same function has been optimized in different ways at different times in primate evolution.
Summary
Several studies have found strikingly different allele frequencies between continents. This has been mainly interpreted as being due to local adaptation. However, demographic factors can ...generate similar patterns. Namely, allelic surfing during a population range expansion may increase the frequency of alleles in newly colonised areas. In this study, we examined 772 STRs, 210 diallelic indels, and 2834 SNPs typed in 53 human populations worldwide under the HGDP‐CEPH Diversity Panel to determine to which extent allele frequency differs among four regions (Africa, Eurasia, East Asia, and America). We find that large allele frequency differences between continents are surprisingly common, and that Africa and America show the largest number of loci with extreme frequency differences. Moreover, more STR alleles have increased rather than decreased in frequency outside Africa, as expected under allelic surfing. Finally, there is no relationship between the extent of allele frequency differences and proximity to genes, as would be expected under selection. We therefore conclude that most of the observed large allele frequency differences between continents result from demography rather than from positive selection.
Recent studies of large portions of the human genome support a recent origin of modern humans from an African stock after a bottleneck of moderate size followed by a range expansion out of Africa. ...Under this simple scenario, patterns of molecular diversity suggest that balancing selection could be more prevalent than positive selection in coding regions.
Studies of large portions of the human genome support a recent origin of modern humans from an African stock after a bottleneck of moderate size followed by a range expansion out of Africa.
Despite its often featureless appearance, the deep-ocean floor includes some of the most diverse habitats on Earth. However, the accurate assessment of global deep-sea diversity is impeded by a ...paucity of data on the geographical ranges of bottom-dwelling species, particularly at the genetic level. Here, we present molecular evidence for exceptionally wide distribution of benthic foraminifera, which constitute the major part of deep-sea meiofauna. Our analyses of nuclear ribosomal RNA genes revealed high genetic similarity between Arctic and Antarctic populations of three common deep-sea foraminiferal species (Epistominella exigua, Cibicides wuellerstorfi and Oridorsalis umbonatus), separated by distances of up to 17 000 km. Our results contrast with the substantial level of cryptic diversity usually revealed by molecular studies, of shallow-water benthic and planktonic marine organisms. The very broad ranges of the deep-sea foraminifera that we examined support the hypothesis of global distribution of small eukaryotes and suggest that deep-sea biodiversity may be more modest at global scales than present estimates suggest.