Phylogenetic trees in insects are frequently dated by applying a "standard" mitochondrial DNA (mtDNA) clock estimated at 2.3% My(-1), but despite its wide use reliable calibration points have been ...lacking. Here, we used a well-established biogeographic barrier, the mid-Aegean trench separating the western and eastern Aegean archipelago, to estimate substitution rates in tenebrionid beetles. Cytochrome oxidase I (cox1) for six codistributed genera across 28 islands (444 individuals) on both sides of the mid-Aegean trench revealed 60 independently coalescing entities delimited with a mixed Yule-coalescent model. One representative per entity was used for phylogenetic analysis of mitochondrial (cox1, 16S rRNA) and nuclear (Mp20, 28S rRNA) genes. Six nodes marked geographically congruent east-west splits whose separation was largely contemporaneous and likely to reflect the formation of the mid-Aegean trench at 9-12 Mya. Based on these "known" dates, a divergence rate of 3.54% My(-1) for the cox1 gene (2.69% when combined with the 16S rRNA gene) was obtained under the preferred partitioning scheme and substitution model selected using Bayes factors. An extensive survey suggests that discrepancies in mtDNA substitution rates in the entomological literature can be attributed to the use of different substitution models, the use of different mitochondrial gene regions, mixing of intraspecific with interspecific data, and not accounting for variance in coalescent times or postseparation gene flow. Different treatments of these factors in the literature confound estimates of mtDNA substitution rates in opposing directions and obscure lineage-specific differences in rates when comparing data from various sources.
Extant terrestrial biodiversity arguably is driven by the evolutionary success of angiosperm plants, but the evolutionary mechanisms and timescales of angiosperm-dependent radiations remain poorly ...understood. The Scarabaeoidea is a diverse lineage of predominantly plant- and dung-feeding beetles. Here, we present a phylogenetic analysis of Scarabaeoidea based on four DNA markers for a taxonomically comprehensive set of specimens and link it to recently described fossil evidence. The phylogeny strongly supports multiple origins of coprophagy, phytophagy and anthophagy. The ingroup-based fossil calibration of the tree widely confirmed a Jurassic origin of the Scarabaeoidea crown group. The crown groups of phytophagous lineages began to radiate first (Pleurostict scarabs: 108 Ma; Glaphyridae between 101 Ma), followed by the later diversification of coprophagous lineages (crown-group age Scarabaeinae: 76 Ma; Aphodiinae: 50 Ma). Pollen feeding arose even later, at maximally 62 Ma in the oldest anthophagous lineage. The clear time lag between the origins of herbivores and coprophages suggests an evolutionary path driven by the angiosperms that first favoured the herbivore fauna (mammals and insects) followed by the secondary radiation of the dung feeders. This finding makes it less likely that extant dung beetle lineages initially fed on dinosaur excrements, as often hypothesized.
DNA-based species delimitation may be compromised by limited sampling effort and species rarity, including "singleton" representatives of species, which hampers estimates of intra-versus interspecies ...evolutionary processes. In a case study of southern African chafers (beetles in the family Scarabaeidae), many species and subclades were poorly represented and 48.5% of species were singletons. Using cox1 sequences from >500 specimens and ~100 species, the Generalized Mixed Yule Coalescent (GMYC) analysis as well as various other approaches for DNA-based species delimitation (Automatic Barcode Gap Discovery (ABGD), Poisson tree processes (PTP), Species Identifier, Statistical Parsimony), frequently produced poor results if analyzing a narrow target group only, but the performance improved when several subclades were combined. Hence, low sampling may be compensated for by "clade addition" of lineages outside of the focal group. Similar findings were obtained in reanalysis of published data sets of taxonomically poorly known species assemblages of insects from Madagascar. The low performance of undersampled trees is not due to high proportions of singletons per se, as shown in simulations (with 13%, 40% and 52% singletons). However, the GMYC method was highly sensitive to variable effective population size (Ne), which was exacerbated by variable species abundances in the simulations. Hence, low sampling success and rarity of species affect the power of the GMYC method only if they reflect great differences in Ne among species. Potential negative effects of skewed species abundances and prevalence of singletons are ultimately an issue about the variation in Ne and the degree to which this is correlated with the census population size and sampling success. Clade addition beyond a limited study group can overcome poor sampling for the GMYC method in particular under variable Ne. This effect was less pronounced for methods of species delimitation not based on coalescent models.
Eight years after DNA barcoding was formally proposed on a large scale, COI sequences are rapidly accumulating from around the world. While studies to date have mostly targeted local or regional ...species assemblages, the recent launch of the global iBOL project (International Barcode of Life), highlights the need to understand the effects of geographical scale on Barcoding's goals. Sampling has been central in the debate on DNA Barcoding, but the effect of the geographical scale of sampling has not yet been thoroughly and explicitly tested with empirical data. Here, we present a COI data set of aquatic predaceous diving beetles of the tribe Agabini, sampled throughout Europe, and use it to investigate how the geographic scale of sampling affects 1) the estimated intraspecific variation of species, 2) the genetic distance to the most closely related heterospecific, 3) the ratio of intraspecific and interspecific variation, 4) the frequency of taxonomically recognized species found to be monophyletic, and 5) query identification performance based on 6 different species assignment methods. Intraspecific variation was significantly correlated with the geographical scale of sampling (R-square = 0.7), and more than half of the species with 10 or more sampled individuals (N = 29) showed higher intraspecific variation than 1% sequence divergence. In contrast, the distance to the closest heterospecific showed a significant decrease with increasing geographical scale of sampling. The average genetic distance dropped from > 7% for samples within 1 km, to < 3.5% for samples up to > 6000 km apart. Over a third of the species were not monophyletic, and the proportion increased through locally, nationally, regionally, and continentally restricted subsets of the data. The success of identifying queries decreased with increasing spatial scale of sampling; liberal methods declined from 100% to around 90%, whereas strict methods dropped to below 50% at continental scales. The proportion of query identifications considered uncertain (more than one species < 1% distance from query) escalated from zero at local, to 50% at continental scale. Finally, by resampling the most widely sampled species we show that even if samples are collected to maximize the geographical coverage, up to 70 individuals are required to sample 95% of intraspecific variation. The results show that the geographical scale of sampling has a critical impact on the global application of DNA barcoding. Scale-effects result from the relative importance of different processes determining the composition of regional species assemblages (dispersal and ecological assembly) and global clades (demography, speciation, and extinction). The incorporation of geographical information, where available, will be required to obtain identification rates at global scales equivalent to those in regional barcoding studies. Our result hence provides an impetus for both smarter barcoding tools and sprouting national barcoding initiatives— smaller geographical scales deliver higher accuracy.
Plastoceridae Crowson, 1972, Drilidae Blanchard, 1845 and Omalisidae Lacordaire, 1857 (Elateroidea) are families of the Coleoptera with obscure phylogenetic relationships and modified morphology ...showing neotenic traits such as soft bodies, reduced wing cases and larviform females. We shotgun sequenced genomes of Plastocerus, Drilus and Omalisus and incorporated them into data matrices of 66 and 4202 single-copy nuclear genes representing Elateroidea. Phylogenetic analyses indicate their terminal positions within the broadly defined well-sclerotized and fully metamorphosed Elateridae and thus Omalisidae should now be considered as Omalisinae stat. nov. in Elateridae Leach, 1815. The results support multiple independent origins of incomplete metamorphosis in Elateridae and indicate the parallel evolution of morphological and ecological traits. Unlike other neotenic elateroids derived from the supposedly pre-adapted aposematically coloured and unpalatable soft-bodied elateroids, such as fireflies (Lampyridae) and net-winged beetles (Lycidae), omalisids and drilids evolved from well-sclerotized click beetles. These findings suggest sudden morphological shifts through incomplete metamorphosis, with important implications for macroevolution, including reduced speciation rate and high extinction risk in unstable habitats. Precise phylogenetic placement is necessary for studies of the molecular mechanisms of ontogenetic shifts leading to profoundly changed morphology.
Faecal samples are of great value as a non‐invasive means to gather information on the genetics, distribution, demography, diet and parasite infestation of endangered species. Direct shotgun ...sequencing of faecal DNA could give information on these simultaneously, but this approach is largely untested. Here, we used two faecal samples to characterize the diet of two red‐shanked doucs langurs (Pygathrix nemaeus) that were fed known foliage, fruits, vegetables and cereals. Illumina HiSeq produced ~74 and 67 million paired reads for these samples, of which ~10 000 (0.014%) and ~44 000 (0.066%), respectively, were of chloroplast origin. Sequences were matched against a database of available chloroplast ‘barcodes’ for angiosperms. The results were compared with ‘metabarcoding’ using PCR amplification of the P6 loop of trnL. Metagenomics identified seven and nine of the likely 16 diet plants while six and five were identified by metabarcoding. Metabarcoding produced thousands of reads consistent with the known diet, but the barcodes were too short to identify several plant species to genus. Metagenomics utilized multiple, longer barcodes that combined had greater power of identification. However, rare diet items were not recovered. Read numbers for diet species in metagenomic and metabarcoding data were correlated, indicating that both are useful for determining relative sequence abundance. Metagenomic reads were uniformly distributed across the chloroplast genomes; thus, if chloroplast genomes were used as reference, the precision of identifications and species recovery would improve further. Metagenomics also recovered the host mitochondrial genome and numerous intestinal parasite sequences in addition to generating data useful for characterizing the microbiome.
Disentangling the relative role of environmental filtering and spatial processes in driving metacommunity structure across mountainous regions remains challenging, as the way we quantify spatial ...connectivity in topographically and environmentally heterogeneous landscapes can influence our perception of which process predominates. More empirical data sets are required to account for taxon‐ and context‐dependency, but relevant research in understudied areas is often compromised by the taxonomic impediment. Here we used haplotype‐level community DNA metabarcoding, enabled by stringent filtering of amplicon sequence variants (ASVs), to characterize metacommunity structure of soil microarthropod assemblages across a mosaic of five forest habitats on the Troodos mountain range in Cyprus. We found similar β diversity patterns at ASV and species (OTU, operational taxonomic unit) levels, which pointed to a primary role of habitat filtering resulting in the existence of largely distinct metacommunities linked to different forest types. Within‐habitat turnover was correlated to topoclimatic heterogeneity, again emphasizing the role of environmental filtering. However, when integrating landscape matrix information for the highly fragmented Quercus alnifolia habitat, we also detected a major role of spatial isolation determined by patch connectivity, indicating that stochastic and niche‐based processes synergistically govern community assembly. Alpha diversity patterns varied between ASV and OTU levels, with OTU richness decreasing with elevation and ASV richness following a longitudinal gradient, potentially reflecting a decline of genetic diversity eastwards due to historical pressures. Our study demonstrates the utility of haplotype‐level community metabarcoding for characterizing metacommunity structure of complex assemblages and improving our understanding of biodiversity dynamics across mountainous landscapes worldwide.
Mitochondrial genome sequences are important markers for phylogenetics but taxon sampling remains sporadic because of the great effort and cost required to acquire full-length sequences. Here, we ...demonstrate a simple, cost-effective way to sequence the full complement of protein coding mitochondrial genes from pooled samples using the 454/Roche platform. Multiplexing was achieved without the need for expensive indexing tags ('barcodes'). The method was trialled with a set of long-range polymerase chain reaction (PCR) fragments from 30 species of Coleoptera (beetles) sequenced in a 1/16th sector of a sequencing plate. Long contigs were produced from the pooled sequences with sequencing depths ranging from ~10 to 100x per contig. Species identity of individual contigs was established via three 'bait' sequences matching disparate parts of the mitochondrial genome obtained by conventional PCR and Sanger sequencing. This proved that assembly of contigs from the sequencing pool was correct. Our study produced sequences for 21 nearly complete and seven partial sets of protein coding mitochondrial genes. Combined with existing sequences for 25 taxa, an improved estimate of basal relationships in Coleoptera was obtained. The procedure could be employed routinely for mitochondrial genome sequencing at the species level, to provide improved species 'barcodes' that currently use the cox1 gene only.
An international consortium of major natural history museums, herbaria and other organizations has launched an ambitious project, the 'Barcode of Life Initiative', to promote a process enabling the ...rapid and inexpensive identification of the estimated 10 million species on Earth. DNA barcoding is a diagnostic technique in which short DNA sequence(s) can be used for species identification. The first international scientific conference on Barcoding of Life was held at the Natural History Museum in London in February 2005, and here we review the scientific challenges discussed during this conference and in previous publications. Although still controversial, the scientific benefits of DNA barcoding include: (i) enabling species identification, including any life stage or fragment, (ii) facilitating species discoveries based on cluster analyses of gene sequences (e.g. cox1=CO1, in animals), (iii) promoting development of handheld DNA sequencing technology that can be applied in the field for biodiversity inventories and (iv) providing insight into the diversity of life.
Neoteny, the maintenance of larval features in sexually mature adults, is a radical way of generating evolutionary novelty through shifts in relative timing of developmental programmes. While ...controlled by the environment in facultative neotenics, retention of larval features is obligatory in many species of Lycidae (net-winged beetles). They are studied here as an example of how developmental shifts and ecology interact to produce macroevolutionary impacts. We conducted a phylogenetic analysis of Lycidae based on DNA sequences from nuclear (18S and 28S rRNA) and mitochondrial (rrnL, cox1, cob and nad5) genes from a representative set of lineages (73 species), including 17 neotenic taxa. Major changes of basal relationships compared with those implied in the current classification generally supported three independent origins of neotenics in Lycidae. The southeast Asian Lyropaeinae and Ateliinae were in basal positions indicating evolutionary antiquity, also confirmed by molecular clock estimates, unlike the neotropical leptolycines nested within Calopterini and presumably much younger. neotenics exhibit typical K-selected traits including slow development, large body size, high investment in offspring and low dispersal. This correlated with low species richness and restricted ranges of neotenic lineages compared with their sisters. Yet, these factors did not impede the evolutionary persistence of affected lineages, even without reversals to fully metamorphosed forms, contradicting earlier suggestions of recent evolution from dispersive non-neotenics.