Increasing our understanding of the genetic architecture of complex traits, through analyses of genotype-phenotype associations and of the genes/polymorphisms accounting for trait variation, is ...crucial, to improve the integration of molecular markers into forest tree breeding. In this study, two full-sib families and one breeding population of maritime pine were used to identify quantitative trait loci (QTLs) for height growth and stem straightness, through linkage analysis (LA) and linkage disequilibrium (LD) mapping approaches.
The populations used for LA consisted of two unrelated three-generation full-sib families (n = 197 and n = 477). These populations were assessed for height growth or stem straightness and genotyped for 248 and 217 markers, respectively. The population used for LD mapping consisted of 661 founders of the first and second generations of the breeding program. This population was phenotyped for the same traits and genotyped for 2,498 single-nucleotide polymorphism (SNP) markers corresponding to 1,652 gene loci. The gene-based reference genetic map of maritime pine was used to localize and compare the QTLs detected by the two approaches, for both traits. LA identified three QTLs for stem straightness and two QTLs for height growth. The LD study yielded seven significant associations (P ≤ 0.001): four for stem straightness and three for height growth. No colocalisation was found between QTLs identified by LA and SNPs detected by LD mapping for the same trait.
This study provides the first comparison of LA and LD mapping approaches in maritime pine, highlighting the complementary nature of these two approaches for deciphering the genetic architecture of two mandatory traits of the breeding program.
Abstract
Next‐generation biomonitoring proposes to combine machine‐learning algorithms with environmental DNA data to automate the monitoring of the Earth's major ecosystems. In the present study, we ...searched for molecular biomarkers of tree water status to develop next‐generation biomonitoring of forest ecosystems. Because phyllosphere microbial communities respond to both tree physiology and climate change, we investigated whether environmental DNA data from tree phyllosphere could be used as molecular biomarkers of tree water status in forest ecosystems. Using an amplicon sequencing approach, we analysed phyllosphere microbial communities of four tree species (
Quercus ilex
,
Quercus robur
,
Pinus pinaster
and
Betula pendula
) in a forest experiment composed of irrigated and non‐irrigated plots. We used these microbial community data to train a machine‐learning algorithm (Random Forest) to classify irrigated and non‐irrigated trees. The Random Forest algorithm detected tree water status from phyllosphere microbial community composition with more than 90% accuracy for oak species, and more than 75% for pine and birch. Phyllosphere fungal communities were more informative than phyllosphere bacterial communities in all tree species. Seven fungal amplicon sequence variants were identified as candidates for the development of molecular biomarkers of water status in oak trees. Altogether, our results show that microbial community data from tree phyllosphere provides information on tree water status in forest ecosystems and could be included in next‐generation biomonitoring programmes that would use in situ, real‐time sequencing of environmental DNA to help monitor the health of European temperate forest ecosystems.
Most existing forests are subjected to natural and human‐mediated selection pressures, which have increased due to climate change and the increasing needs of human societies for wood, fibre and fuel ...resources. It remains largely unknown how these pressures trigger evolutionary changes. We address this issue here for temperate European oaks (Quercus petraea and Q. robur), which grow in mixed stands, under even‐aged management regimes. We screened numerous functional traits for univariate selection gradients and for expected and observed genetic changes over two successive generations. In both species, growth, leaf morphology and physiology, and defence‐related traits displayed significant selection gradients and predicted shifts, whereas phenology, water metabolism, structure and resilience‐related traits did not. However, the direction of the selection response and the potential for adaptive evolution differed between the two species. Quercus petraea had a much larger phenotypic and genetic variance of fitness than Q. robur. This difference raises concerns about the adaptive response of Q. robur to contemporary selection pressures. Our investigations suggest that Q. robur will probably decline steadily, particularly in mixed stands with Q. petraea, consistent with the contrasting demographic dynamics of the two species.
Abstract Key message Mining genome-wide DNA sequences enabled the discovery of near-diagnostic markers for species assignment in four European white oaks ( Quercus petraea (Matt.) Liebl., Quercus ...pubescens Willd., Quercus pyrenaica Willd., and Quercus robur L.) despite their low interspecific differentiation. Near-diagnostic markers are almost fully fixed in one species and absent in the three others. As a result, only a handful of markers are needed for species identification, making this genetic assay a very promising operational taxonomic assignment procedure in research and forestry. Context Identifying species in the European white oak complex has been a long-standing concern in taxonomy, evolution, forest research, and management. Quercus petraea (Matt.) Liebl., Q. robur L., Q. pubescens Willd., and Q. pyrenaica Willd. are part of this species complex in western temperate Europe and hybridize in mixed stands, challenging species identification. Aims Our aim was to identify near-diagnostic single-nucleotide polymorphisms (SNPs) for each of the four species that are suitable for routine use and rapid diagnosis in research and applied forestry. Methods We first scanned existing whole-genome and target-capture data sets in a reduced number of samples (training set) to identify candidate diagnostic SNPs, i.e., genomic positions being characterized by a reference allele in one species and by the alternative allele in all other species. Allele frequencies of the candidates SNPs were then explored in a larger, range-wide sample of populations in each species (validation step). Results We found a subset of 38 SNPs (10 for Q. petraea , 7 for Q. pubescens , 9 for Q. pyrenaica , and 12 for Q. robur ) that showed near-diagnostic features across their species distribution ranges with Q. pyrenaica and Q. pubescens exhibiting the highest (0.876) and lowest (0.747) diagnosticity, respectively. Conclusions We provide a new, efficient, and reliable molecular tool for the identification of the species Q. petraea , Q. robur , Q. pubescens , and Q. pyrenaica , which can be used as a routine tool in forest research and management. This study highlights the resolution offered by whole-genome sequencing data to design near-diagnostic marker sets for taxonomic assignment, even for species complexes with relatively low differentiation.
Microgeographical adaptation occurs when the effects of directional selection persist despite gene flow. Traits and genetic loci under selection can then show adaptive divergence, against the ...backdrop of little differentiation at other traits or loci. How common such events are and how strong the selection is that underlies them remain open questions. Here, we discovered and analysed microgeographical patterns of genomic divergence in four European and Mediterranean conifers with widely differing life‐history traits and ecological requirements (Abies alba MIll., Cedrus atlantica Endl. Manetti, Pinus halepensis Mill. and Pinus pinaster Aiton) by screening pairs from geographically close forest stands sampled along steep ecological gradients. We inferred patterns of genomic divergence by applying a combination of divergence outlier detection methods, demographic modelling, Approximate Bayesian Computation inferences and genomic annotation to genomic data. Surprisingly for such small geographical scales, we showed that selection is strong in all species but generally affects different loci in each. A clear signature of selection was systematically detected on a fraction of the genome, of the order of 0.1%–1% of the loci depending on the species. The novel modelling method we designed for estimating selection coefficients showed that the microgeographical selection coefficient scaled by population size (Ns) was 2–30. Our results convincingly suggest that selection maintains within‐population diversity at microgeographical scales in spatially heterogeneous environments. Such genetic diversity is likely to be a major reservoir of adaptive potential, helping populations to adapt under fluctuating environmental conditions.
Anticipating the evolutionary responses of long-lived organisms, such as trees, to environmental changes, requires the assessment of genetic variation of adaptive traits in natural populations. To ...this end, high-density markers are needed to calculate genomic relatedness between individuals allowing to estimate the genetic variance of traits in wild populations. We designed a targeted capture-based, next-generation sequencing assay based on the highly heterozygous pedunculate oak (
) reference genome, for the sequencing of 3 Mb of genic and intergenic regions. Using a mixed stand of 293
and
genotypes we successfully captured over 97% of the target sequences, corresponding to 0.39% of the oak genome, with sufficient depth (97×) for the detection of about 190,000 SNPs evenly spread over the targeted regions. We validated the technique by evaluating its reproducibility, and comparing the genomic relatedness of trees with their known pedigree relationship. We explored the use of the technique on other related species and highlighted the advantages and limitations of this approach. We found that 92.07% of target sequences in
and 70.36% of sequences in
were captured. We used this SNP resource to estimate genetic relatedness in the mixed oak stand. Mean pairwise genetic relatedness was low within each species with a few values exceeding 0.25 (half-sibs) or 0.5 (full-sibs). Finally, we applied the technique to a long-standing issue in population genetics of trees regarding the relationship between inbreeding and components of fitness. We found very weak signals for inbreeding depression for reproductive success and no signal for growth within both species.
The accessibility of high-throughput genotyping technologies has contributed greatly to the development of genomic resources in non-model organisms. High-density genotyping arrays have only recently ...been developed for some economically important species such as conifers. The potential for using genomic technologies in association mapping and breeding depends largely on the genome wide patterns of diversity and linkage disequilibrium in current breeding populations. This study aims to deepen our knowledge regarding these issues in maritime pine, the first species used for reforestation in south western Europe.
Using a new map merging algorithm, we first established a 1,712 cM composite linkage map (comprising 1,838 SNP markers in 12 linkage groups) by bringing together three already available genetic maps. Using rigorous statistical testing based on kernel density estimation and resampling we identified cold and hot spots of recombination. In parallel, 186 unrelated trees of a mass-selected population were genotyped using a 12k-SNP array. A total of 2,600 informative SNPs allowed to describe historical recombination, genetic diversity and genetic structure of this recently domesticated breeding pool that forms the basis of much of the current and future breeding of this species. We observe very low levels of population genetic structure and find no evidence that artificial selection has caused a reduction in genetic diversity. By combining these two pieces of information, we provided the map position of 1,671 SNPs corresponding to 1,192 different loci. This made it possible to analyze the spatial pattern of genetic diversity (He) and long distance linkage disequilibrium (LD) along the chromosomes. We found no particular pattern in the empirical variogram of He across the 12 linkage groups and, as expected for an outcrossing species with large effective population size, we observed an almost complete lack of long distance LD.
These results are a stepping stone for the development of strategies for studies in population genomics, association mapping and genomic prediction in this economical and ecologically important forest tree species.
Testing how populations are locally adapted and predicting their response to their future environment is of key importance in view of climate change. Landscape genomics is a powerful approach to ...investigate genes and environmental factors involved in local adaptation. In a pooled amplicon sequencing approach of 94 genes in 71 populations, we tested whether >3500 single nucleotide polymorphisms (SNPs) in the three most common oak species in Switzerland (Quercus petraea, Q. pubescens, Q. robur) show an association with abiotic factors related to local topography, historical climate and soil characteristics. In the analysis including all species, the most frequently associated environmental factors were those best describing the habitats of the species. In the species‐specific analyses, the most important environmental factors and associated SNPs greatly differed among species. However, we identified one SNP and seven genes that were associated with the same environmental factor across all species. We finally used regressions of allele frequencies of the most strongly associated SNPs along environmental gradients to predict the risk of nonadaptedness (RONA), which represents the average change in allele frequency at climate‐associated loci theoretically required to match future climatic conditions. RONA is considerable for some populations and species (up to 48% in single populations) and strongly differs among species. Given the long generation time of oaks, some of the required allele frequency changes might not be realistic to achieve based on standing genetic variation. Hence, future adaptedness requires gene flow or planting of individuals carrying beneficial alleles from habitats currently matching future climatic conditions.
The European Beech is the dominant climax tree in most regions of Central Europe and valued for its ecological versatility and hardwood timber. Even though a draft genome has been published recently, ...higher resolution is required for studying aspects of genome architecture and recombination. Here, we present a chromosome-level assembly of the more than 300 year-old reference individual, Bhaga, from the Kellerwald-Edersee National Park (Germany). Its nuclear genome of 541 Mb was resolved into 12 chromosomes varying in length between 28 and 73 Mb. Multiple nuclear insertions of parts of the chloroplast genome were observed, with one region on chromosome 11 spanning more than 2 Mb which fragments up to 54,784 bp long and covering the whole chloroplast genome were inserted randomly. Unlike in
, ribosomal cistrons are present in
only in four major regions, in line with FISH studies. On most assembled chromosomes, telomeric repeats were found at both ends, while centromeric repeats were found to be scattered throughout the genome apart from their main occurrence per chromosome. The genome-wide distribution of SNPs was evaluated using a second individual from Jamy Nature Reserve (Poland). SNPs, repeat elements and duplicated genes were unevenly distributed in the genomes, with one major anomaly on chromosome 4. The genome presented here adds to the available highly resolved plant genomes and we hope it will serve as a valuable basis for future research on genome architecture and for understanding the past and future of European Beech populations in a changing climate.
Effective population size (Ne) is a pivotal evolutionary parameter with crucial implications in conservation practice and policy. Genetic methods to estimate Ne have been preferred over demographic ...methods because they rely on genetic data rather than time‐consuming ecological monitoring. Methods based on linkage disequilibrium (LD), in particular, have become popular in conservation as they require a single sampling and provide estimates that refer to recent generations. A software program based on the LD method, GONE, looks particularly promising to estimate contemporary and recent‐historical Ne (up to 200 generations in the past). Genomic datasets from non‐model species, especially plants, may present some constraints to the use of GONE, as linkage maps and reference genomes are seldom available, and SNP genotyping is usually based on reduced‐representation methods. In this study, we use empirical datasets from four plant species to explore the limitations of plant genomic datasets when estimating Ne using the algorithm implemented in GONE, in addition to exploring some typical biological limitations that may affect Ne estimation using the LD method, such as the occurrence of population structure. We show how accuracy and precision of Ne estimates potentially change with the following factors: occurrence of missing data, limited number of SNPs/individuals sampled, and lack of information about the location of SNPs on chromosomes, with the latter producing a significant bias, previously unexplored with empirical data. We finally compare the Ne estimates obtained with GONE for the last generations with the contemporary Ne estimates obtained with the programs currentNe and NeEstimator.