While several studies have investigated general properties of the genetic architecture of natural variation in gene expression, few of these have considered natural, outbreeding populations. In ...parallel, systems biology has established that a general feature of biological networks is that they are scale-free, rendering them buffered against random mutations. To date, few studies have attempted to examine the relationship between the selective processes acting to maintain natural variation of gene expression and the associated co-expression network structure. Here we utilised RNA-Sequencing to assay gene expression in winter buds undergoing bud flush in a natural population of Populus tremula, an outbreeding forest tree species. We performed expression Quantitative Trait Locus (eQTL) mapping and identified 164,290 significant eQTLs associating 6,241 unique genes (eGenes) with 147,419 unique SNPs (eSNPs). We found approximately four times as many local as distant eQTLs, with local eQTLs having significantly higher effect sizes. eQTLs were primarily located in regulatory regions of genes (UTRs or flanking regions), regardless of whether they were local or distant. We used the gene expression data to infer a co-expression network and investigated the relationship between network topology, the genetic architecture of gene expression and signatures of selection. Within the co-expression network, eGenes were underrepresented in network module cores (hubs) and overrepresented in the periphery of the network, with a negative correlation between eQTL effect size and network connectivity. We additionally found that module core genes have experienced stronger selective constraint on coding and non-coding sequence, with connectivity associated with signatures of selection. Our integrated genetics and genomics results suggest that purifying selection is the primary mechanism underlying the genetic architecture of natural variation in gene expression assayed in flushing leaf buds of P. tremula and that connectivity within the co-expression network is linked to the strength of purifying selection.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Despite the global economic and ecological importance of forest trees, the genomic basis of differential adaptation and speciation in tree species is still poorly understood. Populus tremula and ...Populus tremuloides are two of the most widespread tree species in the Northern Hemisphere. Using whole-genome re-sequencing data of 24 P. tremula and 22 P. tremuloides individuals, we find that the two species diverged ∼2.2-3.1 million years ago, coinciding with the severing of the Bering land bridge and the onset of dramatic climatic oscillations during the Pleistocene. Both species have experienced substantial population expansions following long-term declines after species divergence. We detect widespread and heterogeneous genomic differentiation between species, and in accordance with the expectation of allopatric speciation, coalescent simulations suggest that neutral evolutionary processes can account for most of the observed patterns of genetic differentiation. However, there is an excess of regions exhibiting extreme differentiation relative to those expected under demographic simulations, which is indicative of the action of natural selection. Overall genetic differentiation is negatively associated with recombination rate in both species, providing strong support for a role of linked selection in generating the heterogeneous genomic landscape of differentiation between species. Finally, we identify a number of candidate regions and genes that may have been subject to positive and/or balancing selection during the speciation process.
Abstract
Hybridization and resulting introgression are important processes shaping the tree of life and appear to be far more common than previously thought. However, how the genome evolution was ...shaped by various genetic and evolutionary forces after hybridization remains unresolved. Here we used whole-genome resequencing data of 227 individuals from multiple widespread Populus species to characterize their contemporary patterns of hybridization and to quantify genomic signatures of past introgression. We observe a high frequency of contemporary hybridization and confirm that multiple previously ambiguous species are in fact F1 hybrids. Seven species were identified, which experienced different demographic histories that resulted in strikingly varied efficacy of selection and burdens of deleterious mutations. Frequent past introgression has been found to be a pervasive feature throughout the speciation of these Populus species. The retained introgressed regions, more generally, tend to contain reduced genetic load and to be located in regions of high recombination. We also find that in pairs of species with substantial differences in effective population size, introgressed regions are inferred to have undergone selective sweeps at greater than expected frequencies in the species with lower effective population size, suggesting that introgression likely have higher potential to provide beneficial variation for species with small populations. Our results, therefore, illustrate that demography and recombination have interplayed with both positive and negative selection in determining the genomic evolution after hybridization.
Seasonal cues influence several aspects of the secondary growth of tree stems, including cambial activity, wood chemistry, and transition to latewood formation. We investigated seasonal changes in ...cambial activity, secondary cell wall formation, and tracheid cell death in woody tissues of Norway spruce (Picea abies) throughout one seasonal cycle. RNA sequencing was performed simultaneously in both the xylem and cambium/phloem tissues of the stem. Principal component analysis revealed gradual shifts in the transcriptomes that followed a chronological order throughout the season. A notable remodeling of the transcriptome was observed in the winter, with many genes having maximal expression during the coldest months of the year. A highly coexpressed set of monolignol biosynthesis genes showed high expression during the period of secondary cell wall formation as well as a second peak in midwinter. This midwinter peak in expression did not trigger lignin deposition, as determined by pyrolysis-gas chromatography/mass spectrometry. Coexpression consensus network analyses suggested the involvement of transcription factors belonging to the ASYMMETRIC LEAVES2/LATERAL ORGAN BOUNDARIES and MYELOBLASTOSIS-HOMEOBOX families in the seasonal control of secondary cell wall formation of tracheids. Interestingly, the lifetime of the latewood tracheids stretched beyond the winter dormancy period, correlating with a lack of cell death-related gene expression. Our transcriptomic analyses combined with phylogenetic and microscopic analyses also identified the cellulose and lignin biosynthetic genes and putative regulators for latewood formation and tracheid cell death in Norway spruce, providing a toolbox for further physiological and functional assays of these important phase transitions.
Accessing and exploring large‐scale genomics data sets remains a significant challenge to researchers without specialist bioinformatics training. We present the integrated PlantGenIE.org platform for ...exploration of Populus, conifer and Arabidopsis genomics data, which includes expression networks and associated visualization tools. Standard features of a model organism database are provided, including genome browsers, gene list annotation, Blast homology searches and gene information pages. Community annotation updating is supported via integration of WebApollo. We have produced an RNA‐sequencing (RNA‐Seq) expression atlas for Populus tremula and have integrated these data within the expression tools. An updated version of the ComPlEx resource for performing comparative plant expression analyses of gene coexpression network conservation between species has also been integrated. The PlantGenIE.org platform provides intuitive access to large‐scale and genome‐wide genomics data from model forest tree species, facilitating both community contributions to annotation improvement and tools supporting use of the included data resources to inform biological insight.
CONTENTS: Summary 909 I. Introduction 910 II. Genotyping 910 III. Phenotyping 911 IV. Study designs 912 V. The genetics of the ‘omics' 912 VI. Missing heritability: the dark matter of the genome 913 ...VII. Gene interactions 914 VIII. Many rare alleles 914 IX. Looking in the wrong place 914 X. Looking but not seeing 915 XI. Needles in a haystack 915 XII. Confounding effects 916 XIII. Replicating and verifying associations 916 XIV. The genetic architecture of quantitative traits in plants 917 XV. Outlook 918 Acknowledgements 919 References 919 SUMMARY: Association mapping is rapidly becoming the main method for dissecting the genetic architecture of complex traits in plants. Currently most association mapping studies in plants are preformed using sets of genes selected to be putative candidates for the trait of interest, but rapid developments in genomics will allow for genome-wide mapping in virtually any plant species in the near future. As the costs for genotyping are decreasing, the focus has shifted towards phenotyping. In plants, clonal replication and/or inbred lines allows for replicated phenotyping under many different environmental conditions. Reduced sequencing costs will increase the number of studies that use RNA sequencing data to perform expression quantitative trait locus (eQTL) mapping, which will increase our knowledge of how gene expression variation contributes to phenotypic variation. Current population sizes used in association mapping studies are modest in size and need to be greatly increased if mutations explaining less than a few per cent of the phenotypic variation are to be detected. Association mapping has started to yield insights into the genetic architecture of complex traits in plants, and future studies with greater genome coverage will help to elucidate how plants have managed to adapt to a wide variety of environmental conditions.
Summary
Carbohydrate‐active enzymes (CAZymes) catalyze the formation and modification of glycoproteins, glycolipids, starch, secondary metabolites and cell wall biopolymers. They are key enzymes for ...the biosynthesis of food and renewable biomass. Woody biomass is particularly important for long‐term carbon storage and as an abundant renewable natural resource for many industrial applications. This study presents a re‐annotation of CAZyme genes in the current Populus trichocarpa genome assembly and in silico functional characterization, based on high‐resolution RNA‐Seq data sets. Altogether, 1914 CAZyme and expansin genes were annotated in 101 families. About 1797 of these genes were found expressed in at least one Populus organ. We identified genes involved in the biosynthesis of different cell wall polymers and their paralogs. Whereas similar families exist in poplar and Arabidopsis thaliana (with the exception of CBM13 found only in poplar), a few families had significantly different copy numbers between the two species. To identify the transcriptional coordination and functional relatedness within the CAZymes and other proteins, we performed co‐expression network analysis of CAZymes in wood‐forming tissues using the AspWood database (http://aspwood.popgenie.org/aspwood-v3.0/) for Populus tremula. This provided an overview of the transcriptional changes in CAZymes during the transition from primary to secondary wall formation, and the clustering of transcripts into potential regulons. Candidate enzymes involved in the biosynthesis of polysaccharides were identified along with many tissue‐specific uncharacterized genes and transcription factors. These collections offer a rich source of targets for the modification of secondary cell wall biosynthesis and other developmental processes in woody plants.
Significance Statement
Re‐annotation of carbohydrate‐active enzymes in poplar and their transcript analyses reveal genes and gene families involved in wood cell wall biosynthesis.
Trees represent the largest terrestrial carbon sink and a renewable source of ligno-cellulose. There is significant scope for yield and quality improvement in these largely undomesticated species, ...and efforts to engineer elite varieties will benefit from improved understanding of the transcriptional network underlying cambial growth and wood formation. We generated highspatial- resolution RNA sequencing data spanning the secondary phloem, vascular cambium, and wood-forming tissues of Populus tremula. The transcriptome comprised 28,294 expressed, annotated genes, 78 novel protein-coding genes, and 567 putative long intergenic noncoding RNAs. Most paralogs originating from the Salicaceae whole-genome duplication had diverged expression, with the exception of those highly expressed during secondary cell wall deposition. Coexpression network analyses revealed that regulation of the transcriptome underlying cambial growth and wood formation comprises numerous modules forming a continuum of active processes across the tissues. A comparative analysis revealed that a majority of these modules are conserved in Picea abies. The high spatial resolution of our data enabled identification of novel roles for characterized genes involved in xylan and cellulose biosynthesis, regulators of xylem vessel and fiber differentiation and lignification. An associated web resource (AspWood, http://aspwood.popgenie.org) provides interactive tools for exploring the expression profiles and coexpression network.
There is momentum, globally, to increase the use of plant biomass for the production of heat, power and liquid transport fuels. This review assesses the evidence base for potential impacts of ...large-scale bioenergy crop deployment principally within the UK context, but with wider implications for Europe, the USA and elsewhere. We focus on second generation, dedicated lignocellulosic crops, but where appropriate draw comparison with current first-generation oil and starch crops, often primarily grown for food.
For lignocellulosic crops, positive effects on soil properties, biodiversity, energy balance, greenhouse gas (GHG) mitigation, carbon footprint and visual impact are likely, when growth is compared to arable crops. Compared to replacement of set-aside and permanent unimproved grassland, benefits are less apparent. For hydrology, strict guidelines on catchment management must be enforced to ensure detrimental effects do not occur to hydrological resources. The threat of climate change suggests that action will be required to ensure new genotypes are available with high water use efficiency and that catchment-scale management is in place to secure these resources in future. In general, for environmental impacts, less is known about the consequences of large-scale deployment of the C4 grass Miscanthus, compared to short rotation coppice (SRC) willow and poplar, including effects on biodiversity and hydrology and this requires further research.
Detailed consideration of GHG mitigation and energy balance for both crop growth and utilization suggest that perennial crops are favoured over annual crops, where energy balances may be poor. Similarly, crops for heat and power generation, especially combined heat and power (CHP), are favoured over the production of liquid biofuels. However, it is recognized that in contrast to heat and power, few alternatives exist for liquid transportation fuels at present and research to improve the efficiency and energy balance of liquid transport fuel production from lignocellulosic sources is a high current priority.
Although SRC, and to a lesser extent energy grasses such as Miscanthus, may offer significant benefits for the environment, this potential will only be realized if landscape-scale issues are effectively managed and the whole chain of crop growth and utilization is placed within a regulatory framework where sustainability is a central driver. Land resource in the UK and throughout Europe will limit the contribution that crops can make to biofuel and other renewable targets, providing a strong driver to consider sustainability in a global context.
Abstract
Recalcitrant adventitious root (AR) development is a major hurdle in propagating commercially important woody plants. Although significant progress has been made to identify genes involved ...in subsequent steps of AR development, the molecular basis of differences in apparent recalcitrance to form AR between easy-to-root and difficult-to-root genotypes remains unknown. To address this, we generated cambium tissue-specific transcriptomic data from stem cuttings of hybrid aspen, T89 (difficult-to-root) and hybrid poplar OP42 (easy-to-root), and used transgenic approaches to verify the role of several transcription factors in the control of adventitious rooting. Increased peroxidase activity was positively correlated with better rooting. We found differentially expressed genes encoding reactive oxygen species scavenging proteins to be enriched in OP42 compared with T89. A greater number of differentially expressed transcription factors in cambium cells of OP42 compared with T89 was revealed by a more intense transcriptional reprograming in the former. PtMYC2, a potential negative regulator, was less expressed in OP42 compared with T89. Using transgenic approaches, we demonstrated that PttARF17.1 and PttMYC2.1 negatively regulate adventitious rooting. Our results provide insights into the molecular basis of genotypic differences in AR and implicate differential expression of the master regulator MYC2 as a critical player in this process.
Comparative transcriptomic analysis from the cambium of two Populusgenotypes with contrasting adventitious rooting phenotype reveals higher transcriptional activity in the easy-to-root compared with the difficult-to-root genotype.