We identified and characterized the pseudogene complements of five plant species: four dicots (Arabidopsis thaliana, Vitis vinifera, Populus trichocarpa and Phaseolus vulgaris) and one monocot (Oryza ...sativa). Retroposition was considered of modest importance for pseudogene formation in all investigated species except V. vinifera, which showed an unusually high number of retro-pseudogenes in non coding genic regions. By using a pipeline for the classification of sequence duplicates in plant genomes, we compared the relative importance of whole genome, tandem, proximal, transposed and dispersed duplication modes in the pseudo and functional gene complements. Pseudogenes showed higher tendencies than functional genes to genomic dispersion. Dispersed pseudogenes were prevalently fragmented and showed high sequence divergence at flanking regions. On the contrary, those deriving from whole genome duplication were proportionally less than expected based on observations on functional loci and showed higher levels of flanking sequence conservation than dispersed pseudogenes. Pseudogenes deriving from tandem and proximal duplications were in excess compared to functional loci, probably reflecting the high evolutionary rate associated with these duplication modes in plant genomes. These data are compatible with high rates of sequence turnover at neutral sites and double strand break repairs mediated duplication mechanisms.
Full text
Available for:
IZUM, KILJ, NUK, PILJ, PNG, SAZU, UL, UM, UPUK
Abstract
With his bicentennial breeding history based on athletic performance, the Thoroughbred horse can be considered the equine sport breed. Although genomic and transcriptomic tools and knowledge ...are at the state of the art in equine species, the epigenome and its modifications in response to environmental stimuli, such as training, are less studied. One of the major epigenetic modifications is cytosine methylation at 5′ of DNA molecules. This crucial biochemical modification directly mediates biological processes and, to some extent, determines the organisms' phenotypic plasticity. Exercise indeed affects the epigenomic state, both in humans and in horses. In this study, we highlight, with a genome-wide analysis of methylation, how the adaptation to training in the Thoroughbred can modify the methylation pattern throughout the genome. Twenty untrained horses, kept under the same environmental conditions and sprint training regimen, were recruited, collecting peripheral blood at the start of the training and after 30 and 90 days. Extracted leukocyte DNA was analyzed with the methylation content sensitive enzyme ddRAD (MCSeEd) technique for the first time applied to animal cells. Approximately one thousand differently methylated genomic regions (DMRs) and nearby genes were called, revealing that methylation changes can be found in a large part of the genome and, therefore, referable to the physiological adaptation to training. Functional analysis via GO enrichment was also performed. We observed significant differences in methylation patterns throughout the training stages: we hypothesize that the methylation profile of some genes can be affected early by training, while others require a more persistent stimulus.
Full text
Available for:
IZUM, KILJ, NUK, PILJ, PNG, SAZU, UL, UM, UPUK
Saponins, a group of glycosidic compounds present in several plant species, have aglycone moieties that are formed using triterpenoid or steroidal skeletons. In spite of their importance as ...antimicrobial compounds and their possible benefits for human health, knowledge of the genetic control of saponin biosynthesis is still poorly understood. In the Medicago genus, the hemolytic activity of saponins is related to the nature of their aglycone moieties. We have identified a cytochrome P450 gene (CYP716A12) involved in saponin synthesis in Medicago truncatula using a combined genetic and biochemical approach. Genetic loss-of-function analysis and complementation studies showed that CYP716A12 is responsible for an early step in the saponin biosynthetic pathway. Mutants in CYP716A12 were unable to produce hemolytic saponins and only synthetized soyasaponins, and were thus named lacking hemolytic activity Ina). In vitro enzymatic activity assays indicate that CYP716A12 catalyzes the oxidation of β-amyrin and erythrodiol at the C-28 position, yielding oleanolic acid. Transcriptome changes in the lha mutant showed a modulation in the main steps of triterpenic saponin biosynthetic pathway: squalene cyclization, β-amyrin oxidation, and glycosylation. The analysis of CYP716A12 expression in planta is reported together with the sapogenin content in different tissues and stages. This article provides evidence for CYP716A12 being a key gene in hemolytic saponin biosynthesis.
Full text
Available for:
BFBNIB, NMLJ, NUK, PNG, SAZU, UL, UM, UPUK
The genetic diversity among grapevine (
L.) cultivars that underlies differences in agronomic performance and wine quality reflects the accumulation of single nucleotide polymorphisms (SNPs) and ...small indels as well as larger genomic variations. A combination of high throughput sequencing and mapping against the grapevine reference genome allows the creation of comprehensive sequence variation maps. We used next generation sequencing and bioinformatics to generate an inventory of SNPs and small indels in four widely cultivated Sardinian grape cultivars (Bovale sardo, Cannonau, Carignano and Vermentino). More than 3,200,000 SNPs were identified with high statistical confidence. Some of the SNPs caused the appearance of premature stop codons and thus identified putative pseudogenes. The analysis of SNP distribution along chromosomes led to the identification of large genomic regions with uninterrupted series of homozygous SNPs. We used a digital comparative genomic hybridization approach to identify 6526 genomic regions with significant differences in copy number among the four cultivars compared to the reference sequence, including 81 regions shared between all four cultivars and 4953 specific to single cultivars (representing 1.2 and 75.9% of total copy number variation, respectively). Reads mapping at a distance that was not compatible with the insert size were used to identify a dataset of putative large deletions with cultivar Cannonau revealing the highest number. The analysis of genes mapping to these regions provided a list of candidates that may explain some of the phenotypic differences among the Bovale sardo, Cannonau, Carignano and Vermentino cultivars.
Optimization of transgene expression can be achieved by designing coding sequences with the synonymous codon usage of genes which are highly expressed in the host organism. The identification of the ...so-called "favoured codons" generally requires the access to either the genome or the coding sequences and the availability of expression data.
Here we describe corseq, a fast and reliable software for detecting the favoured codons directly from RNAseq data without prior knowledge of genomic sequence or gene annotation. The presented tool allows the inference of codons that are preferentially used in highly expressed genes while estimating the transcripts abundance by a new kmer based approach. corseq is implemented in Python and runs under any operating system. The software requires the Biopython 1.65 library (or later versions) and is available under the 'GNU General Public License version 3' at the project webpage https://sourceforge.net/projects/corseq/files.
corseq represents a faster and easy-to-use alternative for the detection of favoured codons in non model organisms.
A meta-analysis was carried out on published literature covering the topic of interactive plant microbiology for botanical species of legumes occurring within the boundary of the Italian island ...Sardinia, lying between the Tyrrhenian and the western Mediterranean seas. Reports were screened for the description of three types of bacterial occurrences; namely, (a) the nitrogen-fixing symbionts dwelling in root nodules; (b) other bacteria co-hosted in nodules but having the ancillary nature of endophytes; (c) other endophytes isolated from different non-nodular portions of the legume plants. For 105 plant species or subspecies, over a total of 290 valid taxonomical descriptions of bacteria belonging to either one or more of these three categories were found, yielding 85 taxa of symbionts, 142 taxa of endophytes in nodules, and 33 in other plant parts. The most frequent cases were within the
,
,
,
and
genera, the majority of symbionts belonged to the
,
,
and
taxa. Both nodular and extra-nodular endophytes were highly represented by Gammaproteobacteria (
,
,
) and Firmicutes (
,
), along with a surprisingly high diversity of the Actinobacteria genus
. The most plant-promiscuous bacteria were
as symbiont and
as endophyte. In addition to the microbial analyses we introduce a practical user-friendly software tool for plant taxonomy determination working in a Microsoft Excel spreadsheet that we have purposely elaborated for the classification of legume species of Sardinia. Its principle is based on subtractive keys that progressively filter off the plants that do not comply with the observed features, eventually leaving only the name of the specimen under examination.
Helichrysum italicum (Asteraceae) is a small shrub endemic to the Mediterranean Basin, growing in fragmented and diverse habitats. The species has attracted attention due to its secondary metabolite ...content, but little effort has as yet been dedicated to assessing the genetic and metabolite diversity present in these populations. Here, we describe the diversity of 50 H. italicum populations collected from a range of habitats in Sardinia.
H. italicum plants were AFLP fingerprinted and the composition of their leaf essential oil characterized by GC-MS. The relationships between the genetic structure of the populations, soil, habitat and climatic variables and the essential oil chemotypes present were evaluated using Bayesian clustering, contingency analyses and AMOVA.
The Sardinian germplasm could be partitioned into two AFLP-based clades. Populations collected from the southwestern region constituted a homogeneous group which remained virtually intact even at high levels of K. The second, much larger clade was more diverse. A positive correlation between genetic diversity and elevation suggested the action of natural purifying selection. Four main classes of compounds were identified among the essential oils, namely monoterpenes, oxygenated monoterpenes, sesquiterpenes and oxygenated sesquiterpenes. Oxygenated monoterpene levels were significantly correlated with the AFLP-based clade structure, suggesting a correspondence between gene pool and chemical diversity.
The results suggest an association between chemotype, genetic diversity and collection location which is relevant for the planning of future collections aimed at identifying valuable sources of essential oil.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Pseudogenes are dead copies of genes. Owing to the absence of functional constraint, all nucleotide substitutions that occur in these sequences are selectively neutral, and thus represent the ...spontaneous pattern of substitution within a genome. Here, we analysed the patterns of nucleotide substitutions in Vitis vinifera processed pseudogenes. In total, 259 processed pseudogenes were used to compile two datasets of nucleotide substitutions. The ancestral states of polymorphic sites were determined based on either parsimony or site functional constraints. An overall tendency towards an increase in the pseudogene A:T content was suggested by all of the datasets analysed. Low association was seen between the patterns and rates of substitutions, and the compositional background of the region where the pseudogene was inserted. The flanking nucleotide significantly influenced the substitution rates. In particular, we noted that the transition of G→A was influenced by the presence of C at the contiguous 5′ end base. This finding is in agreement with the targeting of cytosine to methylation, and the consequent methyl-cytosine deamination. These data will be useful to interpret the roles of selection in shaping the genetic diversity of grape cultivars.
Full text
Available for:
IZUM, KILJ, NUK, PILJ, PNG, SAZU, UL, UM, UPUK
Histone methylation and acetylation are key processes in the epigenetic regulation of plant growth, development, and responses to environmental stimuli. The genes encoding for the enzymes that are ...responsible for these chromatin post-translational modifications, referred to as histone modification genes (HMGs), have been poorly investigated in
species, despite their importance for establishment and activity of nitrogen-fixing nodules. In silico analysis of
HMGs identified 81 histone methyltransferases, 46 histone demethylases, 64 histone acetyltransferases, and 15 histone deacetylases. MtHMGs were analyzed for their structure and domain composition, and some combinations that were not yet reported in other plant species were identified. Genes have been retrieved from
A17 and R108 genotypes as well as
CADL and Zhongmu No.1; the gene number and distribution were compared with
. Furthermore, by analyzing the expression data that were obtained at various developmental stages and in different zones of nitrogen-fixing nodules, we identified MtHMG loci that could be involved in nodule development and function. This work sets a reference for HMG genomic organization in legumes which will be useful for functional investigation that is aimed at elucidating HMGs involvement in nodule development and symbiotic nitrogen fixation.