Theobroma cacao L., native to the Amazonian basin of South America, is an economically important fruit tree crop for tropical countries as a source of chocolate. The first draft genome of the ...species, from a Criollo cultivar, was published in 2011. Although a useful resource, some improvements are possible, including identifying misassemblies, reducing the number of scaffolds and gaps, and anchoring un-anchored sequences to the 10 chromosomes.
We used a NGS-based approach to significantly improve the assembly of the Belizian Criollo B97-61/B2 genome. We combined four Illumina large insert size mate paired libraries with 52x of Pacific Biosciences long reads to correct misassembled regions and reduced the number of scaffolds. We then used genotyping by sequencing (GBS) methods to increase the proportion of the assembly anchored to chromosomes.
The scaffold number decreased from 4,792 in assembly V1 to 554 in V2 while the scaffold N50 size has increased from 0.47 Mb in V1 to 6.5 Mb in V2. A total of 96.7% of the assembly was anchored to the 10 chromosomes compared to 66.8% in the previous version. Unknown sites (Ns) were reduced from 10.8% to 5.7%. In addition, we updated the functional annotations and performed a new RefSeq structural annotation based on RNAseq evidence.
Theobroma cacao Criollo genome version 2 will be a valuable resource for the investigation of complex traits at the genomic level and for future comparative genomics and genetics studies in cacao tree. New functional tools and annotations are available on the Cocoa Genome Hub ( http://cocoa-genome-hub.southgreen.fr ).
Criollo cacao (Theobroma cacao ssp. cacao) was cultivated by the Mayas over 1500 years ago. It has been suggested that Criollo cacao originated in Central America and that it evolved independently ...from the cacao populations in the Amazon basin. Cacao populations from the Amazon basin are included in the second morphogeographic group: Forastero, and assigned to T. cacao ssp. sphaerocarpum. To gain further insight into the origin and genetic basis of Criollo cacao from Central America, RFLP and microsatellite analyses were performed on a sample that avoided mixing pure Criollo individuals with individuals classified as Criollo but which might have been introgressed with Forastero genes. We distinguished these two types of individuals as Ancient and Modern Criollo. In contrast to previous studies, Ancient Criollo individuals formerly classified as 'wild', were found to form a closely related group together with Ancient Criollo individuals from South America. The Ancient Criollo trees were also closer to Colombian-Ecuadorian Forastero individuals than these Colombian-Ecuadorian trees were to other South American Forastero individuals. RFLP and microsatellite analyses revealed a high level of homozygosity and significantly low genetic diversity within the Ancient Criollo group. The results suggest that the Ancient Criollo individuals represent the original Criollo group. The results also implies that this group does not represent a separate subspecies and that it probably originated from a few individuals in South America that may have been spread by man within Central America.
The wild cacao trees (
Theobroma cacao
L.) of French Guiana were investigated during several survey and collecting expeditions between 1985 and 1995. The many studies made of this material showed it ...to be highly original. New surveys were undertaken in May 2012 in south-eastern French Guiana, in the upper reaches of the Tanpok River and some of its tributaries, in order to enrich the cacao collection preserved at the Paracou-Combi site at Sinnamary (French Guiana). One of the aims of the expedition was to determine whether the populations from the upper-Tanpok were original and different from those already represented in the collection. From the 2012 mission four populations were identified. The material collected was described (location, tree and pod morphology). Genetic diversity of the collected genotypes and various controls was studied using SSR markers. The collected trees belonged to the “Guiana” group, but did not reveal any distinctive feature within the group. The genetic diversity encountered was low (H
e
= 0.124). The outcome of these surveys is positive, since new populations were located. In order to preserve and carry out phenotypical studies (agro-morphology) on the new material collected, a plot was planted at Paracou-Combi in January 2013.
Theobroma cacao, is a tropical understorey tree that is a major economic resource to several tropical countries. However, the crop is under increased threat from several diseases that are responsible ...for 30% loss of harvest globally. Although QTL data related to the genetic determinism of disease resistance exist in cocoa, QTL mapping experiments are heterogeneous, thus making comparative QTL mapping essential for marker assisted selection (MAS). Sixteen QTL experiments were analysed, and the 76 QTLs detected were projected on a progressively established consensus map. Several hot spots, with QTLs related to different Phytophthora species and other diseases, were observed. The likely number of “real” QTLs was estimated by using a meta-analysis implemented in BioMercator software. There was a twofold reduction in average confidence interval observed when compared to the confidence interval of individual QTLs. This alternative approach confirms the existence of several sources of resistance to different diseases of cocoa which could be cumulated in new varieties to increase the sustainability of cocoa resistance using MAS strategies.
Theobroma cacao is an economically important tree of several tropical countries. Its genetic improvement is essential to provide protection against major diseases and improve chocolate quality. We ...discovered and mapped new expressed sequence tag-single nucleotide polymorphism (EST-SNP) and simple sequence repeat (SSR) markers and constructed a high-density genetic map. By screening 149 650 ESTs, 5246 SNPs were detected in silico, of which 1536 corresponded to genes with a putative function, while 851 had a clear polymorphic pattern across a collection of genetic resources. In addition, 409 new SSR markers were detected on the Criollo genome. Lastly, 681 new EST-SNPs and 163 new SSRs were added to the pre-existing 418 co-dominant markers to construct a large consensus genetic map. This high-density map and the set of new genetic markers identified in this study are a milestone in cocoa genomics and for marker-assisted breeding. The data are available at http://tropgenedb.cirad.fr.
Cocoa black pod caused by Phytophthora is one of the most serious diseases affecting cocoa production. This project set out was to improve our knowledge of the molecular mechanisms involved in ...Theobroma cacao resistance. We performed a transcriptional screening using macroarrays containing 89 ESTs from subtractive libraries enriched with genes differentially expressed in response to T. cacao/Phytophthora interactions. A gene expression analysis revealed the induction of genes involved in stress signal transduction, general and specific defence genes and genes involved in hormone signalling pathways. RT-qPCR showed that the level expression of two genes in the resistant plant was higher during the first eight hours after pathogen inoculation. Three genes from a family encoding a protease inhibitor displayed different levels of expression depending on resistant and susceptible T. cacao clones. A comparison of promoter sequences revealed a motif present only in genes that were up-expressed in the resistant plant.
•We improve molecular mechanisms involved in Theobroma cacao resistance to Phytophthora.•Comparison of promoter sequences of protease inhibitor genes.•Functional genomic and differential expression analysis.
Sugarcane cultivars are polyploid, aneuploid, interspecific hybrids between the domesticated species Saccharum officinarum and the wild relative S. spontaneum. Cultivar chromosome numbers range from ...100 to 130 with approximately 10% contributed by S. spontaneum. We have undertaken a mapping study on the progeny of a selfed cultivar, R570, to analyze this complex genome structure. A set of 128 restriction fragment length polymorphism probes and one isozyme was used. Four hundred and eight markers were placed onto 96 cosegregation groups, based on linkages in coupling only. These groups could tentatively be assembled into 10 basic linkage groups on the basis of common probes. Origin of markers was investigated for 61 probes and the isozyme, leading to the identification of 80 S. officinarum and 66 S. spontaneum derived markers, respectively. Their distribution in cosegregation groups showed better map coverage for the S. spontaneum than for the S. officinarum genome fraction and occasional recombination between the two genomes. The study of repulsions between markers suggested the prevalence of random pairing between chromosomes, typical of autopolyploids. However, cases of preferential pairing between S. spontaneum chromosomes were also detected. A tentative Saccharum map was constructed by pooling linkage information for each linkage group.
A linkage map of cacao based on codominant markers has been constructed by integrating 201 new simple sequence repeats (SSR) developed in this study with a number of isoenzymes, restriction fragment ...length polymorphisms (RFLP), microsatellite markers and resistance and defence gene analogs (Rgenes-RFLP) previously mapped in cacao. A genomic library enriched for (GA)n and (CA)n was constructed, and 201 new microsatellite loci were mapped on 135 individuals from the same mapping population used to establish the first reference maps. This progeny resulted from a cross between two heterozygous cacao clones: an Upper-Amazon Forastero (UPA 402) and a Trinitario (UF 676). The new map contains 465 markers (268 SSRs, 176 RFLPs, five isoenzymes and 16 Rgenes-RFLP) arranged in ten linkage groups corresponding to the haploid chromosome number of cacao. Its length is 782.8 cM, with an average interval distance between markers of 1.7 cM. The new microsatellite markers were distributed throughout all linkage groups of the map, but their distribution was not random. The length of the map established with only SSRs was 769.6 cM, representing 94.8% of the total map. The current level of genome coverage is approximately one microsatellite every 3 cM. This new reference map provides a set of useful markers that is transferable across different mapping populations and will allow the identification and comparison of the most important regions involved in the variation of the traits of interest and the development of marker-assisted selection strategies.