Abstract
Identification of genetically diverse cacao with disease resistance, high productivity, and desirable organoleptic traits is vitally important to the agricultural crop’s long-term sustainability. Environmental changes, pests, and diseases as well as nation’s sovereign property rights have led to a decrease in accessibility and exchange of germplasm of interest. Having been introduced during colonial times, naturalized cacao in Puerto Rico could serve as an unexplored source of genetic diversity in improvement programs. An island-wide survey was carried out to identify naturalized trees and to determine their genetic associations to reference cacao accessions. Samples were genotyped with Expressed Sequence Tag-derived single nucleotide polymorphism (SNP) markers. Principal coordinate, cluster, and population structure analysis using the genotype data for both local and reference samples assigned individuals into five distinct genetic backgrounds: Criollo, Trinitario, Amelonado, Upper Amazon Forastero (UAF), and Nacional. Puerto Rican cacao fit into four (Criollo, Trinitario, Amelonado and UAF) of the five genetic backgrounds, being mainly composed of individuals of Criollo ancestry. Based on historical evidence, cacao of Criollo background was probably brought to Puerto Rico from Venezuela and/or Central America during colonial times. Trinitario, Amelonado, and UAF genetic backgrounds are most likely products of more modern introductions. Genotyping cacao in Puerto Rico provides information on the history and possible origin of the naturalized trees on the island. In addition, the assessment has allowed the targeting of material for incorporation and long-term conservation filling gaps in the existing collection and providing new germplasm to be evaluated for agronomic performance.
Similar content being viewed by others
Notes
Mention of trade names or commercial products in this article is solely for the purpose of providing specific information and does not imply recommendation or endorsement by the US Department of Agriculture.
References
Allegre M, Argout X, Bocarra M, Fouet O, et al. (2011) Discovery and mapping of a new expressed sequence tag-single nucleotide polymorphism and simple sequence repeat panel for large-scale genetic studies and breeding of Theobroma cacao L. DNA Res 19:23–35
Argout X, Fouet O, Wincker P, Gramacho K, et al. (2008) Towards the understanding of the cocoa transcriptome: Production and analysis of an exhaustive dataset of ESTs of Theobroma cacao L. generated from various tissues and under various conditions. BMC Genomics 9:512
Argout X, Salse J, Aury JM, Guiltinan M, et al. (2011) The genome of Theobroma cacao. Nat Genet 43:101–109
Barrett OW (1925) The food plants of Porto Rico. J Agr Porto Rico 9:61–208
Bartley BGD (2005) The Genetic Diversity of Cacao and its Utilization. CABI Publishing, Cambridge
Bekele FL, Bekele I, Butler DR, Bidaisee GG (2006) Patterns of morphological variation in a sample of cacao (Theobroma cacao L.) germplasm from the International Cacao Genebank, Trinidad. Genet Resour Crop Evol 53:933–948
Bekele FL, Bekele I (1996) A sampling of the phenetic diversity of cacao in the International Cocoa Gene Bank of Trinidad. Crop Sci 36:57–64
Bidot Martínez I, Riera Nelson M, Flammand M-C, Bertin P (2015) Genetic diversity and population structure of anciently introduced Cuban cacao Theobroma cacao plants. Genet Resour Crop Evol 62:67–84
Boza E, Irish BM, Meerow A, Tondo C, et al. (2013) Genetic diversity, conservation and utilization of Theobroma cacao L. genetic resources in the Dominican Republic. Genet Resour Crop Evol 30:605–619
Carroll HK (1898) Report on the industrial and commercial conditions of Puerto Rico. Government Printing Office, Washington, pp. 109–155
Cheesman EE (1944) Notes on the nomenclature, classification and possible relationships of cacao populations. Trop Agric 21:144–159
Cuatrecasas J (1964) Cacao and its allies: a taxonomic revision of the genus Theobroma. Contrib US Herbarium 35:379–614
De la Cruz M, Whitkus R, Mota-Bravo L (1995) Origins of cacao cultivation. Nature 375:542–543
Di Rienzo JA, Casanoves F, Balzarini MG, Gonzalez L et al (2011) InfoStat versión 2011. Grupo InfoStat, FCA, Universidad Nacional de Córdoba, Argentina. URL http://www.infostat.com.ar.
Dinwiddie W (1899) Puerto Rico: Its Conditions and Possibilities. Harper and Brothers Publishers, New York
Earl DA, von Holdt B (2012) STRUCTURE HARVESTER: a website program for visualizing STRUCTURE output and implementing the Evanno method. Conserv Genet Resour 4:359–361
Efombagn MIB, Sounigo O, Nyasse S, Manzanares-Dauleux M, et al. (2009) Phenotypic variation of cacao (Theobroma cacao L.) on farms and in the gene bank in Cameroon. Plant Breed Crop Sci 1:258–264
Engels JMM, Bartley BGD, Enríquez GA (1980) Cacao descriptors, their states and modus operandi. Turrialba 30:209–218
Evanno G, Regnaut S, Goudet J (2005) Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol Ecol 14:2611–2620
Fang W-P, Meinhardt LW, Tan H-W, Zhou L, et al. (2014) Varietal identification of tea (Camellia sinensis) using nanofluidic array of single nucleotide polymorphism (SNP) markers. Hortic Res 1:14035. doi:10.1038/hortres.2014.35
FAOSTAT (2014) FAO, Rome, Italy. http://faostat3.fao.org/home/. Accessed Oct 2015
Gepts P (2006) Plant genetic resources conservation and utilization. Crop Sci 46:2278–2292
Glaszmann JC, Kilian B, Upadhaya HD, Varshney RK (2010) Accessing genetic diversity for crop improvement. Curr Opin Plant Biol 13:167–173
Grivetti L, Shapiro H (2009) Chocolate: History, Culture and Heritage. Wiley and Sons, New Jersey
Henshall MJ, Richardson P (2009) Caribbean cocoa: Planting and Production in Chocolate: History, Culture and Heritage. Wiley and Sons, New Jersey
Irish B, Goenaga R, Zhang D, Schnell R, et al. (2010) Microsatellite fingerprinting of the USDA-ARS Tropical Agriculture Research Station cacao (Theobroma cacao L.) germplasm collection. Crop Sci 50:656–667
Jakobson M, Rosenberg NA (2007) CLUMPP: a cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure. Bioinformatics 23:1801–1806
Ji K, Zhang D, Motilal LA, Boccara M, et al. (2012) Genetic diversity and parentage in farmer varieties of cacao (Theobroma cacao L.) from Honduras and Nicaragua as revealed by single nucleotide polymorphism (SNP) markers. Genet Resour Crop Ev 60:441–453
Jones PG, Allaway D, Gilmour DM, Harris C, et al. (2002) Gene discovery and microarray analysis of cacao (Theobroma cacao L.) varieties. Planta 216:255–264
Kalinowski ST, Taper ML, Marshall TC (2007) Revising how the computer program CERVUS accommodates genotyping error increases success in paternity assignment. Mol Ecol 16:1099–1106
Kaufman T (1975) Studies on the ecology and biology of cocoa pollinator, Forcipomyia squamipennis I. and M. (Diptera, Ceratopogonidae) in Ghana. B Entomol Res 65:263–268
Kuhn DN, Livingstone D III, Main D, Zheng P, et al. (2012) Identification and mapping of conserved ortholog set (COS) II sequence of cacao and their conversion to SNP markers for marker-assisted selection in Theobroma cacao and comparative genomic studies. Tree Genet Genomes 8:97–111
Lachenaud P, Bonnot F, Oliver G (1999) Use of floral descriptors to study variability in wild cocoa trees (Theobroma cacao L.) in French Guiana. Genet Resour Crop Ev 46:491–500
Letunic I, Bork P (2006) Interactive Tree of Life (iTOL): an online tool for phylogenetic tree display and annotation. Bioinformatics 23:127–128
Letunic I, Bork P (2011) Interactive Tree of Life v.2: online annotation and display of phylogenetic trees made easy. Nucleic Acids Res. doi:10.1093/nar/gkr201
Liu K, Muse SV (2005) PowerMarker: an integrated analysis environment for genetic marker analysis. Bioinformatics 21:2128–2129
Livingstone DS III, Motamayor JC, Schnell RJ, Cariaga K, et al. (2010) Development of single nucleotide polymorphism markers in Theobroma cacao and comparison to simple sequence repeat markers for genotyping of Cameroon clones. Mol Breed 27:93–106
Loor RG, Risterucci AM, Courtois B, Fouet O, et al. (2009) Tracing the native ancestors of the modern Theobroma cacao L. population in Ecuador. Tree Genet Genomes 5:421–4733
Lukman, ZD, Susilo AW, Dinarti D, et al. (2014) Genetic identity, ancestry and parentage in farmer selections of cacao from aceh, Indonesia revealed by single nucleotide polymorphism (SNP) markers. Trop Plant Biol 7:133–143
Micheletti D, Troggio M, Zharkikh A, Costa F, et al. (2011) Genetic diversity of genus Malus and implications for linkage mapping with SNPs. Tree Genet Genomes 7:857–868
Miner-Solá E (1995) Historia de los huracanes de Puerto Rico. San Juan, Puerto Rico
Motamayor JC, Lachenaud P, da Silva e Mota J, Loor R, et al. (2008) Geographic and genetic population differentiation of Amazonian chocolate tree (Theobroma cacao L.). PLoS One 3(10):e3311. doi:10.1371/journal.pone.0003311
Motamayor JC, López PA, Ortiz CF, Moreno A, et al. (2002) Cacao domestication I: The origin of the cacao cultivated by the Mayas. Heredity 89:380–386
Ober FA (1899) Puerto Rico and its Resources. D. Appleton and Company, New York, pp. 67–68
Ouborg NJ, van Treuren R, van Damme JMM (1991) The significance of genetic erosion in the process of extinction. Oecologia 86:359–367
Peakall R, Smouse PE (2006) GENALEX 6: genetic analysis in Excel. Population genetic software for teaching and research. Mol Ecol Notes 6:288–295
Peakall R, Smouse PE (2012) GenAlEx 6: genetic analysis in Excel. Population genetic software for teaching and research. Mol Ecol Notes 6:288–295
Porebski S, Grant Bailey L, Rawn BR (1997) Modification of a CTAB DNA extraction method for plants containing high polysaccharide and polyphenol components. Plant Mol Biol Report 15:8–15
Pritchard JK, Stephens M, Donnelly P (2000) Inference population structure using multilocus genotype data. Genetics 155:945–959
Rafalski A (2001) Novel genetic mapping tools in plants: SNPs and LD-based approaches. Plant Sci 162:329–333
Rafalski A (2002) Applications of single nucleotide polymorphisms in crop genetics. Curr Opin Plant Biol 5:94–100
Risterucci AM, Grivet L, N’Goran JAK, Pieretti I, et al. (2000) A high-density linkage map of Theobroma cacao L. Theor Appl Genet 101:948–955
Rogers JS (1972) Measures of genetic similarity and genetic distance, in Studies in Genetics VII. University of Texas Publication 7213. Austin, TX, pp. 145–153
Wang J, Lin M, Crenshaw A, Hutchinson A, et al. (2009) High-throughput single nucleotide polymorphism genotyping using nanofluidic dynamic arrays. BMC Genomics 10:561
Willson KC (1999) Coffee, Cocoa and Tea. CABI Publishing, New York
Winder JA, Silva P (1977) Recent research of insect pollination of cocoa. Cocoa Grower’s Bulletin 26:11–19
Wood GAR, Lass RA (1985) Cocoa, 4th edn. Longman, London
Yang JY, Motilal LA, Dempewolf H, Maharaj K et al (2011) Chloroplast microsatellite primers for cacao (Theobroma cacao) and other Malvaceae. Am J Bot 98:e372–ee374.
Zhang D, Mischke S, Goenaga R, Hemeida AA, et al. (2006) Accuracy and reliability of highthroughput microsatellite genotyping for cacao clone identification. Crop Sci 46:2084–2092
Zhang D, Boccara M, Motilal L, Mischke S, et al. (2009) Molecular characterization of an earliest cacao (Theobroma cacao L.) collection from upper amazon using microsatellite DNA markers. Tree Genet Genomes 5:595–607
Acknowledgments
The authors thank Chocolove Inc., a Boulder, Colorado-based Chocolate company, for providing funding to support research conducted and for continued efforts in cacao conservation and use. In addition, the authors thank Drs. David Kuhn, Timothy Porch, and Chris Richards for their editorial comments. Lastly, authors thank the general public and all of the farmers (especially Mr. Juan Echevarria) who were instrumental in identifying cacao trees thought out the island and helped secure leaf and pod samples.
Data archiving statement
Expressed sequence tags which were used for developing the SNP markers have previously been deposited by Argout et al. (2008, 2011) into the European Nucleotide Archive (ENA) database under accession numbers CU469588 to CU633156. The current research does not plan on submitting any de novo sequence information to be accessioned into publicly available databases. However, a full list of cacao tree accessions collected and incorporated into the USDA-ARS TARS germplasm collection will be accessioned and their associated information loaded into the Germplasm Resources Information Network (GRIN) Global database and the crop specific International Cacao Germplasm Database (ICGD) http://www.icgd.reading.ac.uk/. In addition, a full list of accession numbers for sampled cacao trees is included as supplemental material in this manuscript.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by D. Grattapaglia
Electronic supplementary material
Supplemental Fig. 1.
Detection of the optimal number of clusters (the most likely value of K) inferred by the STRUCTURE software and set ΔK = mean (|L″(K)|)/sd (L(K)) as a function of K. ΔK was generated by the STRUCTURE HARVESTER, and is based on the approach of (Evanno et al. 2005). (GIF 16 kb)
Supplemental Fig. 2.
Radial dendrogram depicting relationships among and between Puerto Rican (90) cacao (Theobroma cacao L.) samples and reference samples (100). The blue labels represent synonymous groups (SYN 1-11; Table 1) within Puerto Rican cacao samples. Red labels are possible errors in reference samples. (GIF 114 kb)
Supplementary Table 1
(DOCX 69.2 kb)
Rights and permissions
About this article
Cite this article
Cosme, S..., Cuevas, H.E., Zhang, D. et al. Genetic diversity of naturalized cacao (Theobroma cacao L.) in Puerto Rico. Tree Genetics & Genomes 12, 88 (2016). https://doi.org/10.1007/s11295-016-1045-4
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11295-016-1045-4