Morphological and phytochemical data of Vanilla species in Mexico

This systematic determination of morphological and phytochemical data was conducted with the purpose of conserving and identifying the phylogenetic relationship among the Vanilla species of the Totonacapan region in Mexico to increase awareness of the genetic biodiversity. Samples of Vanilla planifolia, V. planifolia cv. “oreja de burro”, V. pompona, V. insignis, and V. inodora, are distributed across 19 municipalities of the State of Veracruz and 19 municipalities of the State of Puebla. Morphological data parameters were determined in situ and included leaf length, leaf width, leaf thickness, stem diameter, stem thickness, node distance, stem texture degree, flower colour intensity, and fruit length. Similarly, alkaloids, tannins, saponins, phenols, flavonoids, and terpenes were determined by specifically phytochemical tests and quantified by thin layer chromatography. Both, morphological and phytochemical data parameters, were successfully used in assembling dendrograms by using the Euclidian distance method and by principal component analysis.

and phytochemical data parameters, were successfully used in assembling dendrograms by using the Euclidian distance method and by principal component analysis.
& 2018 Published by Elsevier Inc. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).

Subject area
Botany, Taxonomy, Phytochemistry More specific subject area Genetic conservation, genetic resources and taxonomy Type of data Samples of stems and leaves from every plant were collected and then subjected to a dehydration process that required three days at a temperature of 50°C.

Experimental features
An ethnobotanical exploration was conducted in order to collect morphological data from the field. Plant samples were collected for phytochemical data standard determination in the lab.

Data source location
There were 38 municipalities, of which 19 are located in the State of Veracruz and 19 are in the State of Puebla, Mexico. The GPS coordinates are 21°10' North latitude and 98°0' West longitude.

Data accessibility
The data are included in this article.

Value of the data
The collected data could increase knowledge about the level of genetic recombination among different commercial varieties with the wild population of the Vanilla genus, which are currently in danger of extinction due to excessive gathering, jointly with the asexual reproduction way.
This data could displace some of the wild populations of Vanilla species since more than of 90% of local commercial vanillin production is extracted exclusively from Vanilla planifolia.
In spite of a continuous debate regarding the monophyletic origin of the Vanilla genus, the morphological and phytochemical data collected could help produce an intraspecific taxonomic classification of the Vanilla genus of the Totonacapan, because of their defined and restricted geographical distributions.
Data distribution and variability could serve as a reference for other Taxonomic studies in Mesoamerica region.

Data
The purpose of this article is to report on the collection of morphological and phytochemical data from five different vanilla species, which were successfully used in the dendrogram assembly process using the Euclidean distance method, as well as through the analysis of major components. The records relating to the morphological data indicated that the most significant differences identified were in the diameter and thickness of the stem, as well as in the distances between the knots, the degree of texture, and the intensity of the colour of the flower. As a result, the assembly dendrograms showed that V. planifolia or "oreja de burro" and V. planifolia shared the same morphological characteristics. However, they turned out to be widely different from V. pompona, V. insignis, and V. inodora. On the other hand, phytochemical data records showed that the most relevant differences found in phytochemical concentrations were in flavonoids, phenols and terpenes, in leaves and stems. In particular, the dendrogram assembled with phytochemical data showed that V. planifolia cv "oreja de burro" and V. insignis showed a phytochemical presence, which is similar to the other vanilla species tested.
The morphological and phytochemical data presented in this work addresses the opportunity to analyse phylogenetic relationships between local Vanilla species and also to preserve the genetic local background of this orchid.
Samples plant were obtained from five species of Vanilla sp. (Fig. 1) through an ethnobotanical exploration in the Totonacapan region of Puebla-Veracruz, Mexico (Fig. 2); in order to geo-reference their location a high-precision GPS (Garmin-650) was used in conjunction with a Universal Transverse Mercator system (TMS). Data for nine morphological traits (Table 1) were identified and the first principal components analysis explains around 89.5% of the total variation in the study ( Table 2). The first principal component (PC1) explained 46.6% of the total variation and the most relevant distinctions were stem diameter (SD) and stem thickness (ST) ( Table 2). The second principal component (PC2) explained 30.9% of the total variation and was determined by the distance between nodes (ND). The third principal component (PC3) explained 12% of the total variation and was obtained with stem texture degree (STD) and flower colour intensity (FC) traits (Table 2). Spatial morphological data regarding the dispersion of five vanilla species, as a function of the principal components analysis, clearly distinguished four groups of populations (Fig. 3). Assembly of the dendrogram using morphological data collected in the field is shown in Fig. 4, where V. pompona and V. planifolia showed the most significant differences ( Table 3); the dendrogram showed the phylogenetic relationships of the    In addition, six assays were used to determine secondary metabolite concentrations for acquiring phytochemical data ( Table 4). Of these, only three phytochemical data assays were successfully quantified through a thin layer chromatography technique ( Table 5).  The principal component analysis was performed using phytochemical data that only included three variables (Table 5) of six analysed. Therefore, dispersion of phytochemical data on all Vanilla species that was analysed was determined by the three first principal components which explained 92.4% of the total variation of this study ( Table 6). The first principal component (PC1) explained around 51.9% of the total variation and was represented by flavonoids in stems (FS) ( Table 6). PC2 explained 23.2% of the total variation and was determined by phenols in leaves (PL) and terpenes in stems (TS). The third principal component (PC3) explained only 17% of the total variation and was determined by flavonoids in leaves (PL) and terpenes in leaves (TL) ( Table 6). Spatial distribution of Vob (V. planifolia cv. "oreja de burro"); Vis (V. insignis); Vpl (V. planifolia); Vpo (V. pompona) and Via (V. inodora). Concentration scale: ( þ ) weak, ( þ þ) medium, ( þ þ þ) high and (-) absence. phytochemical data on five Vanilla species is shown in Fig. 5, which was obtained by principal component analysis. Therefore, the phenols, flavonoids, and terpene concentration in leaves and stems were assembled into the dendrogram in Fig. 6. With these phytochemical data, clado analysis identified the following pattern: 1) V. planifolia cv. "oreja de burro" and V. insignis, 2) V. planifolia, 3) V. pompona and 5) V. inodora.

Experimental background
In order to collect morphological and phytochemical data about the Vanilla sp, an ethnobotanical exploration was performed through in the Totonacapan region Puebla-Veracruz, Mexico; where five species of the genus Vanilla: Vanilla planifolia, V. planifolia cv. "oreja de burro", V. pompona, V. insignis and V. inodora were identified. Phytochemical data acquisition of leaves and stems of plants was followed with a methanolic extraction method in order to quantify alkaloids, tannins, saponins, phenols, flavonoids, and terpenes (Table 4) by using a thin layer chromatography technique. This technique identified nine morphological parameters of the plant (Table 3). Finally, a principal component analysis was used to identify the most influential morphological and phytochemical traits impacting the data dispersion of the Vanilla species. Then a clustered analysis of phytochemical and morphological data obtained via Euclidian methods was performed in order to assemble the phylogenetic relationships.

Study area
The study of five species of Vanilla was conducted in Totonacapan, a region located between the Sierra Madre Oriental (West) and the coastal plain of the Gulf of Mexico (East) [1]. The study area includes approximately 7551 km 2 (Fig. 2)

Phytochemical data acquisition
Samples of stems and leaves from every plant were collected and then subjected to a dehydration process that required three days at a temperature of 50°C. Then 0.5 g of dry matter was dissolved in 5 mL of 80% methanol [4]. The mixture was filtered using Whatman No.1 filter paper. The tests were performed in triplicate and phytochemical parameters were determined as follows: alkaloids in leaves (AL) and stems using the Dragendorff reagent [5]; tannins in leaves (TL) and stems (TS) via a ferric chloride reagent [6]; saponins in leaves (SL) and stems (SS) using a foaming index [7]; phenols in leaves (PL) and stems (PS) by the Folin Ciocalteu reagent [12]; flavonoids in leaves (FL) and stems (FS) using hydrochloric acid and magnesium reagents [8]; and terpenes in leaves (TL) and stems (TS) by the Lieberman test [9]. A thin-layer chromatography technique and corresponding standard of a secondary metabolite were used [10] in order to determine the number of metabolites (Table 5).

Statistical analysis
Data were analysed with a Statistical Analysis System software (v9.0), and ANOVA and Tukey's range test (p o 0.05) were later performed. Cluster analysis was conducted using the average data for every variable in order to identify morphological and phytochemical differences between analysed Vanilla species. Subsequently, a principal component analysis was used in order to identify the most influential morphological and phytochemical traits affecting the dispersion of Vanilla species.

Assembly of phylogenetic dendrograms
Subsequently, every phytochemical and morphological data point was hierarchically clustered with the Euclidian Distance Method in order to assemble the phylogenic dendrograms. This procedure was performed with the use of JMP Statistics (v10) [11].