Population and molecular datasets for Gymnadenia conopsea (Orchidaceae)

The paper presents data on the ecological and phytocoenotic conditions of habitats of the G. conopsea (L.) R. Br. orchid in the Northeast of European Russia (Komi Republic, Russia). The data include characteristics of the populations of this species on the northern border of its range (size and ontogenetic structure of the populations, density of specimens, phenology), as well as information demonstrating the genetic variability of the species by ISSR-markers that was not included in the main publication. The data presented here supplement our earlier published results O.E.Valuyskikh et al., 2019 and are useful for more detailed analysis of population biology and genetic variability of this rare orchid species. G. conopsea is the species of terrestrial orchids widespread in Europe and Asia and characterized by the widest ecological-cenotic amplitude and occurrence in different types of ecotopes. The size of G. conopsea populations in different parts of its range is usually small, 25–100 ind. but sometimes increases to 200–1000 ind. Hansen et al., 1999 to Travnichek et al., 2012. In the many regions of the Russian Federation, the G. conopsea are subject to protection due to the small number of habitats, long stages of ontogenesis, low population sizes and anthropogenic impact. The complex of G. conopsea s.l. included in the Red Data Book of the Komi Republic Taskaev, 2009.


a b s t r a c t
The paper presents data on the ecological and phytocoenotic conditions of habitats of the G. conopsea (L.) R. Br. orchid in the Northeast of European Russia (Komi Republic, Russia). The data include characteristics of the populations of this species on the northern border of its range (size and ontogenetic structure of the populations, density of specimens, phenology), as well as information demonstrating the genetic variability of the species by ISSR-markers that was not included in the main publication. The data presented here supplement our earlier published results O.E.Valuyskikh et al., 2019 and are useful for more detailed analysis of population biology and genetic variability of this rare orchid species. G. conopsea is the species of terrestrial orchids widespread in Europe and Asia and characterized by the widest ecologicalcenotic amplitude and occurrence in different types of ecotopes. The size of G. conopsea populations in different parts of its range is usually small, 25e100 ind. but sometimes increases to 200e1000 ind. Hansen et al., 1999to Travnichek et al., 2012 In the many regions of the Russian Federation, the G. conopsea are subject to protection due to the small number of habitats, long stages of ontogenesis, low population sizes and anthropogenic impact.

Data
The paper presents data supplementing our study of the morphological variability and genetic diversity of G. conopsea populations in the North-East of European Russia [1]. The fragrant orchid G. conopsea growing in the Komi Republic, near its northern distribution limit, where it occurs mainly on limestone outcrops in river valleys [7]. A comparative analysis of cenopopulations from different parts of the range has shown that their structure is often independent of geographic location but is determined mainly by ecocenotic conditions in habitats [2e6]. The obtained data reflects the ecological and coenotic conditions of the habitats of this species in the middle taiga subzone on limestone outcrops of the Timan Ridge (9 populations) and on the Mezen-Vychegda Plain (1 population). A brief description of the communities where the species can be found is provided in Table 1. It should be noted that species with arctic, arctic-alpine and subarctic types of habitats grow on the "cold" slopes, whereas forest steppe and pine forest species grow on the "warm" slopes. Such qualitative parameters of G. conopsea populations as size, density, shares of specimens at different ontogenetic stages, restoration index are provided in Table 1. The G. conopsea populations located in different ecotopes have different dates of beginning of phenological phases, e.g. blooming (Fig. 1).
The primary matrix of informative ISSR loci for 200 samples from 10 G. conopsea populations using two pairs of primers is shown in Table 2. An analysis of the genetic structure of the samples under study performed by the method of discriminant analysis of principal components (DAPC) made it possible to Specifications

Value of the data
The presented data can be used by other researchers to compare the habitats, phenology and population genetic characteristics of the G. conopsea orchid in different parts of the range and to develop methods for the protection of this rare species.
The raw data obtained from the ISSR analysis results will allow other researchers to expand the statistical analysis in this field.
A graphical representation of the results of data analysis using the Structure 2.3 program demonstrates the algorithm for selecting the number of groups (K) in the sample selection under study for different models [1].  group the sample selections according to their ecotopes (Fig. 2). Fig. 3 shows the graph of dependence of Delta K on K (K is a hypothetical number of isolated genetic groups in the sample selection under study) that is used to assess the population structure with the Structure 2.3 program [8,9]. The program allows estimating the genetic structure of populations and the probability of finding each individual as part of a cluster K within which the deviation from HardyeWeinberg equilibrium would be minimal. The most probable K value (the hypothetical number of isolated genetic groups in the studied sample) was determined by the method proposed in the study of G. Evanno et al. [10]. For a numerical evaluation of the homogeneity (convergence) of the results obtained at independent Structure startups, the CLUMPP version 1.1.2b program [ [11], http://clumpak.tau.ac.il/] was used. The obtained graphs (constructed with the DISTRUCT version 1.1 program, https://web.stanford.edu/ group/rosenberglab/distruct.html) show the probable population structure for 10 Structure replicates using 200 specimens from 10 to 9 G. conopsea populations (Figs. 4 and 5, respectively).

Experimental design, Materials, and Methods
Samples were collected in natural G. conopsea populations (2005e2007, 2016) in the North-East of European Russia in different orographic regions: the South Timan Ridge and Mezen-Vychegda Plain. The studied South Timan populations were located in several types of karst landscapes: on the northern and north-western slopes, on the southern and south-western slopes, as well as on the flattened surfaces in the river valleys (Fig. 6). The names of vascular plants are given according to The Plant List (http://www.theplantlist.org/). The plant communities were cut into transects which were divided into counting sites of 1 m 2 in size with registration of specimens at different ontogenetic stages (j, im, v1, v2, g). The sites were arranged linearly along a transect 20e30 m long or in parallel as adjacent strips 8e10 m long.
When identifying ontogenetic stages, we used the concept of discrete description of ontogenesis while taking into account the characteristics of individual development of G. conopsea [5,12]. Plants classified as juvenile (j) have a single middle leaf 2.7e13.5 cm long and 0.1e0.6 cm wide. The belowground sphere of such plants is diverse and may be represented by a protocorm alone, a protocorm and the first thickened adventitious root, or one or two adventitious roots and a root-stem tuberoid with several cordlike endings (lobes). The main diagnostic character for attributing a plant to the immature (im) group is the presence of two middle leaves 4.5e16 cm long and 0.2e0.9 cm wide.  Such plants usually have a root-stem tuberoid with two to four lobes and two to four adventitious roots (rarely, only one root). Our criterion for the virginile ontogenetic state is readiness for blooming in the next year, which is characteristic of plants with three to five or six assimilating leaves. We distinguish them into two subgroups, taking into account morphological adaptations of the species to a wide range of ecological conditions in the study region. The "young" vegetative subgroup (v 1 ) comprises plants with three assimilating leaves, tuberoid with two to six lobes, and three to six adventitious roots. The leaves are 5.5e15.5 cm long and 0.3e1.2 cm wide, with 5e9 (11) veins. Plants of the "adult" vegetative subgroup (v 2 ) have four to six leaves, tuberoid with three to nine lobes, and four to eight adventitious roots. The leaves are 6.3e16.5 cm long and 0.4e1.3 cm wide, with 5e10 veins. Generative (g) plants have a 13.5e51.5 cm tall shoot with 3e6 middle leaves and 3e10 adventitious roots. The counting unit was a specimen of seed origin. For each population, we determined the number of specimens (units), the mean density of plants (spec./m 2 ), the share of specimens at different developmental stages (%), and restoration index presenting the ratio of young specimens to adult ones. The studies of genetic diversity and structure of 200 samples from 10 G. conopsea populations were carried out at the Center for Collective Use "Molecular Biology" of the Institute of Biology of the Komi Scientific Center of the Ural Division of the Russian Academy of Sciences (Syktyvkar). The detailed methodology is described in the work [1]. Fig. 3. K values used to assess the population structure of G. conopsea: 1 d K values for 10 populations using the "admixture" model; 2 d K values for 10 populations using the "no admixture" model; 3 -K values for 9 populations from the South Timan (except for BF10 population) using the "admixture" model; 4 -K values for 9 populations from the South Timan (except for BF10 population) using the "no admixture" model.  Table 1).   Table 1).