Location of plant species in Norway gathered as a part of a survey vegetation mapping programme

Georeferenced species data have a wide range of applications and are increasingly used for e.g. distribution modelling and climate change studies. As an integrated part of an on-going survey programme for vegetation mapping, plant species have been recorded. The data described in this paper contains 18.521 registrations of plants from 1190 different circular plots throughout Norway. All species localities are georeferenced, the spatial uncertainty is provided, and additional ecological information is reported. The published data has been gathered from 1991 until 2015. The entries contain all higher vascular plants and pteridophytes, and some cryptogams. Other ecological information is also provided for the species locations, such as the vegetation type, the cover of the species and slope. The entire material is stored and available for download through the GBIF server.


a b s t r a c t
Georeferenced species data have a wide range of applications and are increasingly used for e.g. distribution modelling and climate change studies. As an integrated part of an on-going survey programme for vegetation mapping, plant species have been recorded. The data described in this paper contains 18.521 registrations of plants from 1190 different circular plots throughout Norway. All species localities are georeferenced, the spatial uncertainty is provided, and additional ecological information is reported. The published data has been gathered from 1991 until 2015. The entries contain all higher vascular plants and pteridophytes, and some cryptogams. Other ecological information is also provided for the species locations, such as the vegetation type, the cover of the species and slope. The entire material is stored and available for download through the GBIF server.
& Value of the data The species locations are gathered within plots, providing co-occurrences of plant species and thus enabling analyses at the community level.
The recordings are useful for species distribution modelling, since the spatial precision is high and species absences from plots can be derived.
Many recordings are far away from roads and other infrastructure, thereby providing data from remote areas with few previous recordings.
Additional ecological information is provided, for example slope and vegetation type, which opens for ecological studies.

Data
The data described in this paper contain 18.521 georeferenced entries of higher vascular plants and pteridophytes identified in the field. The most common bryophytes and lichens have also been registered, but the registration of these species groups is incomplete and varying strongly. Each entry is linked to a circular plots of 10 m 2 size, where ecological variables have also been registered ( Table 1). The average number of entries in the 1190 plots is 15.6 species. The entries have recently been published (4th October 2015) on the GBIF-server, they are stored there, and are available for download [5].

Temporal coverage
The data has been gathered from 1991 to 2015, varying in number from 2897 entries in the year 2001 to 13 entries in 2015 (Fig. 1). The vegetation survey is still ongoing and new data records from the coming years will be added as they are collected and become available.  Table 1 Additional information gathered for each plot and represented for each 18.521 species entry. When downloading data from GBIF, the information will be arranged according to the Darwin Core 2 data standard [3], but including the information provided in this

Geographic coverage
The species have been sampled from all over Norway, but some regions have far more entries than others (Fig. 2). The dataset is part of a national Norwegian survey vegetation mapping including vegetation observations from 1190 localities (within 53 municipalities) across Norway.

Taxonomic and environmental coverage
This dataset includes occurrence data from 96 families, 254 genera, and 478 species (489 taxa). A list of families with the number of occurrences is provided in Appendix A. The data has been gathered within the boreal and alpine bioclimatic regions [1].

Background
The data has been registered as a part of a national Norwegian survey vegetation mapping programme [2,10], and all species localities are therefore within areas documented through vegetation maps. The vegetation type system used for mapping has been almost stable during the entire sampling period, from 1991 until 2015 [8]. The vegetation type system used for mapping has 45 vegetation types and 9 other land cover types, and is adapted to a mapping scale of 1:20.000-1:50.000 [9]. To enable sufficient descriptions of the vegetation types, and how they vary regionally and throughout Norway, species has been registered systematically as a part of the mapping programme.

The species were registered within circular plots
Within each area targeted for mapping in the national Norwegian survey vegetation mapping programme, representative locations of the vegetation types have been documented by recording plant species within circular plots. A total number of 1190 plots are surveyed. All entries uploaded in the GBIF database are therefore linked to one of these circular plots. More than 99.6% (18.460 entries) of the species recorded have been located within plots of 10 m 2 (radius c. 1.8 m). The size of the plot is provided for all recordings (Table 1).

Information linked to each species registration and how they were measured
The data provided follow the standard given by GBIF, i.e. the Darwin Core data standard [11,12] and the Darwin Core Archive formatting rules [3]. Alongside the sampling of species within each plot, additional ecological information has been registered for the plot ( Table 1). All the information provided has been gathered during field-work by 11 different surveyors between May and October each summer.
Since 2003, all plots have been georeferenced using GPS, giving an estimated average uncertainty of 75 m. From 1991 until 2003, the plots were georeferenced using topographic maps, resulting in an estimated average uncertainty of 7 100 m.
The species cover represents each species' total vertical cover of the entire 10 m 2 plot. Trees do not have to be rooted inside the plot, but must vertically cover the plot. Recording of bush cover, fieldlayer cover and so forth follow the same logic, except for the tree cover. The tree cover has been recorded for a larger circular plot of 25 m 2 (including the original 10 m 2 plot). The cover has been registered in two ways; as cover within Hult-Sernanders' 6 classes [6], or/and as percentage cover. Hult-Sernander has been registered for 99.6% of all entries, whereas percentage cover has been registered for 66.3% of all entries. 23 entries lack a measure of the cover. Aspect and slope has been recorded for each plot, representing the mean aspect and slope of each 10 m 2 plot.

Quality control
All records have been validated before they were exported to GBIF. The 18.521 species entries that have been published, makes up 84 percentage of the original database. 3542 species entries were erased based on various errors. The most common errors were missing coordinates, uncertain identification of the species, or incomplete taxonomy (only genus provided). All such records were erased. In addition, we erased all plots with unlikely geographical coordinates, for example plots located outside of the original mapping area of the project they belonged to.
Difficult or doubtful species or individuals were originally registered, but have been erased from the data as part of the quality control. All species names were corrected according to the Species Nomenclature Database, provided by the Norwegian Biodiversity Information Centre [7]. During that process, the higher nomenclature for each species was added to all records, from Kingdom to Taxon.

Usage rights
Data is published under the Creative Commons Zero (CC0) public domain waiver [14,15]. The data in the dataset is published free and open for scientific reuse and other legible use. Please cite this publication or the resource when using parts of the data in your analyses. reviewers for their comments on the manuscript, and all field-workers for their effort in gathering the data.