Published October 20, 2015 | Version v1
Other Open

Data Citation of Evolving Data: Recommendations of the Working Group on Data Citation (WGDC)

Description

The WGDC recommendations enable researchers and data centers to identify and cite data used in experiments and studies. Instead of providing static data exports or textual descriptions of data subsets, we support a dynamic, query centric view of data sets. The proposed solution enables precise identification of the very subset and version of data used, supporting reproducibility of processes, sharing and reuse of data. The goal of the WG were to create identification mechanisms that (a) allow us to identify and cite arbitrary views of data, from a single record to an entire data set in a precise, machine-actionable manner; (b) allow us to cite and retrieve that data as it existed at a certain point in time, whether the database is static or highly dynamic; and (c) is stable across different technologies and technological changes. The WG recommends solving this challenge by (1) ensuring that data is stored in a versioned and timestamped manner and (2) identifying data sets by storing and assigning persistent identifiers (PIDs) to timestamped queries that can be re-executed against the timestamped data store.

Files

datacitation.pdf

Files (473.4 kB)

Name Size Download all
md5:043f9e8e5342fc815034fbf3d6b44f31
463.2 kB Preview Download
md5:d7aa9da0cdbd36da58f0734dd66307a8
10.2 kB Preview Download