Published by De Gruyter, April 21, 2022

IUPAC specification for the FAIR management of spectroscopic data in chemistry (IUPAC FAIRSpec) – guiding principles

Robert M. Hanson, Damien Jeannerat, Mark Archibald, Ian J. Bruno, Stuart J. Chalk, Antony N. Davies, Robert J. Lancashire, Jeffrey Lang and Henry S. Rzepa

Abstract

A set of guiding principles for the development of a standard for FAIR management of spectroscopic data is outlined and discussed. The principles form the basis for future recommendations of IUPAC Project 2019-031-1-024, which will specify a detailed data model and metadata schema for describing the contents of an “IUPAC FAIRData Collection” and the organization of digital objects within that collection. Foremost among the recommendations will be a specification for an “IUPAC FAIRData Finding Aid” that describes the collection in such a way as to optimize the findability, accessibility, interoperability, and reusability of its contents. Results of an analysis of data provided by an American Chemical Society Publications pilot study are discussed in relation to potential workflows that might be used in implementing the “IUPAC FAIRSpec” standard based on these principles.


Article note:

A collection of invited papers on Cheminformatics: Data and Standards.



Corresponding author: Robert M. Hanson, Department of Chemistry, St. Olaf College, Northfield, MN, USA, e-mail:

Acknowledgments

RMH thanks St. Olaf College students Kha Trinh and Lecheng Lyu for their assistance obtaining and unpacking the ACS pilot datasets early on in the development of our workflow prototype. This project is supported by IUPAC, Project 2019-031-1-024.

Published Online: 2022-04-21
Published in Print: 2022-06-27

© 2022 IUPAC & De Gruyter. This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. For more information, please visit: http://creativecommons.org/licenses/by-nc-nd/4.0/

Downloaded on 26.4.2024 from https://www.degruyter.com/document/doi/10.1515/pac-2021-2009/html