TRIPLE Deliverable 2.6: Report on Global Data Retrieval
- 1. OPERAS AISBL
- 2. Net7
- 3. OPERAS-AMU
- 4. IBL PAN
Description
This deliverable summarises the different stages and processes of data acquisition for GoTriple, a platform created in the context of the TRIPLE project. It also presents the reasons for adopting them and the challenges and opportunities identified by the team. When relevant, the report points to other TRIPLE deliverables covering different aspects of the data retrieval process and the rationale behind selected solutions.
Firstly, the processes related to source selection and description are presented. GoTriple collects metadata of documents in the social sciences and humanities (SSH) in any language, with a focus on open access content and the European Research Area (ERA). It collects data from both aggregators and providers. Data acquisition processes that were completed or ongoing as of December 2022 are summarised, together with the type of process selected for each. The report also explains the purpose of the GoTriple content providers handbook, which is currently being set up and will serve as the support service for GoTriple providers.
Secondly, the harvesting process, including the Harvesting Management Support tool, is described. SCRE (Semantic Content Retrieval Engine), a dedicated platform for data ingestion and curation, has been developed as part of the project to process data using a pipeline approach. The concrete steps that the tool offers to the user are described and illustrated with screenshots.
Lastly, the deliverable reflects on the data retrieval activities and choices made during the project and on the directions that the next steps of GoTriple development should take.
Files
D2.6.Report on global data retrieval_draft.pdf (3.0 MB)
md5:0825433fa730cc536805eca139776750