Zusammenfassung: | |
Research information, i.e., data about research projects, organisations, researchers or research outputs such as publications or patents, is spread across the web, usually residing in institutional and personal web pages or in semi-open databases and information systems. While there exists a wealth of unstructured information, structured data is limited and often exposed following proprietary or less-established schemas and interfaces. Therefore, a holistic and consistent view on research information across organisational and national boundaries is not feasible. On the other hand, web crawling and information extraction techniques have matured throughout the last decade, allowing for automated approaches of harvesting, extracting and consolidating research information into a more coherent knowledge graph. In this work, we give an overview of the current state of the art in research information sharing on the web and present initial ideas towards a more holistic approach for boot-strapping research information from available web sources.
|
|
Lizenzbestimmungen: | CC BY-NC-ND 3.0 Unported - https://creativecommons.org/licenses/by-nc-nd/3.0/ |
Publikationstyp: | BookPart |
Publikationsstatus: | publishedVersion |
Erstveröffentlichung: | 2014 |
Schlagwörter (deutsch): | Konferenzschrift |
Schlagwörter (englisch): | Information extraction, Linked data, Research information, Web crawling, Data mining, Information analysis, Information retrieval, Information retrieval systems, Information systems, Social networking (online), Sounding apparatus, Websites, Automated approach, Holistic approach, Information extraction techniques, Information sharing, Knowledge graphs, Linked datum, Research outputs, Web Crawling, World Wide Web |
Fachliche Zuordnung (DDC): | 000 | Informatik, Informationswissenschaft, allgemeine Werke, 020 | Bibliotheks- und Informationswissenschaft |
Kontrollierte Schlagwörter: | Konferenzschrift |
Anzeige der Dokumente mit ähnlichem Titel, Autor, Urheber und Thema.