WikiCite/2020 Virtual conference/Align your Open Access Journal with Wikidata - using Python and OpenRefine

Open citations & linked bibliographic data | 26-28 October 2020 | #WikiCite
Part of Celebrating Wikidata's 8th Birthday | #WikidataBirthday
12:15 UTC
15 min
Summary
Open access repositories, such as journal websites, offer freely accessible APIs such as OAI-PMH for retrieving bibliographic metadata. In this talk I will present a lightweight Python script (as a Jupyter notebook), available at gitlab.com/LibrErli1/parse_ojs_oai_2_wikidata. The script harvests any OAI2-conformant site and extracts the bibliographic values needed for a Wikidata ingest into a serialized JSON file. This JSON output is then processed further in OpenRefine (e.g. linking and disambiguating authors or main subjects against Wikidata) to prepare the upload to Wikidata.
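To make the harvesting step concrete, here is a minimal sketch of how one OAI-PMH `ListRecords` response in the `oai_dc` format could be turned into JSON-ready records. It is not the code of the notebook itself: the function names (`build_request_url`, `dc_values`, `parse_list_records`), the JSON field names, and the `journal.example.org` URLs are assumptions made for illustration. Only the request parameters (`verb`, `metadataPrefix`, `resumptionToken`) and the XML namespaces come from the OAI-PMH 2.0 and Dublin Core specifications.

```python
import json
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

# XML namespaces defined by OAI-PMH 2.0 and Dublin Core.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "oai_dc": "http://www.openarchives.org/OAI/2.0/oai_dc/",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def build_request_url(base_url, resumption_token=None):
    """Build a ListRecords URL; a resumption token replaces metadataPrefix."""
    if resumption_token:
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": "oai_dc"}
    return f"{base_url}?{urlencode(params)}"


def dc_values(dc, tag):
    """Collect the non-empty text values of one Dublin Core element."""
    return [e.text.strip() for e in dc.findall(f"dc:{tag}", NS)
            if e.text and e.text.strip()]


def parse_list_records(xml_text):
    """Return (records, resumption_token) for one ListRecords response."""
    root = ET.fromstring(xml_text)
    records = []
    for rec in root.iterfind(".//oai:record", NS):
        header = rec.find("oai:header", NS)
        dc = rec.find("oai:metadata/oai_dc:dc", NS)
        # Skip deleted records and records without Dublin Core metadata.
        if header is None or header.get("status") == "deleted" or dc is None:
            continue
        titles = dc_values(dc, "title")
        records.append({  # hypothetical field names
            "oai_identifier": header.findtext("oai:identifier", default="", namespaces=NS),
            "title": titles[0] if titles else None,
            "authors": dc_values(dc, "creator"),
            "dates": dc_values(dc, "date"),
            "subjects": dc_values(dc, "subject"),
            "identifiers": dc_values(dc, "identifier"),
        })
    token = root.findtext(".//oai:resumptionToken", default="", namespaces=NS)
    return records, token.strip() or None


# Made-up sample response, used instead of a live request.
SAMPLE = """
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:journal.example.org:article/1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>An Example Article</dc:title>
          <dc:creator>Doe, Jane</dc:creator>
          <dc:creator>Roe, Richard</dc:creator>
          <dc:date>2020-10-26</dc:date>
          <dc:subject>Open access</dc:subject>
          <dc:identifier>https://journal.example.org/article/view/1</dc:identifier>
        </oai_dc:dc>
      </metadata>
    </record>
    <record>
      <header status="deleted"><identifier>oai:journal.example.org:article/2</identifier></header>
    </record>
    <resumptionToken>page2</resumptionToken>
  </ListRecords>
</OAI-PMH>
"""

records, token = parse_list_records(SAMPLE)
print(json.dumps(records, indent=2, ensure_ascii=False))
```

In a real harvest, the URL from `build_request_url` would be requested repeatedly, passing the returned token back in until no resumption token remains. The resulting JSON file could then be opened as a new project in OpenRefine for reconciliation against Wikidata.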
Links
- Recording of the talk online at TIB-AV-Portal.
- gitlab.com/LibrErli1/parse_ojs_oai_2_wikidata OAI-PMH Parser - Jupyter Notebook
Bio
Christian Erlinger works as a systems librarian at Vienna Public Libraries. Twitter: @LibrErli