
WikiCite/2020 Virtual conference/Align your Open Access Journal with Wikidata - using Python and OpenRefine

From Meta, a Wikimedia project coordination wiki

Open citations & linked bibliographic data | 26-28 October 2020 | #WikiCite

Part of Celebrating Wikidata's 8th Birthday | #WikidataBirthday

12:15 UTC




Open access repositories such as journal websites offer freely accessible APIs like OAI-PMH for retrieving bibliographic metadata. In this talk I will present a lightweight Python script (as a Jupyter notebook), available at gitlab.com/LibrErli1/parse_ojs_oai_2_wikidata. The script harvests any OAI-PMH 2.0 compliant site and extracts the bibliographic values needed for a Wikidata ingest into a serialized JSON file. This JSON output is then processed further in OpenRefine (e.g. linking and disambiguating authors or main subjects against Wikidata) to prepare the upload to Wikidata.
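To make the harvesting step more concrete, here is a minimal sketch of how an OAI-PMH `ListRecords` response in the `oai_dc` format could be turned into JSON. It is not the notebook from the repository above: the function names, the JSON field names, and the hand-written sample response are assumptions chosen for illustration. Only the `verb` and `metadataPrefix` request parameters and the XML namespaces come from the OAI-PMH and Dublin Core specifications.

```python
import json
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

# XML namespaces defined by the OAI-PMH 2.0 and Dublin Core specifications.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "oai_dc": "http://www.openarchives.org/OAI/2.0/oai_dc/",
    "dc": "http://purl.org/dc/elements/1.1/",
}

# Hypothetical, hand-written response used here instead of a live request.
SAMPLE_RESPONSE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header>
        <identifier>oai:example.org:article/1</identifier>
        <datestamp>2020-10-01</datestamp>
      </header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>An Example Article</dc:title>
          <dc:creator>Doe, Jane</dc:creator>
          <dc:creator>Roe, Richard</dc:creator>
          <dc:date>2020-09-15</dc:date>
          <dc:subject>Open access</dc:subject>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>
"""


def list_records_url(base_url, metadata_prefix="oai_dc"):
    """Build a ListRecords request URL for an OAI-PMH endpoint."""
    query = urlencode({"verb": "ListRecords", "metadataPrefix": metadata_prefix})
    return base_url + "?" + query


def parse_records(xml_text):
    """Extract selected Dublin Core fields from a ListRecords response."""
    root = ET.fromstring(xml_text)
    records = []
    for rec in root.iterfind(".//oai:record", NS):
        dc = rec.find(".//oai_dc:dc", NS)
        if dc is None:
            # Deleted records carry a header but no metadata; skip them.
            continue
        records.append({
            "oai_identifier": rec.findtext(
                "oai:header/oai:identifier", default="", namespaces=NS),
            "title": dc.findtext("dc:title", default="", namespaces=NS),
            "creators": [e.text for e in dc.findall("dc:creator", NS) if e.text],
            "date": dc.findtext("dc:date", default="", namespaces=NS),
            "subjects": [e.text for e in dc.findall("dc:subject", NS) if e.text],
        })
    return records


records = parse_records(SAMPLE_RESPONSE)
json_output = json.dumps(records, indent=2, ensure_ascii=False)
print(json_output)
```

A real harvest would fetch the URL returned by `list_records_url` and keep following the `resumptionToken` that the endpoint returns until the list is complete. Authors and subjects are kept as plain strings on purpose, since matching them to Wikidata items is the part left to OpenRefine.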




Christian Erlinger works as a systems librarian at Vienna Public Libraries. Twitter: @LibrErli