WikiCite/2020 Virtual conference/Align your Open Access Journal with Wikidata - using Python and OpenRefine


Open citations & linked bibliographic data | 26-28 October 2020 | #WikiCite

Part of Celebrating Wikidata's 8th Birthday | #WikidataBirthday

12:15 UTC

15min

Summary

Open access repositories such as journal websites offer freely accessible APIs, such as OAI-PMH, for retrieving their bibliographic metadata. In this talk I will present a lightweight Python script (as a Jupyter notebook), available at gitlab.com/LibrErli1/parse_ojs_oai_2_wikidata. The script can scrape any OAI2-conformant site and extract all the necessary bibliographic values into a serialized JSON file for a Wikidata ingest. This JSON output is then processed further in OpenRefine (e.g. linking and disambiguating authors or main subjects with Wikidata) to prepare the upload to Wikidata.
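As a rough illustration of the workflow described above, the following minimal sketch parses an OAI-PMH `ListRecords` response in the `oai_dc` format and serializes selected fields to JSON. It is not the code from the notebook: the function names (`fetch_list_records`, `parse_records`), the choice of output fields, and the sample record are assumptions made for this example, and paging via resumption tokens is left out.

```python
import json
import urllib.parse
import urllib.request
import xml.etree.ElementTree as ET

# XML namespaces defined by OAI-PMH 2.0 and Dublin Core.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}

# Made-up sample response, used so the sketch runs without network access.
SAMPLE_RESPONSE = """
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header>
        <identifier>oai:journal.example.org:article/1</identifier>
      </header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>An Example Article</dc:title>
          <dc:creator>Doe, Jane</dc:creator>
          <dc:creator>Roe, Richard</dc:creator>
          <dc:subject>Open access</dc:subject>
          <dc:date>2020-10-26</dc:date>
          <dc:identifier>https://journal.example.org/article/view/1</dc:identifier>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>
"""


def fetch_list_records(base_url):
    """Fetch the first page of a ListRecords response as bytes (hypothetical helper)."""
    query = urllib.parse.urlencode({"verb": "ListRecords", "metadataPrefix": "oai_dc"})
    with urllib.request.urlopen(f"{base_url}?{query}") as response:
        return response.read()


def parse_records(xml_data):
    """Turn an OAI-PMH ListRecords response into a list of plain dicts."""
    root = ET.fromstring(xml_data)
    records = []
    for record in root.iterfind(".//oai:record", NS):

        def values(tag):
            # Collect the non-empty text values of all dc:<tag> elements.
            found = record.iterfind(f".//dc:{tag}", NS)
            return [el.text.strip() for el in found if el.text and el.text.strip()]

        titles = values("title")
        dates = values("date")
        records.append({
            "oai_id": record.findtext("oai:header/oai:identifier", default="", namespaces=NS),
            "title": titles[0] if titles else None,
            "authors": values("creator"),
            "subjects": values("subject"),
            "date": dates[0] if dates else None,
            "identifiers": values("identifier"),
        })
    return records


records = parse_records(SAMPLE_RESPONSE)
output = json.dumps(records, indent=2, ensure_ascii=False)
print(output)
```

The resulting JSON can be imported into OpenRefine as a new project, where the author and subject values can then be reconciled against Wikidata. For a live journal, `parse_records(fetch_list_records(base_url))` would be the entry point, with `base_url` set to the journal's OAI endpoint.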

Links

Bio

Christian Erlinger works as a systems librarian at Vienna Public Libraries. Twitter: @LibrErli