Wikicite/e-scholarship/Evolution and Evolvability (WikidataR for Wikicite)

From Meta, a Wikimedia project coordination wiki

Project name[edit]

WikidataR: Expanding read-write capabilities for R to write enhanced publication metadata to Wikidata

Contact information[edit]

application type

WikiCite/e-scholarship

your name and/or Wikimedia username

Evolution and Evolvability (Thomas Shafee) - Melbourne

e-mail

Thomas.Shafee@gmail.com

Your nearest city and country

Melbourne, Australia

Details of team members (optional)

If this application is for a team project, each additional team member (up to 5 total) should have their name/username, project role, location, and contact details here:

  • Canley (Alex Lum) – Melbourne: co-developer, experienced in R programming, SPARQL and QuickStatements. Email: alex.lum@wikimedia.org.au

The activity/project[edit]

Tell us about your proposed project. What will you get achieved with this time?

Current situation

The scripting language R is the most used data science language, well suited to. There has been a R packages around for a while to read from wikidata (reading single items with WikidataR and submitting SPARQL queries with WikidataQueryServiceR). However no package exists that can write to wikidata from R.

Proposed action

I am therefore creating an expanded and updated WikidataR to have this ability to write to Wikidata via the quickstatements API (by expanding the functionality of the old read-only WikidataR package, since its original creator is no longer a developer).

Applications

The ability to write to Wikidata from R will greatly enhance the community's ability to enrich publication items in Wikidata. These include properties such as funding scheme (P6195), editor (P98), reviewed by (P4032), or describes a project that uses (P4510) for methods and equipment; significant events such as ethical approval (Q98550700) or rectification (Q56478729); and author/editor/reviewer properties such as declared conflict of interest (Q99429881). Similarly, journals can be indicated as predatory publisher (Q65770389) or hijacked journal (Q17300027). As a language, R is particularly well suited to the sort of data handling necessary to cross-reference existing databases and write to appropriate wikidata items, and this R package will act as an equivalent for the to the python PyPl package.

Your qualifications[edit]

Describe how you (and, if applicable, your team)are able to achieve this project. What skills, expertise, and motivation do you have which will enable you to succeed?

  • I have experience in publication metadata open metadata from my work with the WikiJournal User Group
  • I have experience in research output metadata (including non-traditional research outputs)
  • I have experience in R package development and basic web interfaces
  • I have already implemented some basic proof of concept functions and tested them:

Proposed activity dates[edit]

Four days total:

  • 10-11 Oct 2020
  • 17-18 Oct 2020

Budget (Standard WMF per-day living allowance - 75% of the "M & IE rate" for relevant city)

  • US$145 x 4 days x 2 people = US$1160 x 0.75 = USD $870 ( ~$1220 AUD)

Endorsements[edit]

Optional: Community members are encouraged to endorse your proposal and leave a rationale here.

...

Status
complete