Structured data for GLAM-Wiki/Prepare data
This page is currently a draft. More information pertaining to this may be available on the talk page. Translation admins: Normally, drafts should not be marked for translation.
Select the data and media files you want to contribute, and prepare them to be compatible with Wikimedia Commons and Wikidata.
Header 1
Clean up the data to be consistent and compatible with Wikimedia Commons and/or Wikidata.
- Look at similar media or data items on Wikimedia Commons or Wikidata for inspiration on how to model the data.
- Wikidata's WikiProjects – the 'groups' where volunteers work together on common interests – often have recommendations on data modelling for specific subjects.
- Example: the item structure recommended by WikiProject Visual arts, https://www.wikidata.org/wiki/Wikidata:WikiProject_Visual_arts/Item_structure
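As a rough illustration of what cleaning data for consistency can involve, here is a minimal Python sketch. It assumes a hypothetical export with `title` and `date` fields and a few hypothetical date layouts; real collection data will likely need more cases (uncertain dates, date ranges, other languages), and the right target format depends on the data model you choose.

```python
import re
from datetime import datetime

# Assumption: date layouts that a (hypothetical) institutional export might use.
DATE_FORMATS = ("%Y-%m-%d", "%d/%m/%Y", "%d.%m.%Y")


def clean_text(value):
    """Trim the value and collapse runs of whitespace into single spaces."""
    return re.sub(r"\s+", " ", value).strip()


def normalize_date(value):
    """Return the date as YYYY-MM-DD, or None if no known layout matches."""
    text = clean_text(value)
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(text, fmt).date().isoformat()
        except ValueError:
            continue
    # Leave anything unrecognised (e.g. "circa 1889") for manual review.
    return None


def clean_record(record):
    """Clean every text field and normalise the hypothetical 'date' field."""
    cleaned = {key: clean_text(val) for key, val in record.items()}
    cleaned["date"] = normalize_date(record.get("date", ""))
    return cleaned
```

Values that cannot be parsed are returned as `None` instead of being guessed, so they can be checked by hand before upload.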
Tools:
- Spreadsheet software - allows non-programmers to run checks against existing Wikimedia content
- Google Sheets - free spreadsheet software that can be collaborative
- Wikipedia and Wikidata tools for Google Spreadsheets is a free add-on for Google Sheets that adds functions for querying Wikipedia and Wikidata.
- OpenRefine (formerly Google Refine) - popular tool for advanced data cleaning, transformation and matching against Wikidata content. Its homepage includes video tutorials and a guide on how to use version 3.0 and higher for Wikidata manipulation and uploading.
- PAWS and Pywikibot - for those with some programming experience; they allow large-scale querying and advanced actions.
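To give an idea of what programmatic querying can look like, for instance in a PAWS notebook, the sketch below looks up candidate Wikidata items for a name through the Wikidata API's `wbsearchentities` action. It is only an illustration under stated assumptions: it uses the Python standard library, not Pywikibot (which needs its own installation and configuration), and the function names and the User-Agent string are placeholders invented for this example.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

API_URL = "https://www.wikidata.org/w/api.php"


def build_search_url(term, language="en", limit=5):
    """Build a wbsearchentities request URL for the given search term."""
    params = {
        "action": "wbsearchentities",
        "search": term,
        "language": language,
        "type": "item",
        "limit": limit,
        "format": "json",
    }
    return API_URL + "?" + urlencode(params)


def extract_candidates(response):
    """Turn the API's JSON response into (id, label, description) tuples."""
    return [
        (hit["id"], hit.get("label", ""), hit.get("description", ""))
        for hit in response.get("search", [])
    ]


def search_wikidata(term, language="en", limit=5):
    """Query Wikidata (needs network access) and return candidate matches."""
    request = Request(
        build_search_url(term, language, limit),
        # Placeholder User-Agent: replace with your own tool name and contact.
        headers={"User-Agent": "glam-data-prep-example/0.1 (add contact details)"},
    )
    with urlopen(request, timeout=30) as reply:
        return extract_candidates(json.load(reply))
```

Calling `search_wikidata("some artist name")` returns a short list of candidates to review; a name match alone does not confirm that an item is the right one, so the results still need to be verified before use.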
Website scraping/ingest tools (if the data is available online but the partner can't produce data exports from its database)