Community Wishlist Survey 2019/Wikidata/Gather metadata of ISO/ASTM/EN... standards

From Meta, a Wikimedia project coordination wiki

Gather metadata of ISO/ASTM/EN... standards

  • Problem: We do not have items for many ISO/ASTM/EN... standards. This is usefull for projects because these standards list the official names and definition of some other items such as material properties.
  • Who would benefit: Wikidata projects and the community as a whole.
  • Proposed solution: Write a script that crawl ISO/ASTM/CEN sites for standards metadata.
  • More comments:
  • Phabricator tickets:
  • Proposer: Thibdx (talk) 17:27, 11 November 2018 (UTC)[reply]

Discussion

  • Comment Comment @Thibdx: I already have a simple scrapy script that dumps ISO data into a CSV file. The time consuming part is not having a model for ISO standards, and not having an easy and efficient way to write data back into Wikidata (one edit per item with multiple statements, not Pywikibot's one edit per minor change). You can copy and have a look at the spider (especially the xpath rules) at [1]. Dhx1 (talk) 12:46, 19 November 2018 (UTC)[reply]

Voting