Saami place names to Wikidata as lexicographical data is a joint project initiative between AvoinGLAM (Wikimedia, Open Knowledge and Creative Commons Finland joint working group on GLAM, languages and culture) and Wikimedia Norge. The aim is to include Saami place names from the Finnish authorities databases to Wikidata following a process developed together with Wikimedia Norge's project Samiske stedsnavn.
Wikidata is a free and open knowledge base that can be read and edited by both humans and machines. Several institutions link existing Wikidata entities to their authority databases through dedicated properties or import data as Wikidata entities and statements. The content of Wikidata is available with the Creative Commons Zero waiver, exported using standard formats, and can be interlinked to other open data sets on the linked data web.
Project description and work packages between WMFI and WMNO
Wikimedia Finland will join Wikimedia Norway’s effort to import Inari, Skolt, and Nothern Saami place names to Wikidata as place items and lexemes. The work is part of Wikimedia Norway’s ongoing project to be completed during 2021.
Wikimedia Finland will work to engage Finnish partners in the project as observers, for discussion and data providers, and to plan further collaboration.
Wikimedia Finland executes the reconciliation process and produces a short report on tooling for reconciling geographic data and interfaces for the public to correct geographic and lexeme data.
Wikimedia Finland gathers all the relevant data and data enriching procedures it has explored for Wikimedia Norway to use in the import process. Wikimedia Finland also participates in planning the lexeme creation and import to Wikidata.
WMNO and WMFI will jointly produce learning resources for future imports and the handling of place names as lexemes.
- Partner engagement
- Provide the data and related information for the process
- Participate in the reconciliation process and produce a report on tooling
- Plan the lexeme import
- Create learning resources of the process and its parts
- Jointly plan the lexeme import
- Execute the lexeme import
- Jointly arrange the kickoff and closing events
- Documentation for Wikimedians as well as for project partners
Read the more detailed documentation in Wikidata at WikiProject Finland/Saami place names to Wikidata as lexicographical data.
- Import places with Northern Saami, Inari Saami and Skolt Saami place names from the Finnish National Land Survey Place Names Register to Wikidata using the Wikidata property P4119, NLS Geographic Names Register Place ID. See a sample of the same data: https://www.wikidata.org/wiki/Q4357913.
- April–May 2021 / AvoinGLAM
- Partner discussion in Wikimedia Finland's Slack on channel #saamiplacenames.
- Further geographic data discussions in the Telegram channel #wikimaps.
- Kickoff workshop to introduce the project and exchange ideas. Date to be discussed.
- Import lexemes
- Create the data model for importing the place names of these places in the Saami languages as lexemes. Reference the entries in the Finnish National Land Survey Place Names Register with the Wikidata property P4118, NLS-FI Geographic Name ID.
- Execute the import.
- Provide conjugations for the place names as lexicographical data in collaboration with Giellatekno at the Arctic University of Norway in Tromsø.
- Import the conjugations.
- Quality check the names before or after import.
- Final event, evaluation and next steps.
Partners in Finland
Partnership means exchange of information rather than commitments from the organization.
- Susanna Ånäs / AvoinGLAM (project management)
- Antti Jakobsson / National Land Survey of Finland
- Mikko Lappalainen, Okko Vainonen / National Library of Finland, FINTO
- Helinä Uusitalo / Kotus, Institute for the languages of Finland
- Taarna Valtonen / Giellagas Institute, University of Oulu
- Eero Hyvönen, Jouni Tuominen, Esko Ikkala / SeCo, HELDIG
- Timo Pyhälahti, Finnish Environment Institute
The project is open for further partners.
Partners in Norway
- Jon Harald Søby / Wikimedia Norge
- Jon Arild Olsen / Språkbanken at Nasjonalbiblioteket
- Trond Trosterud / Giellatekno at the Arctic University of Norway