Research:Developing a wiki-integrated workflow to build a living review on just sustainability transitions
This page documents a planned research project.
Information may be incomplete and change before the project starts.
Upcoming project events:
[edit]- 9-10 July (Paris, french) : Presentation at MeSSH (Méthodes pour les sciences sociales et les humanités - Methods for social sciences and humanities)
- 21-22 Juillet (Paris, english and french) Hackathon Wikimania
- 1er et 15 september (online, english) LD4 Wikidata Affinity Group https://www.wikidata.org/wiki/Wikidata:WikiProject_LD4_Wikidata_Affinity_Group
- 1er december (online, french) Wikicafé : De la réponse à un appel à projet de la Fondation Wikimédia à la publication des résultats : retour d'expérience du projet d'utilisation de Wikidata pour faire une revue de littérature par Adélie Ranville https://fr.wikipedia.org/wiki/Projet:Wikifier_la_science/WikiCaf%C3%A9s
Summary
[edit]This research proposal focuses on developing a living literature review on just sustainability transitions, addressing the challenges of information overload, knowledge synthesis and dissemination in academic research. We aim to assess the potential of Wikidata for creating an enriched, searchable academic knowledge graph on just sustainability transitions in order to facilitate navigation of existing academic knowledge and synthesis of research findings. To do so, we will conduct a meta-review of existing literature reviews, aiming to synthesize their findings by making the data they include interoperable and compatible with linked open data standards. Utilizing Wikidata, the project will collect and enrich bibliographic data, extract research results, and build a knowledge graph. The final output will include a literature review academic paper linked to this knowledge graph and a technical report about the challenges encountered in our literature review workflow. The project aligns with Wikimedia's strategic goals by contributing to filling content gaps on an important topic and by proposing an innovative way to build and disseminate social sciences results that could improve expert contribution to Wikimedia project and content trustworthiness.
Adressed Wikimedia Foundation Goals :
- Innovate in Free Knowledge, 2030 Strategic direction : https://meta.wikimedia.org/wiki/Movement_Strategy/Recommendations/Innovate_in_Free_Knowledge
- Identify Topics for Impact, 2030 Strategic direction : https://meta.wikimedia.org/wiki/Movement_Strategy/Recommendations/Identify_Topics_for_Impact
- Deliver trustworthy encyclopedic content and fuel volunteer growth, Multigenerational strategy : https://meta.wikimedia.org/wiki/Strategy/multigenerational
Methods
[edit]Our study will rely on a meta-review, that is a review of existing literature reviews. Data presented in literature reviews are usually presented as tables or diagrams, and sometimes provided as supplementary materials in publications. However, these data are not made interoperable and are not used to update prior literature reviews. Our goal will be to synthesize results of previous literature reviews by making their findings compatible with linked open data and open science standards using Wikidata, Wikiversity, Wikipedia and other open-science infrastructures. We will collect and enrich bibliographic data, extract research result data to build a knowledge graph, propose relevant visualization of this graph and write a literature review report linked with our knowledge graph, making scientific writing compatible with the linked open data ideal.
Timeline
[edit]Our project is organized in 3 work packages each including precise deliverables (D) or event participation (E).
WP1: Conducting a living meta review on just sustainability transition using Wikimedia projects
- D1.1: An academic Wikidata graph on just sustainability transition supported by academic references and relevant SPARQL queries to navigate the graph.
- D1.2: An academic paper presenting a meta-literature review of existing reviews on just sustainability transition including our detailed methodology.
WP2: Data engineering and user workflow assessment
- D2.1: Technical documentation of the method workflow, identifying existing, missing or incomplete tools (ex : Zotero-Wikidata synchronisation)
- D2.2: Small developments addressing workflow gaps (ex : zotero script, open refine data model, new wikidata properties, Wikidata Schema, export/import between mediawiki and word document…)
WP3: Dissemination and community building
Audience 1: Social science researchers
- E.1: Academic conference presentation at a social science conference (ex: RC33 International Conference on Social Science Methodology, Knowledge Graphs for Sustainability Workshop – KG4S) to share our new living review workflow.
- E.2: Wikidata & Wikipedia Editathon on sustainability research
Audience 2: Wikimedia researchers
- E.3: Presentation within the wikimedia community (ex: Wiki Workshop, Wikimania)
- D3.1: Updated project page on meta.wiki
- D3.2: Research grant application targeting open science infrastructure funds (OSCARS Open Calls, The navigation fund, Fond national pour la science ouverte, PEPR eNSEMBLE : « Collaboration Numérique »).
Audience 3: Wikimedia community
- E.4: Presentation at a french-speaking Wikimedia community event (ex : Wikifranca Wikiconvention Francophone)
| Task | Month |
|---|---|
| D1.1 : Building wikidata graph (data collection) | 1-3 |
| D1.1 : Building wikidata graph (data analysis) | 4-7 |
| D1.1 : Building wikidata graph (data viz) | 6-9 |
| D1.2 : Writing academic paper | 9-12 |
| D2.1 : Writing technical documentation | 1, 3, 8, 12 |
| D2.2 : Technical developments | 1, 3, 8 |
| D3.1 : Updated Meta.wiki page | 3, 4, 9, 12 |
| D3.2 : Grant(s) application writing | 2-3 |
| E1-4 : Dissemination events | 4, 10 |
Policy, Ethics and Human Subjects Research
[edit]TBD
Results
[edit]- Initial research proposal : https://openreview.net/forum?id=JePNXYgcKM
- D1.2 : Academic paper just transition : Just sustainability transitions: a living review
- D2.1 : Technical documentation
- D2.2 : Technical developments
- D2.2.1 : Participation at Wikimedia Hackathon Northwestern Europe 2026
- D2.2.2 : Training/workshop with Nicolas Vigneron on how to use OpenRefine for blibliographic data and metadata
- E.1: Academic conference presentation at a social science conference : https://doi.org/10.5281/zenodo.20071370
- E.2: Wikidata Editathon on sustainability and research: Initiation to wikidata and contributions around sustainable development goals: Editathon Wikidata Grenoble 2026, around 10 participants (master students and 1 researcher): https://outreachdashboard.wmflabs.org/courses/IAE_de_Grenoble/Initiation_Wikidata_Grenoble_2026/home. Presentation (in french) : https://www.canva.com/design/DAGRkDwtbQ0/HdZhQonQaFiJJB_Y63qTpg/edit?utm_content=DAGRkDwtbQ0&utm_campaign=designshare&utm_medium=link2&utm_source=sharebutton
- E.3: Presentation within the wikimedia community : Wikimania 2026
- E.4: Presentation at a Wikimedia community event : (https://www.wikidata.org/wiki/Wikidata:WikiProject_LD4_Wikidata_Affinity_Group ? https://fr.wikipedia.org/wiki/Projet:Wikifier_la_science/WikiCaf%C3%A9s ?)
Other community engagement activities : - Wikidata ontology course project : www.wikidata.org/wiki/User:Bass.Ham/Wikidata-for-Sustainability Other dissemination opportunities : - Post on https://forrt.org/resources/
Technical Documentation
[edit]- The Systematic Review Automation project is a project with similar goals (https://github.com/SisonkeBiotik-Africa/Systematic-Review-Automation ; https://commons.wikimedia.org/wiki/File:WikidataCon_2021_-_Systematic_Review_Automation.pdf). However, we are working on a less automated and more user-friendly workflow. A reason for this is that automation seem to require an existing reliable ontology for the concepts/keywords and the social science concepts are not well represented nor structured on Wikidata. Automation seem relevant for deductive workflows while we want to support inductive building of knowledge graphs.
- The Shiny app is developping data extraction tools to characterize a sample of publications : https://osf.io/preprints/metaarxiv/2yhux_v1
During the construction of the graph, the main difficulty encountered was the numerous bugs in the open-source tools used. For example, the "Author Disambiguator" tool, used to create entries for the researchers who worked on the publications we are analysing, fails to launch about half the time, displaying the message "too many requests"
The Orcidator tool, which, as its name suggests, automatically adds ORCIDs to researchers’ profiles, could not be used. The message "DEACTIVATED BECAUSE OF ABUSE" appears after logging[1].
Wikidata account configuration
[edit]Several tools are useful to reproduce the method developped in this project (they can be activated in user settings : https://www.wikidata.org/wiki/Special:Preferences)
- Merge
- CiteTool
- Duplicate references
- Move
- Move claim
- currentDate (to use when using a wikidata item as reference)
Importing publications in Wikidata
[edit]Best tools :
- Zotero to Quickstatement : https://www.wikidata.org/wiki/Wikidata:Zotero (The simplest way to avoid the creation of duplicates seem to be to fetch the QID of each publication using Cita : https://www.wikidata.org/wiki/Wikidata:Zotero/Cita)
- DOI to Quickstatement with Scholia (limited to 12 items in a batch) : https://scholia.toolforge.org/id-to-quickstatements
Less recommanded :
- [Do not check for duplicates] PMID (PubMed ID), DOI (Digital Object Identifier), and PMCID (PubMed Central ID) to quickstatement https://sourcemd.toolforge.org https://www.wikidata.org/wiki/Wikidata:SourceMD/instructions
- [Not user friendly] https://github.com/magnusmanske/papers
Existing documentation :
- Zotero and open refine : https://librarian.aedileworks.com/2025/08/01/how-i-use-zotero-openrefine-quickstatements-to-create-scholia-profiles-from-wikidata/
- https://iu.pressbooks.pub/wikidatascholcomm/chapter/tools/
Citing a wikidata item in a wiki page
[edit]- There is a template to cite Wikidata scholarly items https://en.wikiversity.org/wiki/Template:Cite_Q
- There is a template to link toward Wikidata items : https://en.wikiversity.org/wiki/Template:Wikidata_entity_link
- It would be interesting to be able to cite concepts as well (for exemple calling values for: Label, Description, Coined by...)
- It would be interesting to generate a "cite work" Wikidata statement on the item of the Wikipage each time a "Cite Q" template is used. It would allow to visualise the items cited in a Wiki page more easily. (Problem: it will not work to differentiate references listed in various linguistic versions of the same page)
Using properties related to scientific work
[edit]We used the following properties to enrich data related to scientific publications:
study type P8363 and data analysis method P13391
[edit]- We started a discussion on the ontology of research methods in wikidata : https://www.wikidata.org/wiki/Wikidata:Project_chat#Research_methods
main subject P921
[edit]A tool exist to semi-automatize the addition of topics for scientific items (Item Subjector https://www.wikidata.org/wiki/Wikidata:Tools/ItemSubjector), however its use is too technical for people without technical knowledge.
research site P6153
[edit]The property "research site" showed constraints issue as it was created to indicate clinical trial sites but used in practice for the larger purpose of the research site or fieldwork. A discussion is ongoing (2026) to solve this : https://www.wikidata.org/wiki/Property_talk:P6153 (Constraints alerts can be ignored in the meantime.)
cites work (P2860)
[edit]- The Zotero plugin Cita (https://www.wikidata.org/wiki/Wikidata:Zotero/Cita) is a promising tool to build citation networks but is still in developpement (feb. 2026). Bug resolution is in progress : https://github.com/zotero-cita/zotero-cita/issues/70.
- The Open Citation Bot (https://www.wikidata.org/wiki/User:OpenCitations_Bot)
- A list of references can be reconciled with Wikidata with OpenRefine
- Startin from a DOI, a list of references can be exported (from https://references.mireklzicar.com/), imported in Zotero and then exported to quickstatement (after getting QID of items existing in Wikidata from Cita)
- Because it was taking so long to fix the bug in Cita, we tested OpenRefine (https://openrefine.org/), which has fewer bugs. It allows us to perform this addition task.
Authors
[edit]We used the Author Disambiguator tool to create Wikidata items for researchers who did not yet have one. This tool helps to minimise errors caused by homonyms among researchers: following a query, it categorises scientific publications into thematic groups. It also automatically searches for ORCID, ResearchGate and VIAF pages[1].
- https://author-disambiguator.toolforge.org/
- https://www.wikidata.org/wiki/Wikidata:ORCIDator
- Enriching author's data can help doing research on diversity in a specific academic community (ex:https://wikiworkshop.org/2026/paper/wikiworkshop_2026_45_divinwd_insights_from_wikidata_for_measuring_diversity_in_scholarly_publications)
The "Author Disambiguator" tool, used to create entries for the researchers who worked on the publications we are analysing, fails to launch about half the time, displaying the message "too many requests".
Other bug : "Error retrieving token: mwoauthdatastore-request-token-not-found"
Author disambiguator sometimes failed to retrieve the publications : SPARQL query 'SELECT ?q ?qLabel WHERE { VALUES ?q { } . SERVICE wikibase:label { bd:serviceParam wikibase:language '[AUTO_LANGUAGE],en,de,es,fr,nl,mul'. } }' failed on endpoint 'https://query.wikidata.org/sparql' after 0 retries. Last HTTP response HTTP/1.1 500 Internal Server Error
- To do : report bugs in github https://github.com/arthurpsmith/author-disambiguator/issues
The Orcidator tool, which, as its name suggests, automatically adds ORCIDs to researchers’ profiles, could not be used. The message "DEACTIVATED BECAUSE OF ABUSE" appears after logging (https://sourcemd.toolforge.org/orcidator_old.php, tested on 7 April 2026).
OpenAlex ID
[edit]- We used OpenRefine to automatically add the unique OpenAlex identifier using the OpenCitations API.
Sourcing statements
[edit]- Infferred from, etc.
Extracting data from an article
[edit]Bibliographic tables
[edit]Some review articles contain tables about specific studies. These tables are hard to reuse when they are on a PDF (the table formatting is lost when we try to copy it). However, it is possible to copy a table from the online version of an article and paste it in an excel sheet.
Visualising bibliographic data from Wikidata
[edit]- Page referencing Wikidata visualisation tools : https://www.wikidata.org/wiki/Wikidata:Tools/Visualize_data
Visualizing bibliographic tables with enriched meta-data
[edit]- We tried SPARQL requests to visualise tables of bibliographic items with the data we added (main topic, study type, research site), these requests did not work because of the Graph split and a python program was necessary to visualize the table (for example : https://github.com/Ronnie-V/ReadSubjects)
- It is possible to visualize data by adding columns of reconciled data in Openrefine : [to develop]
- Tabernacle seem to be the most user-friendly tool to visualise tables : https://tabernacle.toolforge.org (Example). The direct results on this page are editable by simple click and thus not suitable for purely sharing results since accidental edit can easily occur. It can be use for contributing quickly but lacks the possibility to add references. It seem to be a great tool for label translation.
Visualising topics
[edit]Visualizing hyperlinks between Wikipedia pages
[edit]- The Wikipedia Museum tool helps visualizing the hyperlinks between Wikipedia pages https://sophiawliu.github.io/fieldtrip/#/museum-wiki/custom/list?museum=custom (Source: Sophia Liu. Revisiting the Rabbit Hole: The Hypertextual Friction of Wikipedia https://wikiworkshop.org/2026/paper/wikiworkshop_2026_46_revisiting_the_rabbit_hole_the_hypertextual_friction_of_wikipedia). This tool can help identify missing links between Wikipedia pages by comparing entities linked in Wikidata with the links across their respective Wikipedia pages.
Visualizing timelines
[edit]Visualising connexions between two concepts
- The "metaphact" pathfinder tool could be interesting to visualise a relation between concepts but many properties seem omitted in the existing tool.https://wikidata.metaphacts.com
Visualising specific properties
[edit]- Wikidata graph builder seem to be the most user friendly, robust and versatile tool to visualise a graph of a single property https://angryloki.github.io Other tools are bugged or limited to specific properties.
Writing in mediawiki
[edit]- The visual editor is not available by default and has to be enabled in the user preferences. It would help new users to have it enabled by default.
Abstract Wikipedia
[edit]- Abstract Wikipedia allows to write articles using WIkidata items https://abstract.wikipedia.org/wiki/Abstract_Wikipedia:Main_page
Word to Media Wiki integration
[edit]The problem : Academics without technical background often write using word and zotero, which allows to manage the references while we write and generate a reference list (see example attached). If we copy-paste this kind of text using Wikipedia visual editor, the text formatting is kept but the references data are lost: all (author, date) mentions have to be manually changed to a reference, for example by generating the reference from the DOI of each cited paper. This is quite consuming and as a result i have seen wikipedia pages in which the contributors did not change the references formatting, kept (author, date) mentions and copy-pasted their reference list at the end of the page.
The goal : it would be great to have a seamless integration when we copy-paste a text having zotero references in it in the visual editor, or to have a converter to change word+zotero reference formatting into wiki reference formatting.
Existing discussion : https://en.wikipedia.org/wiki/Help:WordToWiki
Impact analysis
[edit]- Changes in page views : https://pageviews.wmcloud.org
- For the item "energy democracy", the number of Wikidata page view has significantly increased since we added rich statements to it : https://pageviews.wmcloud.org/?project=wikidata.org&platform=all-access&agent=spider&redirects=0&start=2025-06&end=2026-05&pages=Q14944319
Resources
[edit]- Wikiversity page (French) : https://fr.wikiversity.org/wiki/M%C3%A9thodologie_de_revue_de_litt%C3%A9rature_cumulative
- Wikidata data models :
Just sustainability networks & projects
[edit]Publications
[edit]- https://manifesto.wiki/
- https://meta.wikimedia.org/wiki/Critical_Wikimedia_Research_Bibliography
- https://journals.sagepub.com/doi/10.1177/20539517251357292
Related Wiki projects
[edit]- Science Hub
- Wikimedians for Sustainable Development
- Wiki loves Sustainable Development Goals
- Visual Analytics for Sustainability and Climate Change
- Research:Language-Agnostic_Topic_Classification/V2_Focus_Groups#Sustainability_and_Biodiversity
References
[edit]- https://wikimediafoundation.org/news/2018/08/14/understanding-workflows-wikimedia-editors/
- https://www.mediawiki.org/wiki/Wikimedia_Product/Contribution_taxonomy
- ↑ https://sourcemd.toolforge.org/orcidator_old.php, tested on 7 April 2026