Jump to content

WikiCite 2025/Abstracts

From Meta, a Wikimedia project coordination wiki

Here we list all abstracts of all talks of the programme, the ones selected by the main committee, and the ones proposed by the community and arranged by a different jury

The abstracts (maximum length 200 words) are listed in the order of the slots on the programme. You can leave comments in the dedicated sections to provide feedback to the speakers.

Before August 25th we expect the authors to upload their slides when possible on the dedicated category on Wikimedia Commons, so that the community can also comment them before the presentations. Presenters may use the Wikicite 2025 presentations slides template:

All materials related to these talks will also be listed here at the end of the conference.

Video Streaming

[edit]

🔴 Video Streaming

Friday 29th

[edit]

Past, Present and Future of Wikidata and Wikibase

[edit]
  • language: en
  • time slot(s): 09:30-10:30 CEST
  • main author(s) and descriptions:
Lydia Pintscher (Lydia Pintscher (WMDE)) [in person]
Leif Lobinsky (Leif Lobinsky (WMDE)) [in person]

Comments

[edit]

WikiCite in Switzerland and Italy - New Perspectives

[edit]
(Etherpad notes)
  • language: en
  • time slot(s): 10:45-11:00 CEST
  • main author(s) and descriptions:
Alessandro Marchetti (Alessandro Marchetti): volunteer from WMCH
  • abstract or description:

Since around 2020-2021, two distinct yet partially interconnected initiatives have emerged in Switzerland and Italy, both focusing primarily on the indexing of academic authors on Wikidata, and later evolved under the WikiCite umbrella.

We illustrates the approaches, similarities, challenges, and differences of these efforts. This comparative analysis provides insights and recommendations that may be valuable to similar initiatives in other countries.

These findings have been presented as oral contributions on three separate occasions in 2024: in Florence on January 17th, in Katowice on August 10th 2024, and in Bologna on November 9th.

Comments

[edit]

The UNINE-Libra import project

[edit]
(Etherpad notes)
  • language: en
  • time slot(s): 11:30-12:00 CEST
  • main author(s) and descriptions:
Thomas Kerboul (Thomas Kerboul (BGE)): metadata specialist at the Bibliothèque de Genève
  • abstract or description:

This project, carried out between April 2023 and November 2024, made it possible to import and align data from Libra, the University of Neuchâtel's institutional repository, to Wikidata. Thanks to this initiative, 1310 LIBRA identifiers (P11710) have been added, often accompanied by the creation of Qid for the scientists concerned. Other identifiers, such as ORCID or SCOPUS, as well as biographical data such as gender, profession or language of expression, were also integrated where possible. At the end of the project, researchers at the University of Neuchâtel are well represented on Wikidata, making it easy to extract demographic statistics on their scientific output using the SPARQL language. Wikidata also serves as a hub for identifiers, enabling tighter control of authority data, particularly in the case of homonyms. This project opens up bibliometric perspectives by facilitating the association of researchers with their publications and, via SPARQL, enables the data from Wikidata and IdRef, the authority database used for cataloguing at the University of Neuchâtel, to be jointly compared and improved.

Comments

[edit]

Data round-tripping for "Swiss WikiCite"

[edit]
(Etherpad notes)
  • language: en
  • time slot(s): 12:00-12:30 CEST
  • main author(s) and descriptions:
Camillo Pellizzari (Epìdosis)
Alessandro Marchetti (Alessandro Marchetti)
  • abstract or description:

«Data round-tripping consists in the reciprocal synchronisation of data between Wikidata and the external databases to which it links, with main focus on the correction of errors found by Wikidata users in external databases» (d:Wikidata:Data round-tripping). This presentation analyses the issue of data round-tripping from many different perspectives and specifically in the case of the "Swiss WikiCite". Data round-tripping was a crucial aspect of the "Swiss WikiCite" iniziative, that invested a dedicated amount of human ressources to handle the topic at the community level.

Comments

[edit]

Homonyms clean-up for "Swiss WikiCite"

[edit]
(Etherpad notes)
  • language: en
  • time slot(s): 12:30-13:00 CEST
  • main author(s) and descriptions:
Sara De Monaco (SaraDeMonaco): Digital Humanities graduate at University of Pisa
  • abstract or description:

In the evolving landscape of scholarly metadata, the challenge of homonymous researcher identities presents significant barriers to accurate citation, attribution, and knowledge integration across platforms.

This presentation addresses the complexities of disambiguating academic profiles within Wikidata, drawing on the "Researchers in Switzerland" initiative as a focused case study. We explore strategies that leverage structured data properties—such as ORCID iD, institutional affiliation, and educational background—to differentiate between individuals with identical or similar names. Emphasis is placed on the role of external identifiers (e.g., VIAF, Scopus) and community-driven curation practices developed over the past year through collaborative work originated in the Italian-speaking community.

This work contributes to the ongoing WikiCite effort by showcasing a scalable, community-oriented model for improving data integrity in open scholarly knowledge graphs.

Comments

[edit]

Cooperative metadata creation - Parsifal and a possible evolution

[edit]
(Etherpad notes)
  • language: en
  • time slot(s): 14:00-14:30 CEST
  • main author(s) and descriptions:
Stefano Bargioni (d:User:Bargioni): deputy director of the Library of the Pontificia Università della Santa Croce (Rome)
  • abstract or description:

Library consortium systems have gradually evolved from the Discovery type to the current systems based on Linked Data and RDF. Parsifal is one of the latter: it implements the BIBFRAME data model, IFLA LRM, and other ontologies, transforming bibliographic information into Linked Data and promoting memory stewardship. Integration with Wikidata and Wikibase reinforces this objective. Wikidata is an open, multilingual manager of semantically structured and highly reusable data. Wikibase, also thanks to its combined use with Wikidata, can host projects similar to Parsifal, enabling the creation of memory repositories between multiple institutions. These institutions can independently model and manage the data, maintaining control over it and promoting reuse and interoperability.

I sistemi consortili di biblioteche si sono evoluti progressivamente dal tipo Discovery fino agli attuali sistemi basati su Linked Data e RDF. Parsifal è uno di questi ultimi: implementa il modello di dati BIBFRAME, IFLA LRM e altre ontologie, trasformando le informazioni bibliografiche in Linked Data e promuovendo la custodia della memoria. L’integrazione con Wikidata e Wikibase rafforza questo obiettivo. Wikidata è un gestore aperto e multilingue di dati semanticamente strutturati e fortemente riutilizzabili. Wikibase, grazie anche ad un uso combinato con Wikidata, può ospitare progetti analoghi a Parsifal, permettendo la creazione di repository della memoria tra più istituzioni, che possono modellare e gestire autonomamente i dati, mantenendone il controllo e favorendo il riuso e l’interoperabilità.

[traduction automatique] Les systèmes de consortium de bibliothèques ont progressivement évolué, passant du type Discovery aux systèmes actuels basés sur les données liées et RDF. Parsifal fait partie de ces derniers : il implémente le modèle de données BIBFRAME, IFLA LRM et d'autres ontologies, transformant les informations bibliographiques en données liées et favorisant la gestion de la mémoire. L'intégration avec Wikidata et Wikibase renforce cet objectif. Wikidata est un gestionnaire ouvert et multilingue de données sémantiquement structurées et hautement réutilisables. Wikibase, grâce à son utilisation combinée avec Wikidata, peut héberger des projets similaires à Parsifal, permettant la création de référentiels de mémoire entre plusieurs institutions. Ces institutions peuvent modéliser et gérer les données de manière indépendante, en gardant le contrôle et en favorisant la réutilisation et l'interopérabilité.

[automatische Übersetzung] Bibliothekskonsortiumsysteme haben sich schrittweise vom Discovery-Typ zu den aktuellen Systemen auf Basis von Linked Data und RDF weiterentwickelt. Parsifal ist eines dieser Systeme: Es implementiert das BIBFRAME-Datenmodell, IFLA LRM und andere Ontologien, wandelt bibliografische Informationen in Linked Data um und fördert die Verwaltung von Daten. Die Integration mit Wikidata und Wikibase unterstützt dieses Ziel. Wikidata ist ein offener, mehrsprachiger Manager semantisch strukturierter und hochgradig wiederverwendbarer Daten. Wikibase kann, auch dank der kombinierten Nutzung mit Wikidata, ähnliche Projekte wie Parsifal hosten und so die Erstellung von Datenspeichern zwischen mehreren Institutionen ermöglichen. Diese Institutionen können die Daten unabhängig modellieren und verwalten, behalten die Kontrolle darüber und fördern die Wiederverwendung und Interoperabilität.

Comments

[edit]

Linking GND and Wikidata - a cooperation project

[edit]
(Etherpad notes)
  • language: en
  • time slot(s): 14:30-15:00 CEST
  • main author(s) and descriptions:
Barbara Fischer (Fischerdata): working at Deutsche Nationalbibliothek
  • abstract or description:

The project aims to establish a workflow to improve the linking of Wikidata items with GND IDs. The GND (Gemeinsame Normdatei) is the persistent identifier system of authority files of Germany, Austria and Switzerland. It consists of more than 10 million items and grows constantly. All data is licensed under CC0 and thus well suited to be reused within Wikidata. This happens already both by volunteers but mainly automatically based on VIAF files. For the pilot we focus on persons. To improve the coverage and the quality of the linkage the pilot trys to establish a direct exchange of data and will expand it furthermore into a semi-automated workflow. The presentation will show how we work and how far we have come by now.

Comments

[edit]

Czech Authority Files Meet Wikidata

[edit]
(Etherpad notes)
  • language: en
  • time slot(s): 15:00-15:30 CEST
  • main author(s):
Vojtěch Dostál (Vojtěch Dostál) [in person]
Jiří Sedláček (Frettie)
Petra Šťastná (Nestastna)
Linda Jansová (Linda.jansova)
  • abstract or description:

Since the inception of Wikidata, authority files produced by the National Library of the Czech Republic (NL CR) have been instrumental in improving its content and have left a strong footprint in terms of both size and quality. We will describe the contents of these authority files, their use in library practice and how they are shared with a wider audience. Examples will illustrate how data is currently recorded in the MARC 21 format and represented in Wikidata.

We will focus on importing personal name authority data from the Czech National Authority File to Wikidata, including how selected data is extracted from the original authority records and prepared in a dedicated Wikibase instance. Also, we will elaborate on how we deal with changes in both Wikidata items and Czech authority records and describe procedures and tools which have been implemented to keep the data synchronized (including error reporting options). The presentation will also briefly touch upon other Wikidata-related projects implemented in the NL CR, especially those using Wikibase Cloud provided by Wikimedia Deutschland.

Through these activities, the NL CR continues to strengthen its role in the global linked open data movement and in shaping the future of library data.

Comments

[edit]

Authority Data from a Librarian's Perspective in Contrast with Wikidata

[edit]
(Etherpad notes)
  • Dati d’autorità (authority data) dal punto di vista bibliotecario in contrapposizione a Wikidata
  • Normdaten aus bibliothekarischer Sicht im Spannungsfeld zu Wikidata
  • language: de/en
  • time slot(s): 16:00-16:30 CEST
  • main author(s):
Christian Aliverti
He is a member of the Managing Board of the Swiss National Library and Head of the Section of Bibliographic Access of the Swiss National Library. He is a member of the IFLA Cataloguing Section, the RDA Board and member of the Committee for Library Standards of the German speaking countries (Standardisierungsausschuss). He teaches principles of cataloguing to master students at the University of Zurich.
  • abstract or description:

The contribution deals with the relationship between library authority data and Wikidata. The first part illustrates the functions that authority data serve in library catalogs. The second part shows how libraries can learn from and benefit from non-library authority data, such as Wikidata. Finally, the limitations of Wikidata from the perspective of libraries are analyzed.

Il contributo tratta il rapporto tra i dati d’autorità (authority data) bibliotecari e Wikidata. Nella prima parte si illustrano le funzioni dei dati d’autorità nei cataloghi delle biblioteche. Nella seconda parte si mostra come le biblioteche possano imparare e trarre vantaggio dai dati d’autorità non bibliotecari, come ad esempio Wikidata. Infine vengono analizzati i limiti di Wikidata dal punto di vista delle biblioteche.

Der Beitrag behandelt das Verhältnis von bibliothekarischen Normdaten mit Wikidata. Im ersten Teil zeigt er die Funktionen Normdaten in Katalogen der Bibliotheken übernehmen. Der zweite Teil zeigt, wie die Bibliotheken von nicht bibliothekarischen Normdaten, wie zum Beispiel Wikidata lernen und Nutzen ziehen können. Schliesslich werden die Grenzen von Wikidata aus der Sicht der Bibliotheken thematisiert.

Comments

[edit]

Reusing Open Government Data in Wikidata – The Swiss Commune Register Case

[edit]
(Etherpad notes)
  • language: en
  • time slot(s): 16:30-17:00 CEST
  • main author(s): Benedikt Hitz-Gamper (Bern University of Applied Sciences)
  • abstract or description:

In recent years, the Swiss federal administration has expanded its Open Government Data activities, with initiatives such as LINDAS – the Linked Data Service of the Federal Administration – making high-quality, machine-readable datasets publicly available. Among these is the Swiss commune register, which provides authoritative information on all current and former communes and their historical changes. In this talk, we analyze the current state of Swiss commune data in Wikidata and demonstrates how the newly introduced _LINDAS ID_ property (P13740) can link entries to official LINDAS records. The session will discuss strategies for improving both data completeness and referencing, showing how Wikidata can benefit from authoritative, government-maintained linked data.

Comments

[edit]
[edit]
(Etherpad notes)
  • language: en
  • time slot(s): 17:00-17:30 CEST
  • approval: selected by academic committee
  • main author(s) and descriptions:
Adriano Rutz (Adriano Rutz): postdoctoral researcher at ETH Zürich
  • abstract or description:

The LOTUS initiative is an open, community-driven knowledge base on natural products and their biological occurrences. Its aim is to make this knowledge freely accessible, linked, and reusable across disciplines. Through its connection to scholarly items on Wikidata, LOTUS is linked to the WikiCite ecosystem, bringing chemistry and biology closer to the global graph of open citations. This talk will highlight how LOTUS uses Wikidata’s infrastructure: from linking article topics with Scholia to tracking structural revisions, retracted articles, and article accessibility through Unpaywall. We will also demonstrate SPARQL queries that reveal both the opportunities and the challenges arising from the current graph split, and discuss how better integration across Wikimedia projects could improve discoverability and impact for natural product research.

L'initiative LOTUS est une base de connaissances ouverte et communautaire sur les produits naturels et leurs occurrences biologiques. Son objectif est de rendre ces connaissances librement accessibles, interconnectées et réutilisables dans toutes les disciplines. Grâce à sa connexion au travers des articles scientifiques sur Wikidata, LOTUS est relié à l'écosystème WikiCite, rapprochant ainsi la chimie et la biologie du graphe mondial des citations ouvertes. Cette présentation mettra en évidence la manière dont LOTUS utilise l'infrastructure de Wikidata : de la mise en relation des sujets d'articles avec Scholia au suivi des révisions structurelles, des articles retirés et de l'accessibilité des articles via Unpaywall. Nous présenterons également des requêtes SPARQL qui révèlent à la fois les opportunités et les défis découlant de la division actuelle du graphe, et discuterons commment une meilleure intégration entre les projets Wikimedia pourrait améliorer la visibilité et l'impact de la recherche sur les produits naturels.

Die LOTUS-Initiative ist eine offene, gemeinschaftlich betriebene Wissensdatenbank über Naturprodukte und ihre biologischen Vorkommen. Ihr Ziel ist es, dieses Wissen frei zugänglich, vernetzt und fachübergreifend wiederverwendbar zu machen. Durch ihre Verbindung zu wissenschaftlichen Artikeln auf Wikidata ist LOTUS mit dem WikiCite-Ökosystem verknüpft und bringt Chemie und Biologie näher an das globale Netz offener Zitate heran. In diesem Vortrag wird erläutert, wie LOTUS die Infrastruktur von Wikidata nutzt: von der Verknüpfung von Artikelthemen mit Scholia bis hin zur Verfolgung struktureller Überarbeitungen, zurückgezogener Artikel und der Zugänglichkeit von Artikeln über Unpaywall. Wir werden auch SPARQL-Abfragen demonstrieren, die sowohl die Chancen als auch die Herausforderungen aufzeigen, die sich aus der aktuellen Aufteilung des Graphen ergeben, und diskutieren, wie eine bessere Integration zwischen den Wikimedia-Projekten die Auffindbarkeit und Wirkung der Naturstoffforschung verbessern könnte.

L'iniziativa LOTUS è una base di conoscenze aperta e gestita dalla comunità sui prodotti naturali e sugli organismi in cui si trovano. Il suo obiettivo è rendere queste conoscenze liberamente accessibili, collegate e riutilizzabili in tutte le discipline. Grazie al suo collegamento con gli articoli accademici su Wikidata, LOTUS è collegato all'ecosistema WikiCite, avvicinando la chimica e la biologia al grafico globale delle citazioni aperte. Questo intervento metterà in evidenza come LOTUS utilizza l'infrastruttura di Wikidata: dal collegamento degli argomenti degli articoli con Scholia al monitoraggio delle revisioni strutturali, degli articoli ritirati e dell'accessibilità degli articoli attraverso Unpaywall. Mostreremo anche le query SPARQL che rivelano sia le opportunità che le sfide derivanti dall'attuale divisione del grafico e discuteremo come una migliore integrazione tra i progetti Wikimedia potrebbe migliorare la reperibilità e l'impatto della ricerca sui prodotti naturali.

Comments

[edit]

Saturday 30th

[edit]

Connecting the Dots: Linking New Zealand Theses to the World Through Wikidata

[edit]
(Etherpad notes)
  • language: en
  • time slot(s): 09:00–09:30 CEST
  • main author(s):
Tamsin Braisher (DrThneed)
  • abstract or description:

The New Zealand Thesis Project is a collaboration between academic librarians and Wikimedians to upload a national set of dissertations to Wikidata. This project brings together metadata for more than 66,000 theses from 13 educational institutions, and by adding them to Wikidata, allows them to be easily accessed in multiple languages and cited on Wikipedia. Using OpenRefine to clean and manipulate our data, we have disambiguated more than 23,000 individuals connected to the thesis collection, allowing us to visualise our data in new ways and find unexpected connections. We developed a data model, collaborated with Australian Wikimedians to map controlled vocabularies to Wikidata, and improved data on New Zealand qualifications. The NZThesisProject is a testament to the power of collaboration and highlights the potential for academic institutions, academic librarians, and Wikimedians to work together towards a common goal. I’ll show how we did this work, and highlight some of the issues we face as we approach our second upload, and why more libraries should be connecting their collections to Wikidata.

Comments

[edit]

The DAMEIP project: adding data in Wikidata about the peer review process

[edit]
(Etherpad notes)
  • language: en
  • time slot(s): 9:30-9:55 CEST
  • main author(s):
Rossana Morriello
  • abstract or description:

This presentation explores the intersection of open access, bibliometrics, and authority control, focusing on the use of Wikidata to represent researchers, works, affiliations, and persistent identifiers. It draws on the research project DAMEIP (Data and Metadata to Implement Peer Review) conducted by a multidisciplinary team at the University of Florence, Italy, composed of researchers of library science, archival science and sociology. The aim of the project is to investigate peer review practices in journals published by Florence University Press (FUP), selected as a signicant sample of Italian open access academic publishing.

The project analyses, with full respect for anonymity and privacy, peer review procedures, timelines, and metrics, with particular attention to gender differences. To conduct the project, and to support future research, essential metadata from FUP journals will be added to Wikidata.

High-quality metadata in Wikidata allows discoverability, enables reviewer assignment, and supports open science. The systematic addition of high-quality metadata to Wikidata would enable the creation of a national-level platform capable of enhancing the open knowledge ecosystem. As AI-driven systems increasingly depend on such data, Wikidata’s role in scholarly communication and digital knowledge organization becomes ever more critical.

Comments

[edit]

Linked Open Data awareness in Polish LAM sector – Survey Results

[edit]
(Etherpad notes)
  • language: en
  • time slot(s): 10:05-10:30 CEST
  • main author(s):
Dorota Siwecka, University Library of the University of Wrocław
  • abstract or description:

This presentation shares the preliminary results of a survey focused on the awareness and use of Linked Open Data (LOD) and Wikidata among professionals working in the Libraries, Archives, and Museums (LAM) sector in Poland. The research examines their understanding of LOD principles, the interoperability tools currently employed, and the main challenges in sharing and integrating cultural data. Additional questions provided supplementary insights into familiarity with Wikidata as one of the platforms supporting LOD. The findings highlight the current level of familiarity with LOD and Wikidata, popular interoperability tools, and training needs within the sector. The results also point to the need for targeted digital training to support Polish cultural institutions in leveraging open, interoperable data effectively.

Comments

[edit]

Linking Malta’s Literary Past to the Digital Present - Adding Malta’s National Book Prize Winners to Wikidata

[edit]
(Etherpad notes)
  • language: en
  • time slot(s): 10:30-11:00 CEST
  • main author(s):
Enrique Tabone (EnriqueTabone) [in person]
  • abstract or description:

This presentation explores the ongoing collaboration between the Wikimedia Community Malta and Malta’s National Book Council through a Wikimedian in Residence (WiR) program. The project focuses on capturing and disseminating data related to the National Book Prize. This is part of a broader effort to raise awareness of the history of the National Book Prize, which was established in 1971. Many of the notable authors for this national award are now findable through Wikidata, which provides a substantial starting point to encourage the creation of articles about them in various relevant languages. By leveraging Wikidata, Wikimedia Commons, and the Maltese-language Wikipedia, the initiative aims to enhance the findability of Maltese literary culture and citations. This includes structured data entries, including relevant references, and the integration of relevant media files. This presentation highlights the initial plans, practical challenges, and future directions, including community engagement initiatives, as well as ways to expand this to other language editions of Wikipedia.

Comments

[edit]

Contemporary citations of Die Gartenlaube in ‚Small digital editions‘ of 19th century scientific literature today

[edit]
(Etherpad notes)
  • language: en
  • time slot(s): 11:30–12:00 CEST
  • main author(s):
Jens Bemme (User:jeb), SLUB Dresden
  • abstract or description:

Die Gartenlaube was a very first famous illustrated magazine in 19th century Germany. Die Datenlaube with Wikidata is a citizen science project for open bibliographic data of Die Gartenlaube articles in the German Wikisource. We mark references with Property:P2860 when we find old citations of contemporary scientific findings and their publications or references and self citations of older Gartenlaube texts and in article series. Today we cite articles of Die Gartenlaube while publishing about Die Datenlaube and Die Gartenlaube. And we add references of Die Gartenlaube and of other german and international periodicals in historical art guides to Wikidata items too, see the Wikisource edition of Friedrich von Boetticher's 'Paintings of the nineteenth century: contribution to the history of art', 1891/1895/1898/1901, https://scholia.toolforge.org/topic/Q120720904#context. How can we use such digital methods for digital collections and regional heritage portals of regional libraries? Our idea is now to link, cite and connect collections of Lucerne and Central Switzerland with digitised Saxon sources in Dresden like newspapers – in blogposts, metadata and other contemporary publications, today.

Comments

[edit]

Paulys Realencyclopädie der classischen Altertumswissenschaft (RE) 1893-1980 on Wikisource.de

[edit]
Etherpad notes
  • language: en
  • time slot(s): 12:00-12:30 CEST
  • main author(s):
Erik Sommer (User:THE IT)
Peter Hug (User:eLexikon)
  • abstract or description:

Paulys RE has a lot of information in 84 volumes on a high science level. The content is still relevant today, written in German and mixed with some Latin and Greek citations. The RE was published between 1890 and 1980, collecting 100.000 articles on 100.000 pages (columns). A lot of supplements makes it hard to find all text on a certain subject. New readers have to learn many details. We will explain the process of this complex digitization. Our project started in 2007, and it could be finished in 2080 according to the rules of copyright. In between we try to achieve some completeness.

Comments

[edit]

Interlinking of the Wikisource version of the Paulys Realencyclopädie der classischen Altertumswissenschaft (RE)

[edit]
Etherpad notes
  • language: en
  • time slot(s): 12:30-13:00 CEST
  • main author(s):
Mark Depauw
Margherita Fantoli (User:Ake1990)
  • abstract or description:

The Paulys Realencyclopädie der classischen Altertumswissenschaft (RE), comprising over 100,000 entries, is a foundational reference work for scholars of antiquity. It is currently being published on Wikisource, making it an ideal candidate for use as a knowledge base in the semantic annotation of classical texts. Within the NIKAW project, we selected the RE to support the disambiguation of individuals mentioned in a large corpus of Ancient Greek and Latin texts.

In the first part of this presentation, we outline the work carried out by the Trismegistos+ team to interlink Trismegistos People with RE entries. These results are already accessible via the Trismegistos portal. We then turn to an analysis of the alignment between RE-Wikisource entries on individuals and corresponding entries in Wikidata, detailing the current status of this linking effort and the ongoing work.

In particular, we explore how this linkage could facilitate the automated annotation of classical texts using RE-Wikisource as a semantic resource. However, several questions remain open—for instance, how to reconcile differing interpretations between Wikidata contributors and RE scholars, and how to ensure the long-term stability of these links.

Comments

[edit]

Nuovo Soggettario and Wikidata: comparison between two KOSs

[edit]
Etherpad notes
  • language: en
  • time slot(s): 14:00-14:30 CEST
  • main author(s):
Elisabetta Viti (Elisabetta Viti2) [in person]
Camillo Pellizzari (Epìdosis) [in person]
  • abstract or description:

The aim of this presentation is to discuss the main structural and functional similarities and differences between the Nuovo soggettario Thesaurus and Wikidata, and also to give an overview of their reciprocal interoperability. A brief history of the collaboration between the National Central Library of Florence (BNCF) and Wikimedia projects is outlined, starting from the first links added in the Thesaurus to Italian Wikipedia articles in 2005; this collaboration has recently been institutionalised with an agreement between the BNCF and Wikimedia Italia. Since 2023 the Thesaurus has been focusing on improving the interoperability with Wikidata, mainly through the manual revision of the unique-value and single-constraint violations in Wikidata (i.e. the cases in which the links between the two tools were not 1-to-1). As a case study, in 2024 a group of terms in the Thesaurus regarding photography was manually reconciled with the corresponding Wikidata items. Some significant examples found during the reconciliation process led to a revision of the meaning and form of the terms in the Thesaurus and Wikidata. Some general conclusions are listed about the possibilities of reciprocal improvement; and we foresee some future projects.

Comments

[edit]

Using Wikidata for Controlled Vocabularies

[edit]
Etherpad notes
  • language: en
  • time slot(s): 14:30-15:00 CEST
  • main author(s):
Peter F. Patel-Schneider
Ege Atacan Doğan
  • abstract or description:

Controlled vocabularies have similarities with knowledge graphs containing ontologies and data organized using that ontology, but there are differences in goals and scope between them. Large ontologies can offer advantages in depth and rigour and the multiple categories and properties they define can support controlled vocabularies. Using parts of a general-purpose knowledge graph like Wikidata as the basis of a controlled vocabulary requires delimiting the entities from the knowledge graph to be used and selecting what information about them is important. Using Wikidata further requires determining how to treat the errors in Wikidata, particularly errors in the Wikidata ontology.

Comments

[edit]

Wikimedia and Biomedical Online Resources

[edit]
Etherpad notes
  • language: en
  • time slot(s): 15:00-15:30 CEST
  • main author(s):
Houcemeddine Turki (Csisc) [in person]
  • abstract or description:

Wikidata, a collaborative open knowledge graph, offers a powerful platform for organizing biomedical data, but gaps and inconsistencies persist. Leveraging curated resources, such as UniProt, MeSH, OBO, and PubChem enables systematic enrichment with authoritative identifiers, synonyms, and relationships among biomedical entities. Additionally, machine learning (ML) algorithms and large language models (LLMs) present transformative opportunities for this process. ML-based entity linking and relation extraction can mine unstructured biomedical literature to suggest new statements and references for Wikidata, while LLMs like GPT-4 can assist in aligning text descriptions with structured data, generating candidate edits, and performing semantic validation. These approaches complement rule-based tools and bots that synchronize Wikidata with curated databases, enabling scalable enrichment workflows. Constraint-based validation frameworks, augmented by ML anomaly detection, can automatically flag inconsistencies and missing data. Together, these technical methods enhance Wikidata’s coverage, accuracy, and interoperability, supporting applications such as semantic search, biomedical knowledge graphs, and tools like Scholia. Collaborative efforts between informatics experts and the Wikidata community remain essential to sustain high-quality biomedical curation. Ultimately, combining curated resources with ML and LLM-driven enrichment advances Wikidata as a FAIR-compliant (Findable, Accessible, Interoperable, Reusable) knowledge hub accelerating open biomedical research.

Comments

[edit]

Wikiproject Personal collections

[edit]
(Etherpad notes)
  • language: en
  • time slot(s): 16:00-16:30 CEST (community space)
  • main author(s): User:Taniamaio
  • abstract or description:

Personal collections are organic complexes of published and/or unpublished materials collected and/or produced by significant people in the world of culture, professions and the arts, mainly from the second half of the 19th century onwards. ( Source)

The current descriptions of personal collection in WD are afflicted by a marked fragmentation, both terminological and structural, and lack of standardization. The aim of the Wikiproject personal collections is to build a set of controlled terms and create specific items for the description of personal collections, author libraries and personal archives, ensuring semantic coherence with the various linguistic versions of Wikimedia projects.

A dedicated Wikidata WikiProject and a standard data model for personal collections will enhance data consistency, interoperability, and GLAM professionals’ engagement, promoting richer, multilingual metadata and integration with catalogs and information sources.

Comments

[edit]

Scholarly profiling in Wikidata: Creating a knowledge graph for the SEEKCommons open science research network

[edit]
(Etherpad notes)

What are best practices of constructing a knowledge graph of a research network? This presentation will focus on our work to use Wikidata to improve the discoverability of research publications, datasets, and technologies in open science produced by members of the SEEKCommons network (Wikidata:SEEKCommons), and creating these workflows. Profiling research projects is not only useful for generating metrics about research outputs, it can also produce insights to help the community learn about itself and notice gaps. Members of our group will discuss data representations for different scholarly outputs. Name disambiguation challenges with labeling authors will also be discussed, including how we’ve employed the Scholia ecosystem to make disambiguation more efficient. We are inspired by existing work to bridge conversations across academic publishing, open science, and GLAMs.

Comments

[edit]

The State of Librarybase

[edit]
(Etherpad notes)
  • language: en
  • time slot(s): 17:00-17:30 CEST (community space)
  • main author(s): James Hare
  • abstract or description:

Librarybase is a Wikibase created in 2023 to serve as a general purpose bibliographic metadata repository. It can be used alongside Orb Open Graph, a private deployment of the Wikidata Query Service without the graph split and without timeouts. This presentation will discuss the project's vision, the projects making use of it, and possible future plans.

Comments

[edit]

The Linked Open Data strategy and what it means for your chapter

[edit]
(Etherpad notes)
  • language: en
  • time slot(s): 18:00-19:00 CEST
  • main author(s):
Alan Ang (Alan Ang (WMDE)) [in person]
Leif Lobinsky (Leif Lobinsky (WMDE)) [in person]
Lydia Pintscher (Lydia Pintscher (WMDE)) [in person]
  • abstract or description:

In this session we will go over the current Linked Open Data strategy that guides the development around Wikidata and Wikibase. We will look at the big picture opportunities and issues we need to address. We will then discuss how this will influence the work of your chapter and how Wikimedia Deutschland can support your work.

Comments

[edit]

MARC data on Wikibase using RDA/RDF

[edit]
(Etherpad notes)
  • language: en
  • time slot(s): 20:00–20:30 CEST
  • main author:
Michalis Gerolimos
  • abstract or description:

In 2020, the National Library of Greece (NLG) decided to utilize Linked Data technologies to transform its legacy MARC data. This presentation describes the decisions made, the steps taken and the outcomes of the project so far. It documents the vision behind the decision to use Wikibase, the actions to publish data as RDA/RDF and the lessons learned along the way.

Comments

[edit]

Wikibase to study ancient Greek literature: Hypotheseis and DataLib

[edit]
(Etherpad notes)
  • language: en
  • time slot(s): 20:30–21:00 CEST
  • main author:
Camillo Pellizzari (Epìdosis)
  • abstract or description:

This presentation shows how Wikibase can be used to create databases for literary and historical studies, describing briefly two Wikibase Cloud instances developed since 2024 to study ancient Greek literature.

The first Wikibase Cloud instance, Hypotheseis, aims to catalogue rhetorical exercises from the Imperial age to the end of the Middle Ages; its primary focus is on Ancient Greek, but a first expansion has been recently made to Armenian and further expansion to other languages (e.g. Latin) are possible; for rhetorical exercises, each theme is tagged with thematical tags for mentioned persons, subjects, events, and places.

The second Wikibase Cloud instance, DataLib, aims to collect structured data regarding the 1544 epistles of Libanius, an important rhetorician of the 4th century AD, and the persons mentioned by them; printed prosopographies already exist, but the added value of converting them in structured data will be the possibility of making complex queries using them. The database could be expanded to include others corpora of letters of the Late Antiquity.

Comments

[edit]

Extracting citation relations from legacy publications: The work of Basque scholar Koldo Mitxelena

[edit]
(Etherpad notes)
  • language: en
  • time slot(s): 20:30-21:00 CEST
  • main author(s):
David Lindemann (David Lindemann)
  • abstract or description:

We present experimental steps in the computational processing of the contents of Mitxelena's publications, and of relationships they contain. The written work of Koldo Mitxelena (1915-1987), Basque Philologist, is split in 1,665 single publications (745 unique articles). We have extracted citations of Mitxelena's articles from two Basque scientific journals, and collected them as citation relations in a Wikibase (Inguma Wikibase), on the one hand, and we have worked with the mentions of persons in Mitxelena's own work, on the other hand. We have recorded the mentions of persons in their context, along with relationships involving the entities describing these persons in Wikidata, in the same Wikibase instance. We have also included the results in Wikidata (see Mitxelena's citations at Scholia), for what we have taken some decisions, which we will discuss in this presentation.

Comments

[edit]

Wikibase: State of the Cite

[edit]
(Etherpad notes)
  • language: en
  • time slot(s): 21:00–21:30 CEST
  • main author(s): User:Olea
  • abstract or description:

We are going to explore what's the state of citing features for Wikibase users. The bad news first: we don't have most of them ready to use. But we can identify the related artifacts and structure a set of requirements contributed to the DARIAH-EU WG DHwiki and a project road map compatible with the Very Small GLAM initiative for setting up a useful set of citing features.

Comments

[edit]

Sunday 31st

[edit]

WDQS Graph Split — Overview and Query-a-thon

[edit]
  • language: en
  • time slot(s): 14:00-14:30 CEST
  • main author(s):
User:TiagoLubiana
  • abstract or description:

The graph split happened for real in May 2025, creating quirks for applications relying on the Wikidata Query Service. This session will present some simplified, WikiCite-focused documentation and create a space for collective work on adapting queries to the new system. Bring your questions and your queries and let's navigate the split together. Page with selected Scholia queries to be rewritten.

Comments

[edit]

Federation and data governance in Wikidata and the Wikibase Ecosystem - What’s happening? Where are we going?

[edit]
  • language: en
  • time slot(s): 14:30-15:00 CEST
  • main author(s):
User:Lydia Pintscher (WMDE)
Leif Lobinski (WMDE)
User:Alan Ang (WMDE)
  • abstract or description:

Scalability and sustainability of Wikidata is top of mind for the development team. We will talk about the current and future work around federation and data governance and how they relate to WikiCite. In the remainder of the session we’d like to discuss possible future scenarios for the scholarly article corpus in Wikidata and their consequences.

Comments

[edit]

Update on the Citation Typing Ontology citation intention annotations

[edit]
  • language: en
  • time slot(s): 16:00-16:10 CEST
  • main author(s):
User:Egon Willighagen
  • abstract or description:

This lightning talk gives an update on the use of Wikidata to capture and disseminate citation intention annotations with the Citation Typing Ontology (CiTO). It will cover the main information sources (data sets, journal articles/preprints, and nanopublications, Rogue Scholar blog posts) and the use of Scholia to visualize citation intentions for works and venues (like journal articles and journals).

Comments

[edit]

Retracted articles in Wikidata and how to use them

[edit]
  • language: en
  • time slot(s): 16:10-16:20 CEST
  • main author(s):
User:Egon Willighagen
  • abstract or description:

This lightning talk will review the approximately 22 thousand retracted articles in Wikidata and shows how this knowledge can be used. It will cover how information from the Retraction Database is included in Wikidata using the CrossRef API and a custom script. It will show how Wikidata SPARQL queries from https://bigcat-um.github.io/sparql-examples/examples/WikidataRetractions/ can be used to get more insight on the impact of these retractions on Wikidata, Wikipedia, and science itself.

Making GLAM resources more accessible and reusable: a FAIR case study on European Literary Bibliography

[edit]
  • language: en (es)
  • time slot(s): 16:20-16:30 CEST
  • main author(s):
Gustavo Candela
Cezary Rosiński
Arkadiusz Margraf
  • abstract or description:

This study presents a reproducible framework for publishing and reusing bibliographic metadata from GLAM, focusing on the European Literary Bibliography. It employs Linked Open Data transformation, metadata enrichment by means of Wikidata, and computational reuse via Jupyter Notebooks. The work has been recently published in the form of a research article and presented in several conferences.

Comments

[edit]
  • I add here a link to the related publication.

Wikimedia projects as Diamond Open Access platforms

[edit]
  • language: en
  • time slot(s): 16:30-16:40 CEST
  • main author(s): Daniel Mietchen
  • abstract or description:

Diamond open access is a concept commonly referring to an arrangement in which academic publications are funded such that neither the readers nor the authors have to pay and the costs are instead borne by some other entity (e.g. a research funder, an institution or a scholarly society). In this session, we will explore this concept through the lens of WIkimedia projects. This will include, for instance, books published via Wikibooks, journals published via Wikiversity, legacy publications archived and annotated on Wikisource, media files published on WIkimedia Commons or structured data being published and curated on Wikidata. We will also touch upon how Wikimedia platforms provide background information on Diamond open access more generally, from individual publications to the stakeholders involved in the matter and relevant legal and policy frameworks.


Comments

[edit]

Nanopublications and the Wikidata/ Wikibase ecosystem

[edit]
  • language: en
  • time slot(s): 16:40-16:50 CEST
  • main author(s): Daniel Mietchen
  • abstract or description:

Knowledge graphs like Wikidata and other Wikibase projects are powerful tools for creating and sharing knowledge. In this talk, we'll explore how nanopublications can be used to improve how these projects work.

Nanopublications are a simple way to publish small pieces of information, such as a single claim, along with details on its origin and who published it. Think of them as tiny, verifiable data packages, or mini versions of knowledge graphs. We'll discuss several ways in which nanopublications can connect with the Wikidata/Wikibase ecosystem. For example, they can be used to:

  • Add new information to Wikibase.
  • Export data from Wikibase.
  • Reference an external source for a Wikibase statement.
  • Track and report errors in Wikibase statements, as well as confirm when those errors have been fixed.
  • Synchronize information across different knowledge graphs.
  • Report on inconsistencies between different knowledge graphs or different parts of the same knowledge graphs.
  • Thank someone for some contribution to a knowledge graph or to a resource cited from it.

During this session, we will look at real-world examples and open up a discussion about the potential of nanopublications to enhance the WikiCite community's work.

Comments

[edit]

Planning the Wikimedia Science Hub

[edit]
Science Hub
  • language: en
  • time slot(s): 16:50-17:00 CEST
  • main author(s):
Daniel Mietchen
user:bluerasberry
  • abstract or description:
Sign up at Science Hub/join

Within the Wikimedia ecosystem, Hubs are intended to facilitate community interactions among existing wiki community organizations within a certain scope. The scope can be regional, thematic or linguistic, and the Science Hub is a thematic one, to be officially proposed later this year. It seeks to facilitate Wikimedia activities relating to scientific and other scholarly content from all disciplines of systematic inquiries. In terms of knowledge domains, this includes the natural sciences, social sciences, humanities, engineering, mathematics, medicine and so on. On the wiki side, it includes all content, communities and infrastructure of Wikimedia wikis as well as supporting activities.

The goal of the Science Hub is to serve as an administrative and grantmaking center for the scientific community of affiliates and user groups in the Wikimedia Ecosystem. Our goal is to develop networks in each of the Wikimedia Regions, especially to help community members find partners for projects and granting opportunities in general. The hub is especially set up to support community members in finding partners, finding grants, and administering grants. This assistance can be for instance to develop standard ways for Wikimedia affiliates to report their science based activity, developing regular reports of the scientific initiatives and efforts that Wikimedia is undertaking, providing example inter-university grant agreements, and even serving as a fiscal sponsor where it makes sense.

In this session, we want to explore in what ways the proposed Science Hub could assist ongoing and future WikiCite-related activities.

Comments

[edit]

Status of user groups

[edit]
  • language: en
  • time slot(s): 17:00-17:05 CEST
  • main author(s):
User:Alexmar983
  • abstract or description:

It’s a rather "dry" topic, but since I heard that some users want to propose an "Ontology User Group", I mentioned that I had drafted a possible "Connectivity User Group" years ago. Getting critical mass is challenging, so I imagined that we could merge the two ideas.

This though led me to explore the different user groups at the meta level related to Wikidata. While this isn’t a concrete planning, it’s worth considering how the situation might (not) evolve and I am just curious to hear the overall feedback on this.

Which Wikidata-related themes do you think might require dedicated representation through a thematic affiliate in the long term? What about a WikiCite User Group for example?

How do you see the future development of these thematic groups?

  • Proposed by: Community Proposal
  • slides on Wikimedia Commons: Not sure if I will have time to prepare something

Comments

[edit]
That would be useful information for those considering a proposal submission. Not me — while I'm not submitting one myself, my aim is to support the development of more stable and well-structured proposals.--Alexmar983 (talk) 20:06, 20 August 2025 (UTC)[reply]


Date of death of researchers and academic staff

[edit]
  • language: en
  • time slot(s): 17:05-17:10 CEST
  • main author(s):
User:Alexmar983 (and Camillo Pellizzari)
  • abstract or description:

I am one of the few users in Italy who regularly monitors local news sources and adds concise, well-referenced information about the dates of death of academics. When the deceased is not a moderately famous politician, litetary author, or media figure, you’re often the first person to notice and document it the "date of death" (P570) property. Since Pisa is a university town, the local press generally covers these topics efficiently, for other town this might not be the case. I am curiuos to monitor how this could go in other areas.

We held a class with Camillo in Pisa with the Univeristy Librarians, where my session focused on managing data related to researchers. I specifically used queries for unusually old professors to check whether P570 was missing and insert proper values and sources as an exercise. It’s something that can easily go unnoticed for several years.

In the field of humanities, integration with national authority databases can help. However, if no books have been published, those databases often fail to provide reliable sources for avarage researchers. In many cases, when local news cannot serve as a source, the available information is often indirect — a puzzle composed of private web pages or obituary-style mailing list messages. In theory, we could upload on Commons images of public obituary posters.

Sometimes the authority record that should be updated (besides Wikidata item) is not available in the Italian system, but may exist in another library network or national archive that provides more specialized profiling—such as records for theses completed abroad or book chapters—or simply has more resources to catalog a wider range of individuals than the italian national OPAC SBN system. Some users with access to national authority files from other countries, however, may not consider various indirect sources sufficiently reliable to update their record. As a result, crucial information can remain overlooked for years even with proper good will and a note can be left in the talk pages of the item.

Let’s discuss a few case studies or examples. What strategies do you think are most effective in addressing this issue?

Comments

[edit]

Using Citation Watchlist to monitor source addition and removal from articles

[edit]
  • language: en
  • time slot(s): 17:10-17:20 CEST
  • main author(s):
Jake Orlowitz
James Hare
  • abstract or description:

Citation Watchlist is an interactive tool developed by the nonprofit organization Hacks/Hackers with support from Wikimedia CH. It adds visual indicators to watchlists, recent changes, user contributions pages, and page histories when unreliable sources are added to articles. Recently, it gained a new feature: tracking the removal of any URL. This presentation will include a demonstration of Citation Watchlist and discussion of ideas for the future.


Comments

[edit]

Introducing the Wikipedia Citations Database

[edit]
  • language: en
  • time slot(s): 17:20-17:30 CEST
  • main author(s): James Hare
  • abstract or description:

The Wikipedia Citations Database seeks to create a structured database around the unstructured citation strings that appear in articles. Because Wikipedia itself does not have a structured database of citations, the citations that appear on articles can be formatted in different ways and may be ambiguous references (like a last name and a page number with context provided elsewhere). This project represents the first step of separating out the citations from the non-citations. Matching citation to with actual known publications is left to subsequent derivation steps.

Comments

[edit]