From Meta, a Wikimedia project coordination wiki
Jump to navigation Jump to search

Project Grants This project is funded by a Project Grant

proposal people timeline & progress finances midpoint report

Timeline for DBpedia[edit]

Timeline Date
Study (choose two initial sync targets and analyse the lack of references in Wikidata) Day Month Year
GlobalFactSync tool (extend the current prototype with new features) Day Month Year
Mapping Refinements Day Month Year
GlobalFactSync WikiData ingest Day Month Year
GlobalFactSync Sprints Day Month Year

Monthly updates[edit]

Please prepare a brief project update each month, in a format of your choice, to share progress and learnings with the community along the way. Submit the link below as you complete each update.

Current tasks[edit]

A log of current tasks is kept here. Ongoing discussions should be held using the corresponding discussion page.

(Preparation) April/May[edit]

June 2019 (official start)[edit]


First Release Report: A first release containing detailed information about our micro-services is published on the DBpedia Blog


  • First success story
  • Deployment of first micro-services on the server
  1. Initial User Interface here
  2. PreFusion JSON API here (user: read, pw: gfs)
  3. Reference Extraction Service here
  4. Reference Data Download here
  5. Infobox Extraction Service here
  6. ID service here
  1. definition of a set of problems with different layers of complexity
  2. analysis of various groups of subjects with respect to these synchronization problems


  • Continuing improvements of the first deployments, which will be an ongoing process. Especially the GFS Data Browser is being worked on:
    • users can now insert any Wikipedia URL into the subject search field
    • overall layout improvements
    • reference information is being added
  • Johannes Frey presented the GFS project at Wikimania
  • We created a news page within our Meta-Wiki project page framework for volunteers to keep them in the loop and encourage exchange. So far this has lead to three more volunteers signing up for our 'GFS Feedback Squad' and two users leaving feedback about our sync target study.


  • more work towards sync target study, focus on targets that were brought up by Wikidata users (e.g., geo coordinates, employer, nobel price)
  • intensive work on creating the complement to Wikidata and Wikipedia by collecting and providing data that is currently missing in both



  • re-extraction of GFS data and fusion
  • some work on the UI
  • identifying and testing ways to generate lists of the Wikipedia articles related to selected topics: categories, infoboxes, Wikidata queries and other articles (lists).


  • extraction of reference data for Polish cities; studied sources: BDL - Bank Danych Lokalnych, Wikipedia, Wikidata
  • analysis of available mappings between various geographical identifiers for Polish administrative units
  • GFSre - reference datasets for Polish cities.pdf
    showing current understanding of the fusion challenge

January 2020[edit]

February 2020[edit]

Planned Next Steps[edit]

  • experiment prototype for improved harvesttemplate
    • index Infoboxes / Templates
  • watch for feedback of new mockup
  • incorporate demo (hard-coded) references view into GFS browser using the novel JSON references dump
  • GFS browser features
    • include mapping management to allow search for properties of new external sources

Is your final report due but you need more time?