Research:Wiki Participation Challenge

From Meta, a Wikimedia project coordination wiki
Contact
Diederik van Liere
This page documents a completed research project.


Key Personnel[edit]

  • Diederik van Liere
  • Howie Fung

Project Summary[edit]

This competition challenges data-mining experts to build a predictive model that predicts the number of edits an editor will make in the five months after the end date of the training dataset. The dataset is randomly sampled from the English Wikipedia dataset from the period January 2001 - August 2010.

The objective of this competition is to quantitively understand what factors determine editing behavior. We hope to be able to answer questions, using these predictive models, why people stop editing or increase their pace of editing.

Contestants are expected to build a predictive model that can be reused by the Wikimedia Foundation to forecast long term trends in the number of edits that we can expect.

Methods[edit]

Participants are free to use any econometric / statistical method or machine learning approach.

Dissemination[edit]

The explanations of the algorithms can be found here:

Wikimedia Policies, Ethics, and Human Subjects Protection[edit]

Benefits for the Wikimedia community[edit]

Output will consist of algorithms and source code that will predict future editing behaviour.

Time Line[edit]

The competition ran from Tuesday 28 June 2011 until Tuesday 20 September 2011, for a total of 84 total days.

References[edit]

External links[edit]

http://www.kaggle.com/c/wikichallenge

Contacts[edit]