Research:Editor Behaviour Analysis & Graphs

From Meta, a Wikimedia project coordination wiki
Jump to navigation Jump to search
Contact
no affiliation
Duration:  2015-06 — ??
GearRotate.svg

This page documents a research project in progress.
Information may be incomplete and change as the project progresses.
Please contact the project lead before formally citing or reusing results from this page.


The project aims to explore different way to visualise the entire edit activity of a wiki from its birth. The graphs should help us understand macro/community level changes in editor activity.

Graphs[edit]

The graphs are broadly divided into two. The graphs with the data for articles and the same with the data for editors.

Methods[edit]

The wiki db's on toollabs are queried for the data & then transformed into graphs.

Definitions[edit]

  • Active Editor
    • An 'active editor' is a registered (and signed in) person (not known as a bot) who makes 5 or more edits in any month in mainspace on countable pages.[1]
  • Editor Group/Cohort
    • Editors are grouped by the month of their first edit.
  • Active Article
    • Similar to the active editor an active article is defined as an article which receives 5 or more edits in any month.
  • Article Group/Cohort
    • Articles are grouped by the month of their creation.

Research questions[edit]

The high-level research questions that we want to answer visually are the following:

  • What is the average longevity of editors and how has it changed over the years & across different languages.
  • How does the contribution of new editors in their first month compare with existing editors?
    • How is it in terms of edit sessions?
    • How is it in terms of content/bytes added?
    • How has it changed over the years?
  • Are users finding it difficult to discover articles where they can meaningfully contribute?
  • Is the reducing/plateauing editor count a result of a maturing encyclopedia?
  • What does edit activity look like from an article's perspective?

Goals[edit]

  • Turn the graphs into a dashboard to understand macro changes in a language/community wiki.
  • As a dashboard for the grants & evaluation team.

Potential Ideas[edit]

Preliminary Results[edit]

Editors[edit]

Articles[edit]

  • Longevity of articles
    • In all of the languages which were analysed (ru, es, it etc) it is the articles that were created in the initial days (birth of the wiki to 2004 - 2005) that are continuing to get edited. The articles created since then get a storm of edit activity when they are created but subsequently see very little edit activity on them.
  • Edit Activity on articles in a given month.
    • Two major chunks of edit activity can be observed. the biggest % of edit activity happens in the articles that are created in the month in question. The second chunk of activity happens with the articles created in the early days of the wiki.

Results[edit]

  • The monthly decline in active editors in en may be attributed to the old timers. We see this across other languages too.
  • Only the articles in the (beginning - 2007) cohorts continue to see active edit activity.
  • Retention rates are much higher in languages like de.
  • zh (continues to show uptrend on many fronts. Active Editors, retention etc).

Gallery[edit]

Links[edit]

References[edit]

See also[edit]