Research talk:New page reviewer impact analysis/Work log/2017-06-09

From Meta, a Wikimedia project coordination wiki

Friday, June 9, 2017[edit]

OK so today I have an hour to think about this. My goal is to put a (probably naive) plan together to get page creation events. I think I can do a good job of this based on my past work in Research:Wikipedia article creation.

I have https://github.com/halfak/Wikipedia-article-creation-research to draw from too.

It looks like I need to run https://github.com/halfak/Wikipedia-article-creation-research/blob/master/sql/enwiki/creations.table.sql using the output of https://github.com/halfak/Wikipedia-article-creation-research/blob/master/sql/enwiki/pages.table.sql

Both of these look like they took an inane amount of time to run so I'm not too excited about running them again. Hmm... maybe I can sample. Let's say I get 10k page creations from each year since 2008 (when we started tracking these things). Hmm --EpochFail (talk) 22:22, 9 June 2017 (UTC)[reply]