Research talk:Automated classification of edit types/Work log/2016-03-24

Thursday, March 24, 2016

Loading some edit type campaigns today. Let's take notes! First, the currently active campaigns:

u_wikilabels=> select id, wiki, name from campaign where active;
 id |     wiki     |                         name                          
----+--------------+-------------------------------------------------------
  4 | enwiki       | Edit quality (20k random sample, 2015)
  8 | azwiki       | Edit quality (20k random sample, 2015)
  5 | trwiki       | Değişiklik kalitesi (20,000 rastgele örnekleme, 2015)
  7 | ptwiki       | Qualidade das edições (amostra de 20k revisões, 2015)
  9 | frwiki       | Modifier la qualité (20k échantillon aléatoire, 2015)
 12 | eswiki       | Editar calidad (20k muestra aleatoria, 2015)
 14 | nlwiki       | Kwaliteit bewerken (20k steekproef, 2015)
 10 | ruwiki       | Качество правок (20-тыс. случайная выборка, 2015)
 11 | ukwiki       | Якість редагувань (вибірка випадкових 20 тис., 2015)
 15 | jawiki       | 編集品質( 20Kランダムサンプル)
 16 | dewiki       | Qualität Edit ( 20k Zufallsstichprobe )
 17 | etwiki       | Edit kvaliteet ( 20k juhuslik valim )
 18 | itwiki       | Qualità degli edit (campione casuale di 20k edit)
 19 | wikidatawiki | Edit quality (20k balanced sampled)
 13 | idwiki       | Kualitas suntingan (20k sampel acak, 2015)
 21 | fawiki       | کیفیت ویرایش نسخه ۲ (نمونه تصادفی ۲۰ هزارتایی،۲۰۱۵)
 22 | enwiki       | Edit types (0.5k sample)
 23 | urwiki       | معیار ترمیم کریں ( 5K متوازن )
 24 | plwiki       | Edycja jakości (20k próba losowa, 2015)
 25 | hewiki       | איכות ערוכה ( 5k מאוזן )
 26 | viwiki       | Sửa chất lượng ( 5k cân bằng)
 27 | nowiki       | Edit kvalitet ( 5k balansert)
(22 rows)

OK, first, let's deactivate the current enwiki edit types campaign (id 22).

u_wikilabels=> update campaign set active = False where id = 22;
UPDATE 1

Now to create the new enwiki and itwiki edit type training campaigns.

u_wikilabels=> INSERT INTO campaign (name, wiki, form, view, created, labels_per_task, tasks_per_assignment, active) VALUES ('Edit type training (50 revisions)', 'enwiki', 'edit_type', 'DiffToPrevious', NOW(), 20, 10, True);
INSERT 0 1
u_wikilabels=> select id, wiki, name from campaign where active and wiki = 'enwiki';
 id |  wiki  |                  name                  
----+--------+----------------------------------------
  4 | enwiki | Edit quality (20k random sample, 2015)
 28 | enwiki | Edit type training (50 revisions)
(2 rows)

u_wikilabels=> INSERT INTO campaign (name, wiki, form, view, created, labels_per_task, tasks_per_assignment, active) VALUES ('Modifica tipo di formazione (100 modifiche)', 'itwiki', 'edit_type', 'DiffToPrevious', NOW(), 20, 10, True);
INSERT 0 1
u_wikilabels=> select id, wiki, name from campaign where active and wiki = 'itwiki';
 id |  wiki  |                       name                        
----+--------+---------------------------------------------------
 18 | itwiki | Qualità degli edit (campione casuale di 20k edit)
 29 | itwiki | Modifica tipo di formazione (100 modifiche)
(2 rows)

OK. Now to load the task data into the two new campaigns (ids 28 and 29).

halfak@wikilabels-01:~/datasets$ cat enwiki.revision_sample.50_hand-picked.tsv | /srv/wikilabels/venv/bin/wikilabels task_inserts 28 | psql -h wikilabels-database --user u_wikilabels u_wikilabels -W 
Password for user u_wikilabels: 
INSERT 0 70

halfak@wikilabels-01:~/datasets$ cat itwiki.revision_sample.100k_article_size-changing.tsv | /srv/wikilabels/venv/bin/wikilabels task_inserts 29 | psql -h wikilabels-database --user u_wikilabels u_wikilabels -W 
Password for user u_wikilabels: 
INSERT 0 100
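
The row count that psql reports after each load can be compared against the input file beforehand. The sketch below is one way to do that, not something taken from this log: `count_rows` is a hypothetical helper (it is not part of wikilabels), and it assumes each non-empty line of the TSV is one task. Whether these files carry a header row is not stated here, so the number of header lines to skip is an optional argument defaulting to 0.

```shell
#!/bin/sh
# Hypothetical helper, not part of wikilabels: count the non-empty lines
# in a TSV file, optionally skipping a number of leading header lines.
# Assumption: one task per non-empty line.
count_rows() {
  file=$1
  skip=${2:-0}
  # tail -n +K starts output at line K, so K = skip + 1 drops the header lines.
  # grep -c . counts lines with at least one character; it exits non-zero
  # when the count is 0, hence the "|| true".
  tail -n "+$((skip + 1))" "$file" | grep -c . || true
}
```

For example, `count_rows enwiki.revision_sample.50_hand-picked.tsv 1` would print the number of data rows if the file has a single header line. That number can then be checked against the `INSERT 0 N` line printed by psql.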

And that's all. :) --Halfak (WMF) (talk) 18:59, 24 March 2016 (UTC)