Research talk:Teahouse long term new editor retention/Work log/2015-11-25
Add topicWednesday, November 25, 2015
[edit]Today, I'm working with instances of <!-- Template: ... --> found on talk pages. This is commonly used when a template is substituted onto the page to aid in tracking which templates are being used. What I'd really like to do is build list of those templates that correspond to warnings so that we can see the rise of warnings in Wikipedia.
First, let's look at the most common templates:
> month_templates[,list(postings = sum(postings)), by=template][order(postings, decreasing=T),][1:100]
template postings
1: template:uw-vandalism1 7598768
2: template:unsigned 5232148
3: template:uw-cluebotwarning1 5176690
4: template:uw-vandalism2 3554289
5: template:uw-huggle1 3038861
6: template:db-csd-notice-custom 2976716
7: template:uw-vandalism3 2467730
8: template:unsignedip 2021922
9: template:uw-huggle2 1772924
10: template:uw-vandalism4 1729712
11: template:shared ip advice 1715998
12: template:welcome 1537006
13: template:unsigned ip 1358353
14: template:uw-cluebotwarning2 1351987
15: template:orphaned 1338576
16: template:uw-huggle3 1151319
17: template:di-orphaned fair use-notice 939837
18: template:undated 921745
19: template:uw-huggle4 869519
20: template:welcomeg 868722
21: template:no fair 831747
22: template:uw-test1 795100
23: template:prodwarning 790365
24: template:first article 786359
25: template:uw-delete1 776057
26: template:ani-notice 770452
27: template:uw-cluebotwarning3 764645
28: template:afc decline 693823
29: template:db-notability-notice 641948
30: template:uw-3rr 639090
31: template:afdwarning 608508
32: template:welcome-anon 599767
33: template:idw 599545
34: template:uw-vblock 590827
35: template:di-no fair use rationale-notice 589424
36: template:test5 583995
37: template:uw-unsourced1 580988
38: template:uw-block1 564287
39: template:smile 560905
40: template:uw-delete2 549592
41: template:proposed deletion notify 532922
42: template:dykproblem 531822
43: template:uw-cluebotwarning4 531352
44: template:adw 498872<nowiki><!--
45: template:uw-vandalism4im 489806
46: template:tfdnotice 487919
47: template:nn-warn 483352
48: template:uw-block 480144
49: template:di-no source-notice 479116
50: template:firstarticle 478585
51: template:afd-notice 476222
52: template:frs message 424695
53: template:uw-spam1 421547
54: template:uw-test2 413546
55: template:db-bio-notice 413490
56: template:db-spam-notice 411424
57: template:updateddyknom 393143
58: template:uw-tilde 382693
59: template:di-replaceable fair use-notice 370515
60: template:missing rationale 367146
61: template:fdw 335226
62: template:uw-unsourced2 328842
63: template:teahouse hostbot invitation 309877
64: template:uw-delete3 306730
65: template:cfd-notify 304796
66: template:uw-block2 302805
67: template:updateddyk 295040
68: template:di-no license-notice 285162
69: template:db-nocontext-notice 282275
70: template:di-no permission-notice 278691
71: template:archivebox 278532
72: template:you can request undeletion 272489
73: template:prodwarningblp 267398
74: template:db-copyvio-notice 261870
75: template:reviewer-notice 261439
76: template:idw-pui 259508
77: template:welcomemenu 254433
78: template:uw-editsummary 239236
79: template:tilde 234289
80: template:uw-coi 230890
81: template:uw-huggletest1 225330
82: template:archivebox ends 218960
83: template:archivebox begins 216279
84: template:db-vandalism-notice 209334
85: template:uw-ablock 204101
86: template:uw-error1 201245
87: template:db-afc-notice 200222
88: template:uw-copyright 197881
89: template:drn-notice 197678
90: template:fdw-puf 191015
91: template:uw-huggledelete1 190227
92: template:image source 189983
93: template:uw-spam0 184244
94: template:mfdwarning 183830
95: template:db-nocontent-notice 181716
96: template:uw-test3 178027
97: template:uw-spam2 175015
98: template:afc talk 171398
99: template:teahouse afc invitaiton 169603
100: template:blatantvandal-n 168648
OK. Looks like we have quite a few to review! First let's try to gather the obvious warnings.
> common_templates[regexpr("^template:uw-", template) != -1,]
template postings
1: template:uw-vandalism1 7598768
2: template:uw-cluebotwarning1 5176690
3: template:uw-vandalism2 3554289
4: template:uw-huggle1 3038861
5: template:uw-vandalism3 2467730
---
427: template:uw-ew4 108
428: template:uw-attempt4 105
429: template:uw-notenglish-fr 102
430: template:uw-deletionpolicy1 101
431: template:uw-tempabuse4 101
OK. Looks like there's a lot of those. I saw some 3rr and "warning" stuff in there too.
> common_templates[regexpr("^template:(uw-|3rr|drm|.*warning.*)", template) != -1,]
template postings
1: template:uw-vandalism1 7598768
2: template:uw-cluebotwarning1 5176690
3: template:uw-vandalism2 3554289
4: template:uw-huggle1 3038861
5: template:uw-vandalism3 2467730
---
467: template:drmspeedy3-n 106
468: template:uw-attempt4 105
469: template:uw-notenglish-fr 102
470: template:uw-deletionpolicy1 101
471: template:uw-tempabuse4 101
OK. Cool. Now lets group those together by month and make some plots!
We can see the steep rise in warning template postings in 2006. It's interesting how the warning postings are periodic. The values seem to conform to the summer months -- when kids are not in school (and outside rather than vandalizing wikipedia??). The teahouse template postings appear to begin in 2013, but I know that this is merely when the template began to be flagged since I made the edit that added the <!-- ... --> to the template content in January of 2013 (see en:Special:Diff/532920543). --Halfak (WMF) (talk) 18:28, 25 November 2015 (UTC)

