Research:Top edited Portuguese Wikipedia articles in 2016

Duration:  2017-2 – 2017-2
This page documents a completed research project.


Following similar research on Wikipedia in other languages, this project presents a list of the most edited pages on Portuguese Wikipedia in 2016.

Methods

Currently, all Wikipedia projects run on the MediaWiki software, maintained by the Wikimedia Foundation. Whenever an edit is made in this software, a new record is added to the revision table of the database, describing that edit (when it was made, on which page, who made it, what changed, etc.). Knowing this, we measure the number of edits by counting the records in the revision table for each page during 2016. This requires access to the Portuguese Wikipedia database. A full backup of the database can be downloaded from Wikimedia Downloads, or queries can be run online through Quarry.
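The counting idea can be tried on a toy dataset before touching the real database. The sketch below is only an illustration: it uses an in-memory SQLite database instead of the MariaDB replica, models only the page and revision columns needed for counting, and all rows and page titles in it are invented. The helper name count_edits is hypothetical.

```python
import sqlite3

# Toy stand-ins for MediaWiki's `page` and `revision` tables. Only the
# columns used by the counting step are modeled; every row is invented.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE page (page_id INTEGER PRIMARY KEY, page_namespace INTEGER, page_title TEXT);
    CREATE TABLE revision (rev_id INTEGER PRIMARY KEY, rev_page INTEGER, rev_timestamp TEXT);
""")
conn.executemany("INSERT INTO page VALUES (?, ?, ?)", [
    (1, 0, "Artigo_A"),
    (2, 0, "Artigo_B"),
    (3, 2, "Exemplo"),          # namespace 2 (User): not an article
])
conn.executemany("INSERT INTO revision VALUES (?, ?, ?)", [
    (10, 1, "20151231235959"),  # 2015: outside the period
    (11, 1, "20160101000000"),
    (12, 1, "20160615120000"),
    (13, 2, "20161231235959"),
    (14, 2, "20170101000000"),  # 2017: outside the period
    (15, 3, "20160301000000"),  # edit to a non-article page
])


def count_edits(connection, year):
    """Return [(page_title, total_edits)] for main-namespace pages edited in `year`."""
    cursor = connection.execute(
        """
        SELECT page.page_title, COUNT(*) AS total_edits
        FROM revision
        INNER JOIN page ON revision.rev_page = page.page_id
        WHERE revision.rev_timestamp LIKE ?
          AND page.page_namespace = 0
        GROUP BY page.page_title
        ORDER BY total_edits DESC, page.page_title ASC
        """,
        (f"{year}%",),
    )
    return cursor.fetchall()


edit_counts = count_edits(conn, 2016)
print(edit_counts)  # [('Artigo_A', 2), ('Artigo_B', 1)]
```

Because the timestamp is stored as a YYYYMMDDHHMMSS string, a prefix match on the year is enough to select the period.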

This approach is similar to Top edited English Wikipedia articles in 2016, but since the queries here are meant to be run only once, performance was not a concern. The result of the query below is therefore exactly the main objective of this project.

Query executed in the database (SQL language)
use ptwiki_p;

set @total_edit_rank = 0;
set @prev_edit_count = 0;

SELECT
  t.rank
  , REPLACE(t.page_title, '_', ' ') page_title
  , t.total_edits
FROM
  (
    SELECT
      t.*
      , @total_edit_rank := @total_edit_rank + IF(@prev_edit_count=t.total_edits, 0, 1) AS rank
      , @prev_edit_count := t.total_edits
    FROM
      (
        SELECT
          page_title
          , COUNT(*) AS total_edits
        FROM
          revision
          INNER JOIN page
            ON (revision.rev_page = page.page_id)
        WHERE
          revision.rev_timestamp LIKE '2016%'  -- timestamps are stored as YYYYMMDDHHMMSS strings
          AND page.page_namespace = 0          -- main (article) namespace only
        GROUP BY
          page.page_title
      ) t
    ORDER BY
      t.total_edits DESC
      , t.page_title ASC
  ) t
WHERE
  (t.rank <= 100)
;

In this approach, we list the 100 most edited positions using a dense ranking: pages with the same number of edits share the same position in the rank, and no position is skipped. The number of listed pages may therefore vary, but it is guaranteed to be equal to or greater than 100 (116 in the results below). We also restrict the count to the main namespace, that is, to article pages only.[note 1]
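The tie handling described above can be sketched outside the database. The following is a minimal Python illustration, not the code used in this project: the function name dense_rank and its limit parameter are hypothetical, and the sample rows are five entries taken from the results table, so their ranks here start at 1 rather than at their real positions. Python's default string ordering may also differ from the database collation for titles with accented characters.

```python
def dense_rank(rows, limit):
    """Rank (title, total_edits) rows so that ties share a position.

    Rows are sorted by edits (descending), then by title (ascending). The
    rank only advances when the edit count changes, so no position is
    skipped and more than `limit` rows may be returned.
    """
    ordered = sorted(rows, key=lambda row: (-row[1], row[0]))
    ranked = []
    rank = 0
    previous_edits = None
    for title, edits in ordered:
        if edits != previous_edits:
            rank += 1
            previous_edits = edits
        if rank > limit:
            break
        ranked.append((rank, title, edits))
    return ranked


sample = [
    ("Sandy", 760),
    ("Haja Coração", 761),
    ("Yo soy Franky", 765),
    ("Secret Story - Casa dos Segredos (6.ª edição)", 761),
    ("A Quinta: O Desafio", 783),
]

top3 = dense_rank(sample, limit=3)
for row in top3:
    print(row)
```

With limit=3 the sketch returns four rows, because the two pages with 761 edits share the third position.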

Policy, Ethics and Human Subjects Research

Although editors' information is publicly available, their usernames and IP addresses are not the subject of this research project, so editors cannot be identified from the results published here. Usernames and IP addresses are stored in the queried table of the database, but they are not disclosed for any purpose at any stage of the process (not even to people involved in this project, or to people aiming to reproduce the results presented here).

Results

The table below presents the results extracted from the database. Nothing was changed apart from formatting it as a wikitable with wikilinks (big thanks to TablesGenerator.com).

rank page_title total_edits
1 Lista de episódios de Steven Universe 2431
2 2016 na televisão 1385
3 Processo de impeachment de Dilma Rousseff 1226
4 Campeonato Brasileiro de Futebol de 2016 - Série A 1163
5 Campeonato Europeu de Futebol de 2016 1106
6 Temporada do São Paulo Futebol Clube de 2016 1018
7 Êta Mundo Bom! 1010
8 Big Brother Brasil 16 968
9 Os Under-Undergrounds 966
10 Campeonato Brasileiro de Futebol de 2016 - Série D 952
11 Operação Lava Jato 944
12 BTS 907
13 Lista de episódios de One Piece 906
14 Temporada do Club de Regatas Vasco da Gama de 2016 893
15 Copa do Brasil de Futebol de 2016 882
16 Lista de filmes de maior bilheteria 880
17 Paysandu Sport Club 859
18 Temporada de Fórmula 1 de 2016 858
19 A Quinta: O Desafio 783
20 Yo soy Franky 765
21 Haja Coração 761
21 Secret Story - Casa dos Segredos (6.ª edição) 761
22 Sandy 760
23 Copa União 749
24 Campeonato Paulista de Futebol de 2016 - Série A1 738
25 Campeonato Brasileiro de Futebol de 2016 - Série B 732
26 Velho Chico (telenovela) 729
27 Totalmente Demais (telenovela) 727
28 Santa Cruz Futebol Clube 724
29 Lista de líderes em home runs da Major League Baseball 715
30 Steven Universe 703
31 Campeonato Brasileiro de Futebol de 1987 692
32 Seleção Portuguesa de Futebol 691
33 Túlio Maravilha 688
34 Copa Libertadores da América de 2016 658
35 Black Pink 646
35 Brasil nos Jogos Olímpicos de Verão de 2016 646
36 Lista de jogadores de voleibol do Brasil 643
37 TWICE 637
38 Real Madrid Club de Fútbol 619
39 Seleção Brasileira de Futebol 616
40 Miss Universo 2016 613
41 Clássico das Multidões (Recife) 612
42 2016 598
43 Copa São Paulo de Futebol Júnior de 2016 596
43 Spider-Man: Homecoming 596
44 Campeonato Brasileiro de Futebol de 2016 - Série C 593
45 Eleição municipal de São Paulo em 2016 584
46 Copa Libertadores da América de 2017 583
47 The Voice Portugal (4.ª edição) 577
48 Love On Top (reality show) 556
49 Lista de episódios de Regular Show 552
49 Resultados da segunda fase da Copa Libertadores da América de 2016 552
50 Clássico dos Milhões 546
51 Lista de episódios de Adventure Time 538
52 Lucero (artista) 537
53 A Lei do Amor 528
54 Lista de episódios de O Incrível Mundo de Gumball 522
54 Power Rangers: Dino Charge 522
55 Girls' Generation 515
56 Futbol Club Barcelona 512
57 Copa do Nordeste de Futebol de 2016 511
58 Julius Jr. 509
59 Eleição municipal do Rio de Janeiro em 2016 506
59 Lista de canais da televisão digital brasileira 506
59 Pro Evolution Soccer 2017 506
60 Capitão América: Guerra Civil 504
61 Soy Luna 503
62 Ronaldinho Gaúcho 500
63 Fluminense Football Club 498
63 Primeira Liga do Brasil de 2016 498
64 Associação Chapecoense de Futebol 490
65 Gabriel Barbosa 487
65 Miraculous: As Aventuras de Ladybug 487
66 Miss Brasil 2016 486
67 Juarez Moreira 485
68 Copa Sul-Americana de 2016 480
69 Temporada de Fórmula 1 de 1950 475
70 Monsta X 469
71 Batman v Superman: Dawn of Justice 463
72 Lista de episódios de Naruto Shippuden 462
73 Pequenos Gigantes (2.ª temporada) 461
73 Sistema Brasileiro de Televisão 461
74 Renato Sanches 460
75 Sport Club do Recife 455
76 Campeonato Carioca de Futebol de 2016 - Série B 454
76 Copa América Centenário 454
76 Rising Star - A Próxima Estrela 454
77 Club de Regatas Vasco da Gama 451
78 Jogos Olímpicos de Verão de 2016 449
79 Olavo de Carvalho 448
80 Lista de episódios de Saint Seiya 447
81 Red Velvet 445
82 Paris Saint-Germain Football Club 444
83 Cúmplices de um Resgate 439
83 Festival Eurovisão da Canção 2017 439
84 Resultados da primeira fase da Copa do Brasil de Futebol de 2016 437
85 Esporte Clube Bahia 435
86 Resultados do primeiro turno do Campeonato Brasileiro de Futebol de 2016 - Série A 434
87 Gabriel Jesus 433
88 Botafogo de Futebol e Regatas 432
89 Lista de episódios de My Little Pony: Friendship Is Magic 430
90 Zlatan Ibrahimović 429
91 Instituto de Educação de Minas Gerais 420
91 Movimento Brasil Livre 420
92 História da Catalunha 419
93 A Regra do Jogo (telenovela) 417
94 Michel Temer 415
95 Grêmio Foot-Ball Porto Alegrense 412
96 Lady Gaga 407
97 Quadrinhos para adultos 403
98 Governo Michel Temer 402
99 Cheias de Charme 401
99 Shawn Mendes 401
100 Clube de Regatas Brasil 400
100 Os Dez Mandamentos (telenovela) 400
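The formatting step mentioned above (a wikitable with wikilinks) was done with TablesGenerator.com, but it could also be scripted. The sketch below is one possible way to do it, under stated assumptions: the function name to_wikitable is hypothetical, and the w:pt: interwiki prefix is an assumption about how links to Portuguese Wikipedia would be written from this wiki, not something taken from the original table.

```python
def to_wikitable(rows):
    """Render (rank, title, total_edits) tuples as MediaWiki table markup."""
    lines = ['{| class="wikitable"', "! rank !! page_title !! total_edits"]
    for rank, title, edits in rows:
        lines.append("|-")  # start of a new table row
        # Assumed link form: interwiki link to Portuguese Wikipedia.
        lines.append(f"| {rank} || [[w:pt:{title}|{title}]] || {edits}")
    lines.append("|}")
    return "\n".join(lines)


# First three rows of the results table above.
sample_rows = [
    (1, "Lista de episódios de Steven Universe", 2431),
    (2, "2016 na televisão", 1385),
    (3, "Processo de impeachment de Dilma Rousseff", 1226),
]

markup = to_wikitable(sample_rows)
print(markup)
```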

Technical info and other considerations

Run times and server info

The query presented here was executed on Wikimedia Tool Labs servers on Feb 17, 2017, at 10:00 PM (UTC). Its run times were:

real    1m26.288s
user    0m0.003s
sys     0m0.015s

Other relevant information about client and server instances:

$ uname -a
Linux tools-bastion-03 3.13.0-100-generic #147-Ubuntu SMP Tue Oct 18 16:48:51 UTC 2016 x86_64 x86_64 x86_64 GNU/Linux

$ mysql --version
mysql  Ver 15.1 Distrib 5.5.54-MariaDB, for debian-linux-gnu (x86_64) using readline 5.2

$ ./sql enwiki 'SHOW VARIABLES LIKE "%version";'
+------------------+-----------------+
| Variable_name    | Value           |
+------------------+-----------------+
| innodb_version   | 5.6.21-70.0     |
| protocol_version | 10              |
| tokudb_version   | tokudb-7.5.3    |
| version          | 10.0.15-MariaDB |
+------------------+-----------------+

Notes

  1. Portuguese Wikipedia has several namespaces for various purposes (user pages, user talk pages, discussion pages, help pages, etc.). Articles, however, exist only in the main namespace, and the main namespace contains nothing but articles.