List of Wikipedias by sample of articles

From Meta, a Wikimedia project coordination wiki

This page lists the largest Wikipedia language editions operated under the auspices of the Wikimedia Foundation. Test Wikipedias are listed at the Wikimedia Incubator Wiki project.

This list of Wikipedias uses the List of articles every Wikipedia should have (1,009 articles as of 26 October 2008) as a sample. The list actually used is at the end of List of Wikipedias by sample of articles/Source code and may differ slightly. For every Wikipedia, the articles in this sample list are retrieved (by following interwiki links from the English Wikipedia) and the number of characters in each is counted, excluding "comments" and the "interwiki" text at the bottom of the article. The size of each article is then adjusted for each language by multiplying it by the language weight.

The articles are divided into four classes: "absent" (i.e. non-existent; size = 0), "stubs" (fewer than 10,000 characters), "articles" (between 10,000 and 30,000 characters) and "long articles" (more than 30,000 characters). The average weighted size of the non-absent articles in the sample is also calculated.

Finally, a score is computed. The raw score is rawscore = stubs + articles*4 + long.articles*9. To put all Wikipedias on a consistent scale, the raw score is normalized by dividing it by the maximum possible score, which is reached when every article in the sample is a long article, and multiplying by 100. The maximum score is maxscore = (absent + stubs + articles + long.articles)*9. The final score is score = rawscore / maxscore * 100. The language editions are then listed in order of decreasing score.
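The classification and scoring steps described above can be sketched in a few lines of Python. This is only an illustration, not the program kept at List of Wikipedias by sample of articles/Source code: the names `classify`, `score` and `score_from_sizes` are hypothetical, and the handling of sizes that fall exactly on 10,000 or 30,000 (counted here as "articles") is an assumption, since the description does not say which class the boundaries belong to.

```python
from collections import Counter

# Class thresholds, in weighted characters, as given in the description.
STUB_LIMIT = 10_000
LONG_LIMIT = 30_000


def classify(raw_chars, weight):
    """Return the class of one article from its raw character count.

    raw_chars is 0 when the article does not exist in that Wikipedia.
    Assumption: sizes of exactly 10,000 or 30,000 count as "article".
    """
    size = raw_chars * weight  # adjust for the language weight
    if size == 0:
        return "absent"
    if size < STUB_LIMIT:
        return "stub"
    if size <= LONG_LIMIT:
        return "article"
    return "long"


def score(absent, stubs, articles, long_articles):
    """Normalized score: rawscore / maxscore * 100."""
    raw = stubs + articles * 4 + long_articles * 9
    max_score = (absent + stubs + articles + long_articles) * 9
    return raw / max_score * 100


def score_from_sizes(raw_sizes, weight):
    """Classify every article in the sample, then compute the score."""
    counts = Counter(classify(n, weight) for n in raw_sizes)
    return score(counts["absent"], counts["stub"],
                 counts["article"], counts["long"])


# Counts taken from the "en" row of the table on this page.
print(round(score(0, 51, 283, 675), 2))  # 79.93
```

With the counts from the English row (0 absent, 51 stubs, 283 articles, 675 long articles), the raw score is 51 + 1132 + 6075 = 7258 and the maximum is 1009*9 = 9081, which gives the 79.93 shown in the table.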

A copy of the program used to obtain this list is in List of Wikipedias by sample of articles/Source code.

Absent articles for major Wikipedias are in List of Wikipedias by sample of articles/Absent Articles.

Last Update: 25-26 October 2008

Rank Wiki Language Weight Average Article Size (wt. chars) Absent (0k) Stubs (< 10k) Articles (10-30k) Long Art. (> 30k) Score Growth
1 en English 1.0 48,177 0 51 283 675 79.93 +0.88
2 de Deutsch 1.0 35,889 5 150 408 446 63.83 +1.04
3 fr Français 1.0 32,115 0 228 419 362 56.84 +1.07
4 es Español 1.1 30,081 2 249 434 324 53.97 +1.59
5 it Italiano 1.1 25,753 7 286 421 295 50.93 +1.14
6 ru Русский 1.4 25,733 2 331 400 275 48.57 +1.24
7 zh 中文 3.7 27,239 2 364 380 263 46.81 +0.92
8 pt Português 1.1 17,961 3 453 387 166 38.49 +1.35
9 ja 日本語 1.9 17,395 2 435 427 145 37.97 +0.63
10 pl Polski 1.1 16,181 6 530 339 134 34.05 +0.87
11 hu Magyar 1.1 15,670 62 498 318 131 32.47 +0.79
12 fi Suomi 1.1 14,589 12 561 312 124 32.21 +0.86
13 cs Čeština 1.3 13,830 26 545 333 104 31.01 +0.84
14 he עברית 1.2 12,312 16 553 363 77 29.71 +0.70
15 sv Svenska 1.1 12,787 1 613 295 100 29.66 +0.71
16 nl Nederlands 0.9 11,888 6 573 359 71 29.16 +0.91
17 vi Tiếng Việt 1.1 15,798 158 476 245 130 28.92 +0.93
18 no Norsk (Bokmål) 1.2 11,793 12 639 271 87 27.60 +0.50
19 ca Català 1.1 12,203 1 657 268 83 27.27 +1.09
20 tr Türkçe 1.3 12,639 30 613 283 78 27.08 +0.84
21 uk Українська 1.3 11,410 16 631 291 71 26.80 +0.28
22 sr Српски / Srpski 1.4 12,213 79 592 247 91 26.42 +0.53
23 hr Hrvatski 1.3 9,989 59 663 227 60 23.25 +0.47
24 sk Slovenčina 1.3 10,741 119 613 210 67 22.64 +0.66
25 ro Română 1.1 10,311 120 618 207 64 22.27 +0.70
26 el Ελληνικά 1.1 10,534 192 549 208 60 21.15 +0.97
27 ko 한국어 2.5 8,529 45 736 175 53 21.07 +0.74
28 da Dansk 1.2 8,929 34 755 165 55 21.03 +0.58
29 bg Български 1.1 8,444 99 677 195 38 19.81 +0.35
30 id Bahasa Indonesia 1.0* 6,834 34 788 153 34 18.79 +0.43
31 ar العربية 1.0 5,782 1 833 154 20 17.96 +0.07
32 sl Slovenščina 1.2 7,293 72 750 163 24 17.82 +0.42
33 gl Galego 1.0* 8,185 150 649 186 24 17.72 +0.65
34 fa فارسی 1.2 7,097 141 697 139 32 16.97 +0.51
35 th ไทย 1.0 6,698 124 720 138 27 16.68 +0.60
36 eo Esperanto 1.1 6,071 1 881 102 25 16.67 +0.14
37 lt Lietuvių 1.0* 5,498 75 799 125 10 15.30 +0.41
38 ms Bahasa Melayu 1.0* 8,296 290 553 133 33 15.22 +0.58
39 simple Simple English 1.0* 3,989 0 931 70 8 14.13 -0.01
40 nn Nynorsk 1.2** 5,834 176 720 92 21 14.06 +0.20
41 is Íslenska 1.0* 3,112 29 913 58 9 13.50 +0.35
42 sh Srpskohrvatski / Српскохрватски 1.0* 6,905 319 554 116 20 13.19 +0.41
43 et Eesti 1.0* 4,737 151 771 73 14 13.09 +0.37
44 lv Latviešu 1.0* 5,638 243 653 100 13 12.88 +0.61
45 bs Bosanski 1.0* 4,824 175 736 95 3 12.59 +0.36
46 la Latina 1.1 4,221 190 749 53 17 12.27 +0.27
47 eu Euskara 1.0* 4,423 210 718 74 7 11.86 +0.34
48 af Afrikaans 1.0* 8,877 476 407 93 33 11.85 +0.43
49 mk Македонски 1.0* 5,123 297 627 71 14 11.42 +0.32
50 ta தமிழ் 0.9 3,410 205 749 49 6 11.00 +2.40
51 cy Cymraeg 1.0* 2,538 173 808 25 3 10.30 +0.16
52 ka ქართული 1.0* 3,943 308 651 41 9 9.87 +0.40
53 ml മലയാളം 1.0* 5,748 414 517 64 13 9.81 +0.69
54 zh-yue 粵語 3.7** 6,071 482 457 55 15 8.94 +0.10
55 bn বাংলা 1.0* 4,418 444 506 46 13 8.89 +0.35
56 br Brezhoneg 1.0* 4,783 434 517 48 10 8.80 +0.20
57 sq Shqip 1.0* 4,858 452 498 48 11 8.69 +0.61
58 ps پښتو 1.0* 31,394 876 30 35 68 8.61 +0.22
59 hi हिन्दी 1.0* 3,072 415 554 33 6 8.16 +0.45
60 be-x-old Беларуская (тарашкевіца) 1.0* 4,636 497 452 57 3 7.79 +0.35
61 lb Lëtzebuergesch 1.0* 4,478 494 465 42 8 7.76 +0.12
62 qu Runa Simi 1.0* 2,625 420 557 32 0 7.54 +0.31
63 bat-smg Žemaitėška 1.0* 1,100 343 663 3 0 7.43 +0.27
64 yi ייִדיש 1.0* 3,046 479 501 22 7 7.18 +0.21
65 ga Gaeilge 1.0* 4,163 525 446 28 10 7.14 +0.35
66 scn Sicilianu 1.0* 2,547 435 552 21 1 7.10 +0.21
67 oc Occitan 1.0* 3,723 501 467 36 3 7.04 +0.21
68 sw Kiswahili 1.0* 3,175 424 572 12 1 6.93 +0.15
69 tl Tagalog 1.0* 3,470 509 466 29 5 6.90 +0.41
70 nds Plattdüütsch 1.0* 5,214 620 340 32 17 6.84 +0.44
71 ur اردو 1.0* 4,007 557 409 36 7 6.78 +0.44
72 ast Asturianu 1.0* 3,720 497 482 28 2 6.74 +0.13
73 az Azərbaycan 1.0* 3,156 532 450 22 5 6.42 +0.26
74 io Ido 1.0* 1,738 461 544 3 1 6.22 +0.20
75 su Basa Sunda 1.0* 9,917 763 167 64 15 6.14 +0.24
76 zh-min-nan Bân-lâm-gú 1.2 2,025 480 522 7 0 6.06 +0.28
77 an Aragonés 1.0* 2,929 555 438 13 3 5.69 +0.26
78 te తెలుగు 1.0* 6,935 727 223 48 11 5.66 +0.20
79 jv Basa Jawa 1.0* 3,563 588 403 16 2 5.34 +0.51
80 mr मराठी 1.0* 2,617 603 391 9 6 5.30 +0.22
81 ku Kurdî / كوردی 1.0* 2,424 580 416 12 1 5.21 +0.15
82 be Беларуская 1.0* 3,902 651 329 27 2 5.01 +0.24
83 mn Монгол 1.0* 4,606 655 331 18 5 4.93 +0.15
84 ia Interlingua 1.0* 2,927 635 354 19 1 4.83 +0.31
85 als Alemannisch 1.0* 7,967 777 179 45 8 4.75 +0.14
86 fy Frysk 1.0* 3,933 665 323 20 1 4.54 +0.28
87 gd Gàidhlig 1.0* 2,356 668 333 8 0 4.02 +0.14
88 tg Тоҷикӣ 1.0* 2,121 684 317 6 2 3.95 +0.07
89 kn ಕನ್ನಡ 1.0* 4,809 778 206 17 8 3.81 +0.29
90 vec Vèneto 1.0* 3,432 729 263 15 2 3.76 +0.05
91 li Limburgs 1.0* 3,800 734 257 17 1 3.68 +0.08
92 uz O‘zbek 1.0* 2,757 732 262 14 1 3.60 +0.14
93 cv Чăваш 1.0* 2,935 722 278 9 0 3.46 +0.29
94 nah Nāhuatl 1.0* 2,430 741 261 5 2 3.29 +0.12
95 hy Հայերեն 1.2 2,782 753 244 11 1 3.27 +0.28
96 mt Malti 1.0* 6,685 849 129 22 8 3.19 +0.28
97 vo Volapük 1.0* 1,764 741 265 1 2 3.16 -0.10
98 ht Krèyol ayisyen 1.0* 1,399 737 269 2 1 3.15 +0.08
99 kk Қазақша 1.0* 5,215 787 206 13 3 3.14 +0.51
100 fo Føroyskt 1.0* 2,932 782 214 12 1 2.98 +0.23
101 pam Kapampangan 1.0* 5,701 820 167 21 1 2.86 +0.04
102 zh-classical 古文 / 文言文 3.7** 5,560 824 167 17 1 2.69 +0.05
103 fur Furlan 1.0* 2,979 791 211 7 0 2.63 +0.07
104 pms Piemontèis 1.0* 3,462 819 180 6 3 2.55 +0.06
105 nrm Nouormand/Normaund 1.0* 2,079 779 230 0 0 2.53 +0.10
106 sco Scots 1.0* 1,967 791 218 0 0 2.40 +0.07
107 bar Boarisch 1.0* 5,015 850 143 15 1 2.33 +0.17
108 ceb Sinugboanong Binisaya 0.8 2,425 826 175 7 1 2.33 +0.08
109 lij Líguru 1.0* 1,473 811 196 2 0 2.25 +0.09
110 wa Walon 1.0* 2,396 819 186 4 0 2.22 +0.09
111 nov Novial 1.0* 1,644 824 181 4 0 2.17 +0.06
112 wuu 吴语 3.7** 9,595 900 92 10 7 2.15 +0.11
113 szl Ślůnski 1.0* 2,761 818 189 1 0 2.13 +0.24
114 gv Gaelg 1.0* 3,488 847 157 4 1 2.00 +0.19
115 chr ᏣᎳᎩ ᎧᏬᏂᎯᏍᏗ 1.0* 18,729 964 20 13 12 1.98 +0.06
116 jbo Lojban 1.0* 1,278 831 178 0 0 1.96 +0.08
117 ln Lingala 1.0* 1,751 845 160 4 0 1.94 +0.07
118 si සිංහල 1.0* 19,629 957 34 4 14 1.94 +0.41
119 nds-nl Nedersaksisch 1.0* 3,056 839 169 1 0 1.91 +0.04
120 dv ދިވެހިބަސް 1.0* 3,561 877 119 11 1 1.90 +0.05
121 sa संस्कृतम् 1.0* 1,206 844 163 2 0 1.88 +0.02
122 diq Zazaki 1.0* 2,906 852 155 2 0 1.79 +0.05
123 ne नेपाली 1.0* 4,274 890 110 6 3 1.77 +0.14
124 am አማርኛ 1.0* 1,315 855 154 0 0 1.70 +0.04
125 new नेपाल भाषा 1.0* 4,320 899 97 12 1 1.70 +0.16
126 rm Rumantsch 1.0* 3,618 893 107 7 2 1.68 +0.03
127 ksh Ripoarisch 1.0* 2,463 880 125 3 1 1.61 +0.10
128 kw Kernewek/Karnuack 1.0* 2,100 864 145 0 0 1.60 +0.07
129 frp Arpitan 1.0* 1,518 865 144 0 0 1.59 +0.06
130 gan 贛語 3.7** 3,849 880 122 5 0 1.57 +0.03
131 os Иронау 1.0* 2,139 874 133 2 0 1.55 +0.03
132 yo Yorùbá 1.0* 2,289 870 139 0 0 1.53 +0.16
133 lmo Lumbaart 1.0* 3,073 896 105 8 0 1.51 +0.08
134 fiu-vro Võro 1.0* 1,450 876 132 1 0 1.50 +0.05
135 ang Englisc 1.0* 1,894 883 123 3 0 1.49 +0.04
136 hsb Hornjoserbsce 1.0* 2,532 884 122 3 0 1.48 +0.08
137 ext Estremeñu 1.0* 4,323 908 92 8 1 1.46 +0.16
138 bpy ইমার ঠার/বিষ্ণুপ্রিয়া মণিপুরী 1.0* 4,702 904 100 2 2 1.39 +0.03
139 hak Hak-kâ-fa / 客家話 1.0* 1,655 884 125 0 0 1.38 +0.04
140 vls West-Vlams 1.0* 3,679 901 103 3 1 1.37 +0.05
141 lad Dzhudezmo 1.0* 3,338 902 102 5 0 1.34 +0.03
142 gu ગુજરાતી 1.0* 3,583 917 84 7 1 1.33 +0.12
143 arc ܐܪܡܝܐ 1.0* 1,208 890 119 0 0 1.31 +0.09
144