Wikipedia Diversity Observatory/List of Wikipedias by Cultural Context Content
This page contains a list of all the current Wikipedia language editions ordered by their number of articles from their Cultural Context Content dataset, a collection of articles from various topics, from geographical places to biographies, that relate to territories where the language is spoken as official or as indigeneous.
The table entries are ordered by article count. Each entry gives the language name in English (linked to the English Wikipedia article for the language), the language code used in the wiki's URL address, the current number of articles in this language edition, as well as few statistics based on cultural context content.
For each language edition, statistics account for the number of articles of different CCC segments and their percentage computed in relation to the overall total number of Wikipedia articles. This is CCC (%) as the number of CCC articles, CCC GL (%) as the number of articles from CCC that are geolocated, KW Title (%) as the number of articles from CCC that contain specific keywords (language name, territory name or demonym) in their titles, CCC People (%) as the number of CCC articles whose Wikidata qitems contain a property related to gender, and CCC Female-Male % as the percentage of female and male contained in the previous group of people articles. Finally, Region (continent) is introduced in order to contextualize the results.
This table is updated automatically on a monthly basis as soon as a new CCC dataset is generated. Currently, it is only possible to consult previous version of the table using the View history function.
In a similar way than other Lists of Wikipedia pages the table should only be modified by its generating script. If something is wrong with an entry other than simply having slightly out of date statistics, post about it on the talk page.