Wikipedia Diversity Observatory/Culture Gap (spread)
The following table is particularly useful in order to understand the content culture gap between language editions, that is the imbalances across languages editions in content representing cultural context. Specifically, it shows which language CCC is more popular among all Wikipedia language editions by counting the CCC spread articles, i.e. articles from one language CCC that exist in other language editions.
Languages are sorted in alphabetic order by their Wikicode, and the columns present the following statistics: (CCC %) the number of CCC articles and the percentage it occupies in the language computed in relation to their total number of articles, the first five other languages with more spread articles from the language CCC and the percentage they occupy computed in relation to their corresponding total number of articles, the relative spread (R. Spread) of a language CCC across all the other languages computed as the average of the percentage they occupy in each other language edition, the total spread (T. Spread) of a CCC across all the other languages computed as the percentage in relation to all languages articles (not counting the own), and finally, the total number of language CCC articles (Spread Art.) that exists across all the other language editions.
These latter statistics show, for instance, that English CCC occupies more than a quarter (T. Spread) of the sum of all articles from all Wikipedia language editions, which equates to almost seven million of articles. This language CCC is followed by other occidental languages which have had a dominant geopolitical position.
This table is updated automatically on a monthly basis as soon as a new CCC dataset is generated and the percentages are calculated.