Jump to content

Wikipedia Diversity Observatory/Language Territories Mapping

From Meta, a Wikimedia project coordination wiki

This page contains a copy of the latest version of the Language Territories Mapping database (see wikipedia_language_territories_mapping.csv in github). The first version of this database has been generated using Ethnologue, Wikidata and Wikipedia language pages. Wikimedians are invited to suggest changes by e-mailing tools.wcdo@tools.wmflabs.org or by posting a comment in the talk page.

The database contains all the territories (political divisions of first and second level) in which a language is spoken because it is indigenous or official, along with some specific metadata used in the generation of Cultural Context Content (CCC) dataset.

The following table is a reduced version of the database with the Language name, wikicode, Wikidata Qitem for the territory, territory in native language, demonyms in native language, ISO 3166 and ISO 3166-2, whereas the full database includes the Qitem for the language, language names in Native languages among other information. Additionally, the full table is extended with the database country_regions.csv, which presents an equivalence table between countries, world regions (continents) and subregions.