Special language codes

From Meta, a Wikimedia project coordination wiki
Jump to: navigation, search

Other languages:
العربية • ‎беларуская (тарашкевіца)‎ • ‎Deutsch • ‎English • ‎español • ‎français • ‎polski • ‎português • ‎português do Brasil • ‎中文

The language of a Wikimedia wiki can be found in the lang and xml:lang attributes of the <html> element of each page. These language codes should generally be canonical language tags as defined by BCP 47.

In most cases, the subdomain names that we use for projects correspond to language codes, but there are some exceptions. Usually this is for historical reasons, where a valid ISO 639 code was not available at the time of creation of the project.

Subdomains that do not match the lang attribute[edit]

Subdomain Language Project(s) Notes
als local: Alemannisch; in English: Alemannic; Language family: Germanic Wikipedia, Wiktionary, Wikibooks, Wikiquote Uses gsw which matches the language's ISO 639-3 code.
be-x-old local: беларуская (тарашкевіца)‎; in English: Belarusian (Taraškievica); Language family: Slavic Wikipedia Uses IETF language tag be-tarask.
bh local: भोजपुरी; in English: Bihari; Language family: Indo-Aryan Wikipedia Uses bho which matches the language's ISO 639-3 code.
crh local: qırımtatarca; in English: Crimean Tatar; Language family: Turkic Wikipedia Site code is right, however they use its Latin variant "crh-Latn" as default language code.
roa-rup local: armãneashti; in English: Aromanian; Language family: Italic Wikipedia, Wiktionary Uses rup which matches the language's ISO 639-3 code.
simple local: Simple English; in English: Simple English; Language family: Germanic Wikipedia, Wiktionary, Wikibooks, Wikiquote Uses en of ordinary English.
zh-classical local: 文言; in English: Classical Chinese; Language family: Sinitic Wikipedia Classical Chinese has ISO 639-3 code lzh.
zh-min-nan local: Bân-lâm-gú; in English: Minnan; Language family: Sinitic Wikipedia, Wiktionary, Wikibooks, Wikiquote, Wikisource Min Nan has ISO 639-3 code nan.
zh-yue local: 粵語; in English: Cantonese; Language family: Sinitic Wikipedia Cantonese has ISO 639-3 code yue.

misc:

  • All subdomains of wikimedia.org

Subdomains that do not conform to a valid ISO 639 language code[edit]

Subdomain Language Project(s) Notes
als local: Alemannisch; in English: Alemannic; Language family: Germanic Wikipedia, Wiktionary, Wikibooks, Wikiquote Alemannic has ISO 639-3 code gsw. ISO 639-3 code als is assigned to Tosk Albanian instead (see phab:T25215).
bat-smg local: žemaitėška; in English: Samogitian; Language family: Baltic Wikipedia Samogitian has the ISO 639 code sgs (see phab:T27522).
cbk-zam local: Chavacano de Zamboanga; in English: Chavacano de Zamboanga; Language family: Pidgin and Creole Wikipedia Chavacano de Zamboanga has no ISO 639 code as an individual language. ISO 639-3 code cbk is assigned to Chavacano, a superset of Chavacano de Zamboanga. (see phab:T124657)
eml local: emiliàn e rumagnòl; in English: Emilian-Romagnol; Language family: Italic Wikipedia ISO 639-3 code eml for Emilian-Romagnol is now retired and split into egl (Emilian) and rgn (Romagnol). However, eml will remain a valid BCP 47 language tag indefinitely (see phab:T36217).
fiu-vro local: Võro; in English: Võro; Language family: Finno-Permic Wikipedia Võro has ISO 639-3 code vro (see phab:T31186).
iu local: ᐃᓄᒃᑎᑐᑦ/inuktitut; in English: Inuktitut; Language family: Eskimo-Aleut Wikipedia ISO 639 considers iu/iku not a locale but a macrolanguage of ike/ikt. MediaWiki agrees (since 5e396d428), but: falls back to ike, called ike-cans; adds ike-latn; has no ikt support. CLDR considers Cans an aspirational script.
ksh local: Ripoarisch; in English: Ripuarian; Language family: Germanic Wikipedia ISO 639-3 code ksh is assigned to Kölsch, a subset of Ripuarian.
map-bms local: Basa Banyumasan; in English: Banyumasan; Language family: Sunda-Sulawesi Wikipedia Banyumasan has no ISO 639 code as an individual language. ISO 639-1 code jv is assigned to Javanese, a superset of Banyumasan.
nds-nl local: Nedersaksies; in English: Dutch Low Saxon; Language family: Germanic Wikipedia Duplicated with Low German's nds.
nrm local: Nouormand; in English: Norman; Language family: Italic Wikipedia Norman has no ISO 639 code as an individual language (However, two dialects of Norman, Guernésiais and Jèrriais, are sharing ISO 639-3 code nrf). ISO 639-3 code nrm is assigned to Narom language instead. ISO 639-3 lumps Norman with French, as with most varieties of northern France (see phab:T25216).
roa-rup local: armãneashti; in English: Aromanian; Language family: Italic Wikipedia, Wiktionary Aromanian has ISO 639-3 code rup (see phab:T17988).
roa-tara local: tarandíne; in English: Tarantino; Language family: Italic Wikipedia Tarantino has no ISO 639 code as an individual language. ISO 639-3 lumps it with Italian, as with most varieties of northern Italy.
sh local: srpskohrvatski / српскохрватски; in English: Serbo-Croatian; Language family: Slavic Wikipedia, Wiktionary sh was originally ISO 639-1 code for Serbo-Croatian but is no longer active. However, it remains a valid BCP 47 language tag. There is the ISO 639-3 code hbs for Serbo-Croatian. In CLDR aliases, sh maps to sr_Latn (see phab:T127679, phab:T127680).
simple local: Simple English; in English: Simple English; Language family: Germanic Wikipedia, Wiktionary, Wikibooks, Wikiquote Simple English has no ISO 639 code but has an IETF subtag "simple". (see phab:T110190)
zh-classical local: 文言; in English: Classical Chinese; Language family: Sinitic Wikipedia Classical Chinese has ISO 639-3 code lzh (see phab:T10217, phab:T30443).
zh-min-nan local: Bân-lâm-gú; in English: Minnan; Language family: Sinitic Wikipedia, Wiktionary, Wikibooks, Wikiquote, Wikisource Min Nan has ISO 639-3 code nan (see phab:T10217, phab:T30442).
zh-yue local: 粵語; in English: Cantonese; Language family: Sinitic Wikipedia Cantonese has ISO 639-3 code yue (see phab:T10217, phab:T30441).

misc:

  • tokipona.wikipedia.org (defunct)
  • ru-sib.wikipedia.org (defunct, hoax in fictional "Siberian" language)
  • be-x-old.wikipedia.org (fixed and redirected to be-tarask.wikipedia.org, see phab:T11823)

Other[edit]

Subdomain Language Project(s) Notes
ak local: Akan; in English: Akan; Language family: Niger-Congo Wikipedia; closed WB & Wikt Are these two Wikipedias in the same language? See Wikipedia article.
tw local: Twi; in English: Twi; Language family: Niger-Congo Wikipedia; closed Wikt
de-formal local: Deutsch; in English: German; Language family: Germanic not used as host names but included as pseudo variant-subtags (unregistered) for translations in translatewiki.net (used in Meta-Wiki for pages like policies): we should have used a private-use extension
nl-informal local: Nederlands; in English: Dutch; Language family: Germanic not used as host names but included as pseudo variant subtags (unregistered) for translations in translatewiki.net (used in Meta-Wiki for pages like policies): we should have used a private-use extension

See also[edit]