Language code

From Meta, a Wikimedia project coordination wiki

Jump to: navigation, search

The language of a Wikimedia wiki can be found in the lang and xml:lang attributes of the <html> element of each page. These language codes should generally be canonical language tags as defined by RFC 4646.

In most cases, the subdomain names that we use for projects correspond to language codes, but there are some exceptions. Usually this is for historical reasons, where a valid ISO 639 code was not available at the time of creation of the project.

[edit] Subdomains that do not match the lang attribute

Subdomain Language Project(s) Notes
als local: Alemannisch; in English: Alemannic; Language family: Germanichttp://en.wikipedia.org/wiki/Germanic_languages Wikipedia, Wiktionary, Wikibooks, Wikiquote Uses gsw which matches the language's ISO 639-3 code.
be-x-old local: Беларуская (тарашкевіца); in English: Belarusian (Tarashkevitsa); Language family: Slavichttp://en.wikipedia.org/wiki/Slavic_languages Wikipedia Uses IETF language tag be-tarask.
roa-rup local: Armãneashce; in English: Aromanian; Language family: Italichttp://en.wikipedia.org/wiki/Italic_languages Wikipedia, Wiktionary Uses rup which matches the language's ISO 639-3 code.
simple local: Simple English; in English: Simple English; Language family: Germanichttp://en.wikipedia.org/wiki/Germanic_languages Wikipedia, Wiktionary, Wikibooks, Wikiquote Uses en of ordinary English.

misc:

  • All subdomains of wikimedia.org

[edit] Subdomains that do not conform valid ISO 639 language code

Subdomain Language Project(s) Notes
als local: Alemannisch; in English: Alemannic; Language family: Germanichttp://en.wikipedia.org/wiki/Germanic_languages Wikipedia, Wiktionary, Wikibooks, Wikiquote Alemannic has ISO 639-3 code gsw. ISO 639-3 code als is assigned to Tosk Albanian instead.
bat-smg local: Žemaitėška; in English: Samogitian; Language family: Baltichttp://en.wikipedia.org/wiki/Baltic_languages Wikipedia Samogitian has no ISO 639 code as an individual language.
be-x-old local: Беларуская (тарашкевіца); in English: Belarusian (Tarashkevitsa); Language family: Slavichttp://en.wikipedia.org/wiki/Slavic_languages Wikipedia Belarusian in Taraskievica orthography has no ISO 639 code as an individual language but it has IETF language tag be-tarask.
cbk-zam local: Chavacano de Zamboanga; in English: Chavacano de Zamboanga; Language family: Pidgin and Creolehttp://en.wikipedia.org/wiki/Pidgin_and_Creole_languages Wikipedia Chavacano de Zamboanga has no ISO 639 code as an individual language.
eml local: Emiliàn e rumagnòl; in English: Emilian-Romagnol; Language family: Italichttp://en.wikipedia.org/wiki/Italic_languages Wikipedia ISO 639-3 code eml for Emilian-Romagnol is now retired and split into egl (Emilian) and rgn (Romagnol).
fiu-vro local: Võro; in English: Võro; Language family: Finno-Permichttp://en.wikipedia.org/wiki/Finno-Permic_languages Wikipedia Võro has ISO 639-3 code vro.
ksh local: Ripoarisch; in English: Ripuarian; Language family: Germanichttp://en.wikipedia.org/wiki/Germanic_languages Wikipedia ISO 639-3 code ksh is assigned to Kölsch, a subset of Ripuarian.
map-bms local: Basa Banyumasan; in English: Banyumasan; Language family: Sunda-Sulawesihttp://en.wikipedia.org/wiki/Sunda-Sulawesi_languages Wikipedia Banyumasan has no ISO 639 code as an individual language.
nds-nl local: Nedersaksisch; in English: Dutch Low Saxon; Language family: Germanichttp://en.wikipedia.org/wiki/Germanic_languages Wikipedia Duplicated with Low German's nds.
nrm local: Nouormand; in English: Norman; Language family: Italichttp://en.wikipedia.org/wiki/Italic_languages Wikipedia Norman has no ISO 639 code as an individual language. ISO 639-3 code nrm is assigned to Narom language instead.
roa-rup local: Armãneashce; in English: Aromanian; Language family: Italichttp://en.wikipedia.org/wiki/Italic_languages Wikipedia, Wiktionary Aromanian has ISO 639-3 code rup.
roa-tara local: Tarandíne; in English: Tarantino; Language family: Italichttp://en.wikipedia.org/wiki/Italic_languages Wikipedia Tarantino has no ISO 639 code as an individual language.
sh local: Srpskohrvatski / Српскохрватски; in English: Serbo-Croatian; Language family: Slavichttp://en.wikipedia.org/wiki/Slavic_languages Wikipedia, Wiktionary sh was originally ISO 639-1 code for Serbo-Croatian but is no longer active. There is the ISO 639-3 code hbs for Serbo-Croatian.
simple local: Simple English; in English: Simple English; Language family: Germanichttp://en.wikipedia.org/wiki/Germanic_languages Wikipedia, Wiktionary, Wikibooks, Wikiquote Simple English has no ISO 639 code.
zh-classical local: 文言; in English: Classical Chinese; Language family: Sinitichttp://en.wikipedia.org/wiki/Sinitic_languages Wikipedia Classical Chinese has ISO 639-3 code lzh.
zh-min-nan local: Bân-lâm-gú; in English: Minnan; Language family: Sinitichttp://en.wikipedia.org/wiki/Sinitic_languages Wikipedia, Wiktionary, Wikibooks, Wikiquote, Wikisource Min Nan has ISO 639-3 code nan.
zh-yue local: 粵語; in English: Cantonese; Language family: Sinitichttp://en.wikipedia.org/wiki/Sinitic_languages Wikipedia Cantonese has ISO 639-3 code yue.

misc:

  • tokipona.wikipedia.org (defunct)

[edit] See also