Language code

From Meta, a Wikimedia project coordination wiki

The language of a Wikimedia wiki can be found in the lang and xml:lang attributes of the <html> element of each page. These language codes should generally be canonical language tags as defined by BCP 47.
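As a rough illustration of reading those attributes, the following sketch uses Python's standard html.parser module. The names HtmlLangParser and page_language and the sample markup are made up for this example; a real page would first have to be fetched.

```python
from html.parser import HTMLParser


class HtmlLangParser(HTMLParser):
    """Collects the lang and xml:lang attributes of the first <html> tag."""

    def __init__(self):
        super().__init__()
        self.lang = None
        self.xml_lang = None
        self._seen_html = False

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names before calling this.
        if tag == "html" and not self._seen_html:
            self._seen_html = True
            attributes = dict(attrs)
            self.lang = attributes.get("lang")
            self.xml_lang = attributes.get("xml:lang")


def page_language(html_text):
    """Return (lang, xml_lang) taken from the <html> element of a page."""
    parser = HtmlLangParser()
    parser.feed(html_text)
    return parser.lang, parser.xml_lang


# Hypothetical page source, shaped like a page served from be-x-old.wikipedia.org.
sample = (
    '<!DOCTYPE html><html lang="be-tarask" xml:lang="be-tarask" dir="ltr">'
    "<head></head><body></body></html>"
)
print(page_language(sample))  # ('be-tarask', 'be-tarask')
```

Either value is None when the attribute is absent, so a caller can tell a missing language tag from an empty one.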

In most cases, the subdomain names that we use for projects correspond to language codes, but there are some exceptions. Usually this is for historical reasons: no valid ISO 639 code was available when the project was created.

Subdomains that do not match the lang attribute

Subdomain | Language | Project(s) | Notes
als | local: Alemannisch; in English: Alemannic; Language family: Germanic (http://en.wikipedia.org/wiki/Germanic_languages) | Wikipedia, Wiktionary, Wikibooks, Wikiquote | Uses gsw, which matches the language's ISO 639-3 code.
be-x-old | local: беларуская (тарашкевіца); in English: Belarusian (Taraškievica); Language family: Slavic (http://en.wikipedia.org/wiki/Slavic_languages) | Wikipedia | Uses the IETF language tag be-tarask.
roa-rup | local: Armãneashce; in English: Aromanian; Language family: Italic (http://en.wikipedia.org/wiki/Italic_languages) | Wikipedia, Wiktionary | Uses rup, which matches the language's ISO 639-3 code.
simple | local: Simple English; in English: Simple English; Language family: Germanic (http://en.wikipedia.org/wiki/Germanic_languages) | Wikipedia, Wiktionary, Wikibooks, Wikiquote | Uses en, the code for ordinary English.
zh-classical | local: 文言; in English: Classical Chinese; Language family: Sinitic (http://en.wikipedia.org/wiki/Sinitic_languages) | Wikipedia | Classical Chinese has ISO 639-3 code lzh.
zh-min-nan | local: Bân-lâm-gú; in English: Minnan; Language family: Sinitic (http://en.wikipedia.org/wiki/Sinitic_languages) | Wikipedia, Wiktionary, Wikibooks, Wikiquote, Wikisource | Min Nan has ISO 639-3 code nan.
zh-yue | local: 粵語; in English: Cantonese; Language family: Sinitic (http://en.wikipedia.org/wiki/Sinitic_languages) | Wikipedia | Cantonese has ISO 639-3 code yue.

misc:

  • All subdomains of wikimedia.org
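
The exceptions in the table above can be held in a small lookup table. The sketch below is only illustrative: LANG_OVERRIDES and expected_lang are hypothetical names, the three zh-* values assume that those wikis use the ISO 639-3 code given in their notes (the table does not say so outright), and subdomains of wikimedia.org are not covered.

```python
# Subdomain -> value expected in the lang attribute, transcribed from the table.
LANG_OVERRIDES = {
    "als": "gsw",
    "be-x-old": "be-tarask",
    "roa-rup": "rup",
    "simple": "en",
    # Assumption: the table only states the ISO 639-3 code for these three;
    # that the lang attribute uses it is inferred, not confirmed.
    "zh-classical": "lzh",
    "zh-min-nan": "nan",
    "zh-yue": "yue",
}


def expected_lang(subdomain):
    """Return the language tag a subdomain is expected to declare.

    Subdomains not listed as exceptions are assumed to match their own code.
    """
    return LANG_OVERRIDES.get(subdomain, subdomain)


print(expected_lang("als"))       # gsw
print(expected_lang("be-x-old"))  # be-tarask
print(expected_lang("fr"))        # fr
```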

Subdomains that do not conform to a valid ISO 639 language code

Subdomain | Language | Project(s) | Notes
als | local: Alemannisch; in English: Alemannic; Language family: Germanic (http://en.wikipedia.org/wiki/Germanic_languages) | Wikipedia, Wiktionary, Wikibooks, Wikiquote | Alemannic has ISO 639-3 code gsw. ISO 639-3 code als is assigned to Tosk Albanian instead (see bugzilla:23215).
bat-smg | local: žemaitėška; in English: Samogitian; Language family: Baltic (http://en.wikipedia.org/wiki/Baltic_languages) | Wikipedia | Samogitian has no ISO 639 code as an individual language. A request is pending at the ISO 639-3/RA to assign the code 'sgs' to it (see bugzilla:25522).
be-x-old | local: беларуская (тарашкевіца); in English: Belarusian (Taraškievica); Language family: Slavic (http://en.wikipedia.org/wiki/Slavic_languages) | Wikipedia | Belarusian in Taraškievica orthography has no ISO 639 code as an individual language, but it has the IETF language tag be-tarask (see bugzilla:9823).
cbk-zam | local: Chavacano de Zamboanga; in English: Chavacano de Zamboanga; Language family: Pidgin and Creole (http://en.wikipedia.org/wiki/Pidgin_and_Creole_languages) | Wikipedia | Chavacano de Zamboanga has no ISO 639 code as an individual language. ISO 639-3 code cbk is assigned to Chavacano, a superset of Chavacano de Zamboanga.
eml | local: emiliàn e rumagnòl; in English: Emilian-Romagnol; Language family: Italic (http://en.wikipedia.org/wiki/Italic_languages) | Wikipedia | ISO 639-3 code eml for Emilian-Romagnol is now retired and split into egl (Emilian) and rgn (Romagnol). However, eml will remain a valid BCP 47 language tag indefinitely (see bugzilla:34217).
fiu-vro | local: Võro; in English: Võro; Language family: Finno-Permic (http://en.wikipedia.org/wiki/Finno-Permic_languages) | Wikipedia | Võro has ISO 639-3 code vro (see bugzilla:29186).
ksh | local: Ripoarisch; in English: Ripuarian; Language family: Germanic (http://en.wikipedia.org/wiki/Germanic_languages) | Wikipedia | ISO 639-3 code ksh is assigned to Kölsch, a subset of Ripuarian.
map-bms | local: Basa Banyumasan; in English: Banyumasan; Language family: Sunda-Sulawesi (http://en.wikipedia.org/wiki/Sunda-Sulawesi_languages) | Wikipedia | Banyumasan has no ISO 639 code as an individual language. ISO 639-1 code jv is assigned to Javanese, a superset of Banyumasan.
nds-nl | local: Nedersaksies; in English: Dutch Low Saxon; Language family: Germanic (http://en.wikipedia.org/wiki/Germanic_languages) | Wikipedia | Duplicates nds, the code for Low German.
nrm | local: Nouormand; in English: Norman; Language family: Italic (http://en.wikipedia.org/wiki/Italic_languages) | Wikipedia | Norman has no ISO 639 code as an individual language. ISO 639-3 code nrm is assigned to the Narom language instead. ISO 639-3 lumps Norman with French, as with most varieties of northern France (see bugzilla:23216).
roa-rup | local: Armãneashce; in English: Aromanian; Language family: Italic (http://en.wikipedia.org/wiki/Italic_languages) | Wikipedia, Wiktionary | Aromanian has ISO 639-3 code rup (see bugzilla:15988).
roa-tara | local: tarandíne; in English: Tarantino; Language family: Italic (http://en.wikipedia.org/wiki/Italic_languages) | Wikipedia | Tarantino has no ISO 639 code as an individual language. ISO 639-3 lumps it with Italian, as with most varieties of northern Italy.
sh | local: srpskohrvatski / српскохрватски; in English: Serbo-Croatian; Language family: Slavic (http://en.wikipedia.org/wiki/Slavic_languages) | Wikipedia, Wiktionary | sh was originally the ISO 639-1 code for Serbo-Croatian but is no longer active. However, it remains a valid BCP 47 language tag. The ISO 639-3 code for Serbo-Croatian is hbs.
simple | local: Simple English; in English: Simple English; Language family: Germanic (http://en.wikipedia.org/wiki/Germanic_languages) | Wikipedia, Wiktionary, Wikibooks, Wikiquote | Simple English has no ISO 639 code.
zh-classical | local: 文言; in English: Classical Chinese; Language family: Sinitic (http://en.wikipedia.org/wiki/Sinitic_languages) | Wikipedia | Classical Chinese has ISO 639-3 code lzh (see bugzilla:8217, bugzilla:28443).
zh-min-nan | local: Bân-lâm-gú; in English: Minnan; Language family: Sinitic (http://en.wikipedia.org/wiki/Sinitic_languages) | Wikipedia, Wiktionary, Wikibooks, Wikiquote, Wikisource | Min Nan has ISO 639-3 code nan (see bugzilla:8217, bugzilla:28442).
zh-yue | local: 粵語; in English: Cantonese; Language family: Sinitic (http://en.wikipedia.org/wiki/Sinitic_languages) | Wikipedia | Cantonese has ISO 639-3 code yue (see bugzilla:8217, bugzilla:28441).

misc:

  • tokipona.wikipedia.org (defunct)
  • ru-sib.wikipedia.org (defunct)
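
ISO 639-1 codes are two letters long and ISO 639-2 and 639-3 codes are three, so a purely syntactic test already flags the hyphenated subdomains. The sketch below shows such a test and its limit; has_iso639_shape is a hypothetical helper and does not consult the ISO 639 registry.

```python
import re

# Two or three lowercase ASCII letters: the shape of an ISO 639-1/-2/-3 code.
ISO_639_SHAPE = re.compile(r"[a-z]{2,3}")


def has_iso639_shape(subdomain):
    """Return True if the subdomain is shaped like a bare ISO 639 code.

    This checks form only. It cannot tell whether the code is assigned,
    retired, or assigned to a different language.
    """
    return ISO_639_SHAPE.fullmatch(subdomain) is not None


for name in ["zh-min-nan", "be-x-old", "simple", "als", "nrm"]:
    print(name, has_iso639_shape(name))
# zh-min-nan False
# be-x-old False
# simple False
# als True
# nrm True
```

The last two results show why a shape check is not enough: als and nrm are well-formed codes, but per the table they are assigned to Tosk Albanian and Narom, not to Alemannic and Norman.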

Other

Subdomain | Language | Project(s) | Notes
ak | local: Akan; in English: Akan; Language family: Niger-Congo (http://en.wikipedia.org/wiki/Niger-Congo_languages) | Wikipedia; closed WB & Wikt | Are these two Wikipedias in the same language? See Wikipedia article.
tw | local: Twi; in English: Twi; Language family: Niger-Congo (http://en.wikipedia.org/wiki/Niger-Congo_languages) | Wikipedia; closed Wikt |

See also