Language code
From Meta, a Wikimedia project coordination wiki
The language of a Wikimedia wiki can be found in the lang and xml:lang attributes of the <html> element of each page. These language codes should generally be canonical language tags as defined by BCP 47.
In most cases, the subdomain names that we use for projects correspond to language codes, but there are some exceptions. Usually this is for historical reasons, where a valid ISO 639 code was not available at the time of creation of the project.
Contents |
Subdomains that do not match the lang attribute [edit]
| Subdomain | Language | Project(s) | Notes |
|---|---|---|---|
| als | local: Alemannisch; in English: Alemannic; Language family: Germanic | Wikipedia, Wiktionary, Wikibooks, Wikiquote | Uses gsw which matches the language's ISO 639-3 code. |
| be-x-old | local: беларуская (тарашкевіца); in English: Belarusian (Taraškievica); Language family: Slavic | Wikipedia | Uses IETF language tag be-tarask. |
| roa-rup | local: Armãneashce; in English: Aromanian; Language family: Italic | Wikipedia, Wiktionary | Uses rup which matches the language's ISO 639-3 code. |
| simple | local: Simple English; in English: Simple English; Language family: Germanic | Wikipedia, Wiktionary, Wikibooks, Wikiquote | Uses en of ordinary English. |
| zh-classical | local: 文言; in English: Classical Chinese; Language family: Sinitic | Wikipedia | Classical Chinese has ISO 639-3 code lzh. |
| zh-min-nan | local: Bân-lâm-gú; in English: Minnan; Language family: Sinitic | Wikipedia, Wiktionary, Wikibooks, Wikiquote, Wikisource | Min Nan has ISO 639-3 code nan. |
| zh-yue | local: 粵語; in English: Cantonese; Language family: Sinitic | Wikipedia | Cantonese has ISO 639-3 code yue. |
misc:
- All subdomains of wikimedia.org
Subdomains that do not conform valid ISO 639 language code [edit]
| Subdomain | Language | Project(s) | Notes |
|---|---|---|---|
| als | local: Alemannisch; in English: Alemannic; Language family: Germanic | Wikipedia, Wiktionary, Wikibooks, Wikiquote | Alemannic has ISO 639-3 code gsw. ISO 639-3 code als is assigned to Tosk Albanian instead (see bugzilla:23215). |
| bat-smg | local: žemaitėška; in English: Samogitian; Language family: Baltic | Wikipedia | Samogitian has no ISO 639 code as an individual language. A request is pending at the ISO 639-3/RA to assign the code 'sgs' to it (see bugzilla:25522). |
| be-x-old | local: беларуская (тарашкевіца); in English: Belarusian (Taraškievica); Language family: Slavic | Wikipedia | Belarusian in Taraškievica orthography has no ISO 639 code as an individual language but it has IETF language tag be-tarask (see bugzilla:9823). |
| cbk-zam | local: Chavacano de Zamboanga; in English: Chavacano de Zamboanga; Language family: Pidgin and Creole | Wikipedia | Chavacano de Zamboanga has no ISO 639 code as an individual language. ISO 639-3 code cbk is assigned to Chavacano, a superset of Chavacano de Zamboanga. |
| eml | local: emiliàn e rumagnòl; in English: Emilian-Romagnol; Language family: Italic | Wikipedia | ISO 639-3 code eml for Emilian-Romagnol is now retired and split into egl (Emilian) and rgn (Romagnol). However, eml will remain a valid BCP 47 language tag indefinitely (see bugzilla:34217). |
| fiu-vro | local: Võro; in English: Võro; Language family: Finno-Permic | Wikipedia | Võro has ISO 639-3 code vro (see bugzilla:29186). |
| ksh | local: Ripoarisch; in English: Ripuarian; Language family: Germanic | Wikipedia | ISO 639-3 code ksh is assigned to Kölsch, a subset of Ripuarian. |
| map-bms | local: Basa Banyumasan; in English: Banyumasan; Language family: Sunda-Sulawesi | Wikipedia | Banyumasan has no ISO 639 code as an individual language. ISO 639-1 code jv is assigned to Javanese, a superset of Banyumasan. |
| nds-nl | local: Nedersaksies; in English: Dutch Low Saxon; Language family: Germanic | Wikipedia | Duplicated with Low German's nds. |
| nrm | local: Nouormand; in English: Norman; Language family: Italic | Wikipedia | Norman has no ISO 639 code as an individual language. ISO 639-3 code nrm is assigned to Narom language instead. ISO 639/3 lumps Norman with French, as with most varieties of northern France (see bugzilla:23216). |
| roa-rup | local: Armãneashce; in English: Aromanian; Language family: Italic | Wikipedia, Wiktionary | Aromanian has ISO 639-3 code rup (see bugzilla:15988). |
| roa-tara | local: tarandíne; in English: Tarantino; Language family: Italic | Wikipedia | Tarantino has no ISO 639 code as an individual language. ISO 639-3 lumps it with Italian, as with most varieties of northern Italy. |
| sh | local: srpskohrvatski / српскохрватски; in English: Serbo-Croatian; Language family: Slavic | Wikipedia, Wiktionary | sh was originally ISO 639-1 code for Serbo-Croatian but is no longer active. However, it remains a valid BCP 47 language tag. There is the ISO 639-3 code hbs for Serbo-Croatian. |
| simple | local: Simple English; in English: Simple English; Language family: Germanic | Wikipedia, Wiktionary, Wikibooks, Wikiquote | Simple English has no ISO 639 code. |
| zh-classical | local: 文言; in English: Classical Chinese; Language family: Sinitic | Wikipedia | Classical Chinese has ISO 639-3 code lzh (see bugzilla:8217, bugzilla:28443). |
| zh-min-nan | local: Bân-lâm-gú; in English: Minnan; Language family: Sinitic | Wikipedia, Wiktionary, Wikibooks, Wikiquote, Wikisource | Min Nan has ISO 639-3 code nan (see bugzilla:8217, bugzilla:28442). |
| zh-yue | local: 粵語; in English: Cantonese; Language family: Sinitic | Wikipedia | Cantonese has ISO 639-3 code yue (see bugzilla:8217, bugzilla:28441). |
misc:
- tokipona.wikipedia.org (defunct)
- ru-sib.wikipedia.org (defunct)
Other [edit]
| Subdomain | Language | Project(s) | Notes | |
|---|---|---|---|---|
| ak | local: Akan; in English: Akan; Language family: Niger-Congo | Wikipedia; closed WB & Wikt | Are these two Wikipedias in the same language? See Wikipedia article. | |
| tw | local: Twi; in English: Twi; Language family: Niger-Congo | Wikipedia; closed Wikt | ||