Language code
From Meta, a Wikimedia project coordination wiki
The language of a Wikimedia wiki can be found in the lang and xml:lang attributes of the <html> element of each page. These language codes should generally be canonical language tags as defined by BCP 47.
In most cases, the subdomain names that we use for projects correspond to language codes, but there are some exceptions. Usually this is for historical reasons, where a valid ISO 639 code was not available at the time of creation of the project.
Contents |
[edit] Subdomains that do not match the lang attribute
| Subdomain | Language | Project(s) | Notes |
|---|---|---|---|
| als | local: Alemannisch; in English: Alemannic; Language family: Germanic | Wikipedia, Wiktionary, Wikibooks, Wikiquote | Uses gsw which matches the language's ISO 639-3 code. |
| be-x-old | local: Беларуская (тарашкевіца); in English: Belarusian (Taraškievica); Language family: Slavic | Wikipedia | Uses IETF language tag be-tarask. |
| roa-rup | local: Armãneashce; in English: Aromanian; Language family: Italic | Wikipedia, Wiktionary | Uses rup which matches the language's ISO 639-3 code. |
| simple | local: Simple English; in English: Simple English; Language family: Germanic | Wikipedia, Wiktionary, Wikibooks, Wikiquote | Uses en of ordinary English. |
misc:
- All subdomains of wikimedia.org
[edit] Subdomains that do not conform valid ISO 639 language code
| Subdomain | Language | Project(s) | Notes |
|---|---|---|---|
| als | local: Alemannisch; in English: Alemannic; Language family: Germanic | Wikipedia, Wiktionary, Wikibooks, Wikiquote | Alemannic has ISO 639-3 code gsw. ISO 639-3 code als is assigned to Tosk Albanian instead. |
| bat-smg | local: Žemaitėška; in English: Samogitian; Language family: Baltic | Wikipedia | Samogitian has no ISO 639 code as an individual language. A request is pending at the ISO 639-3/RA to assign the code 'sgs' to it. |
| be-x-old | local: Беларуская (тарашкевіца); in English: Belarusian (Taraškievica); Language family: Slavic | Wikipedia | Belarusian in Taraškievica orthography has no ISO 639 code as an individual language but it has IETF language tag be-tarask. |
| cbk-zam | local: Chavacano de Zamboanga; in English: Chavacano de Zamboanga; Language family: Pidgin and Creole | Wikipedia | Chavacano de Zamboanga has no ISO 639 code as an individual language. ISO 639-3 code cbk is assigned to Chavacano, a superset of Chavacano de Zamboanga. |
| eml | local: Emiliàn e rumagnòl; in English: Emilian-Romagnol; Language family: Italic | Wikipedia | ISO 639-3 code eml for Emilian-Romagnol is now retired and split into egl (Emilian) and rgn (Romagnol). However, eml will remain a valid BCP 47 language tag indefinitely. |
| fiu-vro | local: Võro; in English: Võro; Language family: Finno-Permic | Wikipedia | Võro has ISO 639-3 code vro. |
| ksh | local: Ripoarisch; in English: Ripuarian; Language family: Germanic | Wikipedia | ISO 639-3 code ksh is assigned to Kölsch, a subset of Ripuarian. |
| map-bms | local: Basa Banyumasan; in English: Banyumasan; Language family: Sunda-Sulawesi | Wikipedia | Banyumasan has no ISO 639 code as an individual language. ISO 639-1 code jv is assigned to Javanese, a superset of Banyumasan. |
| nds-nl | local: Nedersaksisch; in English: Dutch Low Saxon; Language family: Germanic | Wikipedia | Duplicated with Low German's nds. |
| nrm | local: Nouormand; in English: Norman; Language family: Italic | Wikipedia | Norman has no ISO 639 code as an individual language. ISO 639-3 code nrm is assigned to Narom language instead. ISO 639/3 lumps Norman with French, as with most varieties of northern France. |
| roa-rup | local: Armãneashce; in English: Aromanian; Language family: Italic | Wikipedia, Wiktionary | Aromanian has ISO 639-3 code rup. |
| roa-tara | local: Tarandíne; in English: Tarantino; Language family: Italic | Wikipedia | Tarantino has no ISO 639 code as an individual language. ISO 639-3 lumps it with Italian, as with most varieties of northern Italy. |
| sh | local: Srpskohrvatski / Српскохрватски; in English: Serbo-Croatian; Language family: Slavic | Wikipedia, Wiktionary | sh was originally ISO 639-1 code for Serbo-Croatian but is no longer active. However, it remains a valid BCP 47 language tag. There is the ISO 639-3 code hbs for Serbo-Croatian. |
| simple | local: Simple English; in English: Simple English; Language family: Germanic | Wikipedia, Wiktionary, Wikibooks, Wikiquote | Simple English has no ISO 639 code. |
| zh-classical | local: 文言; in English: Classical Chinese; Language family: Sinitic | Wikipedia | Classical Chinese has ISO 639-3 code lzh. |
| zh-min-nan | local: Bân-lâm-gú; in English: Minnan; Language family: Sinitic | Wikipedia, Wiktionary, Wikibooks, Wikiquote, Wikisource | Min Nan has ISO 639-3 code nan. |
| zh-yue | local: 粵語; in English: Cantonese; Language family: Sinitic | Wikipedia | Cantonese has ISO 639-3 code yue. |
misc:
- tokipona.wikipedia.org (defunct)
- ru-sib.wikipedia.org (defunct)
[edit] Other
| Subdomain | Language | Project(s) | Notes | |
|---|---|---|---|---|
| ak | local: Akan; in English: Akan; Language family: Niger-Congo | Wikipedia; closed WB & Wikt | Are these two Wikipedias in the same language? See Wikipedia article. | |
| tw | local: Twi; in English: Twi; Language family: Niger-Congo | Wikipedia; closed Wikt | ||