Talk:Special language codes

From Meta, a Wikimedia project coordination wiki
Jump to navigation Jump to search



how can be add to Wikipedia project its Moravian language mutation? Moravian language hasn't ISO code. Moravia is country in czech republic, but Czech wikipedia censor any information about Moravia. They don't accept Moravian nationality. 11:18, 2 March 2011 (UTC)

Currently, the language proposal policy requires an ISO 639 code before a new language Wikipedia can be created. So I guess the first step would be to get an ISO 639 code, which Moravian doesn't have currently. PiRSquared17 (talk) 22:44, 11 March 2019 (UTC)


The entry about Tarantino is possibly wrong:

ISO 639-3 lumps it with Italian, as with most varieties of northern Italy.

According to Wikipedia, Tarantino is a dialect of Sicilian (language code scn), not Italian (language code ita).
Best regards. Naudefj (talk) 06:55, 25 September 2013 (UTC)

According to linguistic maps, Tarantino is more closely related to the Neapolitan group of dialects than to Sicilian, although Apulian dialects to its south are in the Sicilian group. You can refer to the English Wikipedia article w:Tarantino dialect or to it:Dialetti italiani meridionali. PiRSquared17 (talk) 22:50, 11 March 2019 (UTC)
This is not contested that this is a Southern dialect, but yes the sentence used may be confusive (as if it was saying it was a Northern Italian dialect like Emilian, Friulan, Genovese, Ligurian, Piemontese or Venitian... that have their own ISO 639-3 codes! that sentence about northern dialects is in fact false for most important Northern dialects). Anyway I did not create that sentence, just fixed the translatability. I had noted that statement but waited for reactions of Italians. But it's a fact that Tarantino is still not separately encoded and many documents from this area are tagged with "it" (ISO 639-1) and then do not even use the ISO639-3 code [scn] of Sicilian as well (and also frequently merge into [it] the contents in Sardinian, or even the Corsican [co] variant in North Sardinia, derived from medieval Genovese. verdy_p (talk) 02:10, 12 March 2019 (UTC)
Per File:Italian Dialects.png (label #39 in Puglia) and w:Tarantino dialect, I think it makes more sense to talk about roa-tara as part of nap than as part of scn. But Ethnologue lists nap as Neapolitan/Calabrese-Lucanian, without reference to Barese/Tarantino (or Abruzzo, Molise, southern Lazio dialects for that matter). On the other hand, Ethnologue does say that "Pugliese" is included in scn, but I assume they mean Salentino and not the northern dialects of Puglia. PiRSquared17 (talk) 02:36, 12 March 2019 (UTC)
Note that this page was initially created (by someone else) to create a more complete reference for an old request (sent in 2015) to Unicode CLDR. This request has been denied and closed because the submitter wanted to add some of the non-standard Wikimedia (& language codes to the CLDR project (which correctly honors only requests about standard codes): this is not the problem of CLDR but a pure problem of Wikimedia alone (which also infects other projects, including many incorrect tagging spread in their wikis, including the content of the Wikidata database). Note that wikimedia "interwikis codes" are not a problem (they are not restrited to just language codes and are only used to create links to Wikimedia domain names, which are not part of the language tagging: there's no such restriction to use only BCP 47 tags in DNS domain names, so this is also not a request to change the interwiki codes, or Wikimedia domains, or internal database names or aliases).
Here we are only concerned by what we need in CLDR data (for translation of labels or some comment properties), in (for translations of MediaWiki UI or some of its extensions or opensourced bot scripts), and in what we insert in lang="..." and xml:lang="..." attributes in HTML/XML/CSS (which must conform to BCP 47).
And this page currently incorrectly states that some codes are invalid when they are still perfectly correct for BCP 47 (even if some of them they have been retired in ISO 639, or recategorized from single languages to macrolanguages or deprecated for splitting them in separate languages, such as old codes [bh] for Bihari or [qu] for Quechua). As well we continue remaining concerned about the non-conforming use of pseudo-variant extensions (that should have used private use codes or should have been registered by a formal request to the IANA database, such as "formal" for German and "informal" for Dutch, because they only exist to satisfy user requests and solve conflicts in some Wikimedia projects). Even the [simple] code used by Wikimedia for interwiki or domain names is valid for such use, but invalid for use in lang=..." and xml:lang="..." attributes.
As well [zh-yue] remains perfectly valid in BCP 47 (even if it is now deprecated in favor of [yue] by a specific entry added to the IANA database. So there's no emergency to change these codes, unlike the very incorrect codes [als] and [nrm] used in Wikimedia!). verdy_p (talk) 18:36, 12 March 2019 (UTC)