WikiProject Language samples

From Meta, a Wikimedia project coordination wiki
Jump to navigation Jump to search
Pieter Bruegel the Elder - The Tower of Babel (Vienna) - Google Art Project - edited.jpg

The Wikipedia exists in many languages, in which we have articles about languages. We already have a lot of high quality wikipedia-articles about languages in which you can find information about the number of speakers, the vocabulary, stem and grammar of the language. But often one very natural question about a language is still left unanswered: "How does this language sound?".

This project seeks to change this. The goal of this project is to add a small sample of text spoken by a native speaker to all the articles about languages. The first article of the Universal Declaration of Human Rights (UDHR) is an appropriate choice for this, since it is translated in "all" languages of the world, public domain and of the right length to be included to a Wikipedia article.

This is an example of how this could look and sound for the Japanese language:

 
すべての人間(にんげん)は、()まれながらにして自由(じゆう)であり、かつ、尊厳(そんげん)権利(けんり)とについて平等(びょうどう)である。人間(にんげん)は、理性(りせい)良心(りょうしん)とを(さず)けられており、(たが)いに同胞(どうほう)精神(せいしん)をもって行動(こうどう)しなければならない。
subete no ningen wa, umarenagara ni shite jiyū de ari, katsu, songen to kenri to ni tsuite byōdō de aru. ningen wa, risei to ryōshin to o sazukerarete ori, tagai ni dōhō no seishin o motte kōdō shinakereba naranai.
All human beings are born free and equal in dignity and rights. They are endowed with reason and conscience and should act towards one another in a spirit of brotherhood.

Project-History[edit]

The project benefited of a Librivox-project, which also had the goal of creating recordings of the Universal Declaration of Human Rights in over 50 languages. MichaelSchoenitzer imported those to commons, edited the files (see below) and extracted the first article. Through a project in the German Wikipedia they were included in the articles there.

Now it's time to make this into a international community-project – Wikipedians all around the globe can record the first article (or even the whole document) in their mother tongue, and the Wikipedia communities can add them to their language articles.

How-to create a recording[edit]

Gnome-audio-input-microphone.svg

You want to read the first article of the UDHR in your mother tongue? Or you could convince some other person to do so? Awesome.

First: Get a microphone. Cheap Microphones have of course a lower sound quality, but if you follow the descriptions below even a cheap microphone will give reasonable results. If you live in a country with a local chapter you can ask there whether they can help you getting a microphone, for example in Germany you can borrow a microphone at Wikimedia Deutschland.

You can get the translation of the UDHR in your language at OHCHR.org. Find the first article and copy it to an editor or text program and format it in a way you can most comfortably read it. Before starting the recording read it two or three times loud and drink some water. If you misread a word simply read the word or group of words again and later cut out the wrong version.

Very important: when doing the recording make sure you also record at least 5 seconds of silence at the beginning or end of the recording – this is needed for editing. If you never did a recording, we recommend to use the free software Audacity. If you have a passive microphone (without power supply): activate the microphone boost and put the volume control to maximum. For an active microphone make sure the audio is not that high, that you reach the maximum gain when recording. After the recording, mark the part with silence you recoded and click on Effect -> Noise Reduction and click the Get Noise Profile button. After that select the whole recording (Edit > Select > All or the hotkey CTRL + A) and go again at Effect -> Noise Reduction and click the OK button. After that you can remove the silence and if there were any the misread sections by simply selecting them and pressing Del. After that use from the Effect-Menu the filters Compressor, Leveller and Normalizer in this order. The default settings should be fine. When you are done, go on File -> Export audio, choose Ogg Vorbis as format and save the file.

Upload your recording to Wikimedia Commons, put it in the Category Audiorecordings of Article 1 of the Universal Declaration of Human Rights and add it on the listing below.

More tips for high-quality audio samples can be found in: A short guide to the recording of high-quality audio samples for Wiktionary

Project Status[edit]

So far we have recordings of the following languages:

Language Full recording Recording of Artikel 1 German Wikipedia your Wikipedia…

edit

Afrikaans Yes check.svg Done Yes check.svg Done link 1

no
Arabic Yes check.svg Done Yes check.svg Done link 1

Yes check.svg Done
Acehnese Yes check.svg Done Yes check.svg Done link 1

Yes check.svg Done
Balinese Yes check.svg Done Yes check.svg Done link 1

??
Basque X mark.svg Not done Yes check.svg Done link 1

Yes check.svg Done
Brazilian Portuguese Yes check.svg Done Yes check.svg Done link 1

Yes check.svg Done
Buginese Yes check.svg Done Yes check.svg Done link 1

Yes check.svg Done
Bulgarian Yes check.svg Done Yes check.svg Done link 1

Yes check.svg Done
Catalan Yes check.svg Done Yes check.svg Done link 1

Yes check.svg Done
Chinese (Mandarin) Yes check.svg Done, 2 Versions Yes check.svg Done link 1

Czech Yes check.svg Done Yes check.svg Done link 1

Yes check.svg Done
Danish Yes check.svg Done Yes check.svg Done link 1

Yes check.svg Done
Dutch Yes check.svg Done, 2 Versions Yes check.svg Done link 1 link 2

Yes check.svg Done
English Yes check.svg Done, 2 Versions Yes check.svg Done link 1


Esperanto Yes check.svg Done Yes check.svg Done link 1

Yes check.svg Done
Faroese Yes check.svg Done Yes check.svg Done link 1

no
Finnish Yes check.svg Done Yes check.svg Done link 1

no
French Yes check.svg Done, 3 Versions Yes check.svg Done link 1 link 2 link 3

Yes check.svg Done
German Yes check.svg Done Yes check.svg Done link 1

no
Modern Greek Yes check.svg Done Yes check.svg Done link 1

Hebrew Yes check.svg Done, 2 Versions Yes check.svg Done link 1

Yes check.svg Done
Hindi Yes check.svg Done Yes check.svg Done link 1

no
Hungarian Yes check.svg Done Yes check.svg Done link 1

Yes check.svg Done
Indonesian Yes check.svg Done, 2 Versions Yes check.svg Done link 1 link 2

no
Italian Yes check.svg Done, 2 Versions Yes check.svg Done link 1

Yes check.svg Done
Japanese Yes check.svg Done Yes check.svg Done link 1 link 2

Yes check.svg Done
Javanese Yes check.svg Done Yes check.svg Done link 1

Yes check.svg Done
Javanese (Semarang) Yes check.svg Done ToDo no article
Kapampangan Yes check.svg Done Yes check.svg Done link 1

Yes check.svg Done
Korean Yes check.svg Done Yes check.svg Done link 1

Yes check.svg Done
Latin Yes check.svg Done, 2 Versions Yes check.svg Done link 1

no
Latvian Yes check.svg Done Yes check.svg Done link 1

Yes check.svg Done
Luxembourgish Yes check.svg Done Yes check.svg Done link 1

Yes check.svg Done
Malay Yes check.svg Done Yes check.svg Done link 1 link 2

Minangkabauian Yes check.svg Done Yes check.svg Done link 1

Yes check.svg Done
Nynorsk Yes check.svg Done Yes check.svg Done link 1

Todo
"plain" ??? Yes check.svg Done ToDo ???
Okzitanian (Languedocien) Yes check.svg Done Yes check.svg Done link 1

no
Oriya Yes check.svg Done Yes check.svg Done link 1

Yes check.svg Done
Persian X mark.svg Not done Yes check.svg Done link 1


Polish Yes check.svg Done, 2 Versions Yes check.svg Done link 1

Yes check.svg Done
Portuguese Yes check.svg Done, 2 versions Yes check.svg Done link 1 link 2

Yes check.svg Done
Romanian Yes check.svg Done very bad Quality ToDo
Russian Yes check.svg Done Yes check.svg Done link 1

Yes check.svg Done
Swedish Yes check.svg Done, 2 Versions Yes check.svg Done link 1

Yes check.svg Done
Slovak Yes check.svg Done Yes check.svg Done link 1

Yes check.svg Done
Serbian X mark.svg Not done Yes check.svg Done link 1

Todo
Sesotho / South Sotho X mark.svg Not done Yes check.svg Done link 1

Yes check.svg Done
Spanish Yes check.svg Done, 2 Versions Yes check.svg Done link 1 link 2

Yes check.svg Done
Sundanese Yes check.svg Done Yes check.svg Done link 1

Yes check.svg Done
Tagalog Yes check.svg Done Yes check.svg Done link 1

Yes check.svg Done
Tamil Yes check.svg Done, 2 Versions Yes check.svg Done link 1

no
Turkisch X mark.svg Not done Yes check.svg Done link 1

(bad quality)

Todo
Ukrainian Yes check.svg Done Yes check.svg Done link 1

Yes check.svg Done
Urdu Yes check.svg Done Yes check.svg Done link 1

Yes check.svg Done
Walloons Yes check.svg Done Yes check.svg Done link 1

Yes check.svg Done
West Frisian Yes check.svg Done Yes check.svg Done link 1

Yes check.svg Done
Yiddish Yes check.svg Done Yes check.svg Done link 1

no

Add language


Open Questions and Tasks[edit]

  • Should we also make recordings in different dialects?
  • How do we link the audio-files on Wikidata?
  • How do we reach native speakers of small languages?
  • Design a logo for this project