Research talk:Characterizing Wikipedia Reader Behaviour/Demographics and Wikipedia use cases

From Meta, a Wikimedia project coordination wiki
Jump to navigation Jump to search

Work log

20 %[edit]

20 % is a significant number for language switches in the same session. If it's anywhere near that, or even just above 10 %, I'd say it validates the multilingualism of Wikimedia projects. Nemo 08:33, 19 January 2019 (UTC)

Note that in contrast, mw:Universal_Language_Selector/Compact_Language_Links/metrics/data says "In June 2016 0.2125% of Wikipedia visitors in all languages clicked on interlanguage links. In June 2017 it was 0.4042%." It's not quite clear to me though what definition of "visitors" was used there.
Diego and I talked a bit recently about this difference, but haven't found a clear explanation yet.
Regards, Tbayer (WMF) (talk) 01:42, 4 February 2019 (UTC)

Interested languages[edit]

If you would like to see the study of reader demographics in your language, please fill out the information in the table below.

Language Point of Contact Documents to translate Status
Arabic Abbad Feasible
English LZia_(WMF) Feasible
Hebrew Amire80 Feasible
Polish Aegis Maelstrom Evaluating
Russian Kaganer Feasible
Danish fnielsen Feasible - time permitting
Persian Mardetanha Feasible
Armenian Davit Saroyan (WMAM) Evaluating (sampling challenges due to monthly pageview counts)
French Rémy Gerbet WMFr and Ilario Feasible
Chinese Liang-chih ShangKuan (WMTW) Feasible
Romanian Strainu Feasible
Ukrainian Юрій Булка Feasible
Portuguese Chicocvenancio (pt-br), DarwIn (pt-pt) Feasible (with one translation)
Norwegian Bokmål Jon Harald Søby (WMNO) Feasible
Norwegian Nynorsk Jon Harald Søby (WMNO) Very low pageviews which makes it hard to sample.
Basque Theklan sampling challenges due to monthly pageview counts
Italian Ilario Feasible - time permitting
German Ilario Feasible
Hungarian Tgr Feasible
Sakha (sah) HalanTul Pageviews are too low to make this specific research possible.
Spanish Feasible
Japanese Feasible
Hindi Feasible
Indonesian Feasible

@Strainu: @Tgr: @Amire80: @AWossink: @Antanana: @Lyzzy: @Shangkuanlc: @Whym: @Kaganer: @عباد ديرانية: @Satdeep Gill: @Racso: @Hasive: Hi. Given that you were the point of contact for the previous round of the study of readers in 14 languages, I am pinging you here to make sure you have seen my recent message to wikimedia-l and you can act on it if you wish.

@LZia (WMF): Thank you for the pinging, I don't monitor Wikimedia-l regularly. I am glad this is moving forward & can't wait for the documents to translate come out! --Liang(WMTW) (talk) 16:01, 7 March 2019 (UTC)

@Mardetanha: Let me know if I should sign up fawiki. I had guilty conscious the other time when I learned that you wanted to act on it but you didn't get a chance to do it. I'd be happy to do the bulk of the translation work if you're generally up for it and you find it useful for the work you all do in fawiki. Of course, if there is someone else from the fawiki community who can help me, I'd love that. Let me know.

@Anthere: In Wikimania 2018, you raised a good question about why we don't include Africa specific languages. Let's change it for this new round of research. Which languages you would recommend we consider? (I can check the traffic to those languages to see if running the survey in them would be possible). Arabic and English were present in the previous survey. I'm hoping we can at least add French, and a couple of more languages of your suggestion though for French I'll need indication of interest from the Wikipedia French language community. :) --LZia (WMF) (talk) 21:37, 6 March 2019 (UTC)

Hi. I would say you could pick up Hausa, Swahili, Igbo, and Yoruba. Other options, Lingala, Luganda, Amharic, Shona. Possibly Afrikaans as well would be an option. Thanks Anthere (talk)
@Anthere: We looked into the languages above. At most we see ~2 million monthly pageviews which means we can't sample the readers for the survey if we want to have enough responses for reliable statistics. The debiasing step to make sure the results are reliable can also run into issues as we will be showing the surveys to everyone, and we know only a very small percentage will participate. We now do have fr, en, ar and we can sample by country to focus on specific countries in Africa and get more data from them. We have decided to go with those languages for this part of the study. If you see a point we have missed, please flag it in the coming week. --LZia (WMF) (talk) 18:24, 2 April 2019 (UTC)
Got you ! Thank you Anthere (talk)
@LZia (WMF): permanent question - where is translatable survey materials, for improving and profreading? --Kaganer (talk) 22:44, 6 March 2019 (UTC)
@Kaganer: The three questions you worked on for the previous study are at . Feel free to improve them. I'd appreciate if you let me know a summary of improvements you make. The 5 new questions are not fully finalized, yet. Once they're available, we will ping you here and share the link to those as well. I'm looking forward to working with you on this project. :)--LZia (WMF) (talk) 22:49, 6 March 2019 (UTC)

User:LZia (WMF): I can help User:Mardetanha so it would be faster and less work if that's okay for you two. Amir (talk) 15:37, 7 March 2019 (UTC)

@Ladsgroup: That's excellent. Thanks! Let's do it then. (And I see that Persian is already added to the table.) --LZia (WMF) (talk) 17:30, 7 March 2019 (UTC)
more the merrier. Thanks for the offer Amir Mardetanha talk 10:05, 8 March 2019 (UTC)

@LZia (WMF): I can help with Arabic. Should we start by translating the page with the three questions, in the link you provided above? --Abbad (talk) 10:16, 8 March 2019 (UTC).

I have put three languages because for me (and for Wikimedia Switzerland) the community is split in 4 main languages (including English). If this study can be country based, it would be the best. --Ilario (talk) 19:13, 11 March 2019 (UTC)


Hi all -- thank you for volunteering and your patience! At this point, we have compiled the translation templates for each language. Below are the templates for each language. Each one has the English text on the left-hand side and space for you to write in translations on the right-hand side. You will need to request access to the document with whatever email address you prefer. If you have any questions or concerns, please do not hesitate to reach out. We did our best to find translations from previous projects or external resources to reduce your burden and these are pre-filled in the templates. You may leave them as is or make updates if there is something obviously wrong with them.

We would like to have this stage completed in two weeks if at all possible, which is 4 June 2019. When you finish, let me know and we can set up a call in order to talk through the translation and answer any questions.

Just a reminder of the project overview:

Best and thanks again! --Isaac (WMF) (talk) 18:07, 21 May 2019 (UTC)

@Isaac (WMF): access requested. By way, why there was not used TranslateWiki or Translate extension in Meta? --Kaganer (talk) 20:35, 21 May 2019 (UTC)
Yeah, good question @Kaganer:. Two reasons: 1) because we are seeking to use these questions for research, we want to be careful about having a single, agreed-upon translation. Similar to last time with the reader motivation questions, the thought is to do them through a more carefully controlled process and then add them via the translate extension once they are finalized, and 2) we were also aiming for simplicity (keeping all the work that had been done and needed to be done in one place for each language). The privacy policy is actually hosted on and so the translations will be added there and the questions/survey language will remain here on meta. Furthermore, the parts of the privacy policy that are already translated for some languages do not break neatly into paragraphs. Given that this workflow worked pretty well last time and we were unaware of a good way to be both simple and careful via meta or translatewiki, we have opted to go this route. Hopefully that makes sense. If not, let me know. --Isaac (WMF) (talk) 21:26, 21 May 2019 (UTC)
@Isaac (WMF): Russian is Yes check.svg Done. --Kaganer (talk) 23:52, 21 May 2019 (UTC)
Hebrew is done. I think it's OK, but it would be nice if anyone could review it. --Amir E. Aharoni (talk) 07:39, 22 May 2019 (UTC)