Research talk:Non-bot interlanguage linking

From Meta, a Wikimedia project coordination wiki

More details needed

To understand the scope and direction of this project, I would appreciate it if an example of such a debate could be provided and situated in the context of the proposed research. What kind of conclusions can be drawn from such analyses? What kind of assumptions are relevant and could perhaps even be tested? What data, precisely, is being requested? -- Daniel Mietchen - WiR/OS (talk) 12:04, 12 October 2012 (UTC)

What data, exactly, do you need?

If you can give me a few examples of what you are looking for, I might be able to help. I have to admit that I'm starting to worry that you expect us to find the examples you are looking for without a detailed specification. I'm in a position to run database queries for you, but I'm not in a position to perform text analysis without an exact specification. --EpochFail (talk) 14:58, 21 November 2012 (UTC)

Thank you very much for your comment. To start with, the results of the following database queries would be helpful: a search of the comment lines of edits (edit comments) that a. contain "+"; b. contain the character string "iw"; c. contain the character string "интервики".
Would it be possible for you to start with c. in the Wikipedias written in Cyrillic? In case you wish to choose only some of them to start with, please go in this sequence: mn., xal., bxr., mk., sh., sr., bg., ce., os., av., lbe., lez., krc., tt., myv., mdf., mrj., mhr., kv., koi., cv., kk., ba., ky., tk., tg., be., be-x-old., uk., rue., ru. -- C.Koltzenburg (talk) 08:26, 30 November 2012 (UTC)
Just to make sure I understand correctly, I generated a small dataset from mn and xal in which the revision comment meets at least one of the conditions a, b or c (inclusive or).
wiki    rev_id  rev_comment
mn      222221  интервики
mn      53527   интервики
mn      66517   интервики
mn      61975   интервики
mn      103049  интервики
mn      43856   интервики
mn      61979   интервики
mn      103051  интервики
mn      103057  интервики
mn      44339   интервики
mn      44342   интервики
mn      297632  интервики (kk)
mn      43855   интервики
mn      48062   интервики
mn      48132   интервики
mn      66519   интервики
mn      52741   интервики
mn      103054  интервики
mn      45130   интервики
mn      50769   интервики
mn      44341   интервики
mn      44340   интервики
mn      45511   Шинэ хуудас: {{subst:welcome}} Сайн байна уу, та шинэ өгүүлэл оруулахдаа ангилал, интервики оруулаарай. Ж.нь. интерви...
mn      202122  интервики
mn      52744   интервики
mn      206206  интервики
mn      102362  интервики
mn      168456  интервики
xal     2412    интервики никуда не ведут
xal     7363    интервики
xal     8626    а теперь интервики
xal     9924    интервики
xal     9925    интервики
xal     10980   интервики
xal     11465   интервики
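For anyone who wants to re-check a dump like the one above locally, here is a minimal sketch of the "a, b or c" filter in Python. It assumes the data is tab-separated with the three columns wiki, rev_id and rev_comment, and that matching is a plain case-sensitive substring test; the actual database query may have behaved differently. The names `PATTERNS`, `matches`, `filter_rows` and `SAMPLE` are made up for this illustration, and the last row of `SAMPLE` is invented to show a non-match.

```python
import csv
import io

# Search strings from conditions a, b and c in the request above.
PATTERNS = ("+", "iw", "интервики")


def matches(comment: str) -> bool:
    """Return True if the comment contains at least one pattern (inclusive or)."""
    return any(pattern in comment for pattern in PATTERNS)


def filter_rows(tsv_text: str):
    """Yield (wiki, rev_id, rev_comment) for every row whose comment matches.

    Assumes exactly three tab-separated columns per row and no header line.
    """
    reader = csv.reader(io.StringIO(tsv_text), delimiter="\t", quoting=csv.QUOTE_NONE)
    for wiki, rev_id, comment in reader:
        if matches(comment):
            yield wiki, int(rev_id), comment


# Two rows copied from the dataset above, plus one invented non-matching row.
SAMPLE = (
    "mn\t297632\tинтервики (kk)\n"
    "xal\t2412\tинтервики никуда не ведут\n"
    "mn\t1\tтест\n"
)

if __name__ == "__main__":
    for row in filter_rows(SAMPLE):
        print(row)
```

Note that a substring test for "iw" will also catch unrelated words that happen to contain those two letters, so the result of condition b probably needs manual review.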
Yes, and thanks! You understood me correctly, great. Now I just need to find out how to get the actual URLs of the revisions; I will let you know as soon as you can go on, OK? -- C.Koltzenburg (talk) 08:21, 3 December 2012 (UTC)
OK, I found out, so please go on, EpochFail, thanks -- C.Koltzenburg (talk) 09:35, 3 December 2012 (UTC)
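One way to turn a (wiki, rev_id) pair into a URL is sketched below; it is not necessarily the method used in this thread. It relies on the standard MediaWiki `index.php` parameters `oldid` (permanent link to a revision) and `diff=prev` (diff against the previous revision), and it assumes that the wiki code in the first column is also the subdomain under wikipedia.org. The function names `revision_url` and `diff_url` are hypothetical.

```python
from urllib.parse import urlencode


def _base(wiki: str) -> str:
    # Assumption: the wiki code doubles as the Wikipedia subdomain.
    return f"https://{wiki}.wikipedia.org/w/index.php"


def revision_url(wiki: str, rev_id: int) -> str:
    """Permanent link to the page as it looked at this revision."""
    return f"{_base(wiki)}?{urlencode({'oldid': rev_id})}"


def diff_url(wiki: str, rev_id: int) -> str:
    """Link to the diff between this revision and the one before it."""
    return f"{_base(wiki)}?{urlencode({'diff': 'prev', 'oldid': rev_id})}"


if __name__ == "__main__":
    print(revision_url("mn", 297632))
    print(diff_url("xal", 2412))
```

The diff form is probably the more useful one here, since it shows which interlanguage links the edit actually added or removed.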
I'll make another set of queries this weekend. Do you have a repository of some sort that you'd like me to upload the dataset to? It could be a little too large for copy-pasting here or sending via email. --EpochFail (talk) 14:43, 4 December 2012 (UTC)
Thank you, excellent, and yes, here. -- C.Koltzenburg (talk) 05:40, 7 December 2012 (UTC)
The data from the Wikipedias in Cyrillic was received all right, thank you very much, EpochFail -- C.Koltzenburg (talk) 18:01, 11 December 2012 (UTC)