Research talk:Page view

From Meta, a Wikimedia project coordination wiki
Jump to: navigation, search

Dialect Specific Directories[edit]

As per T92020:

One of the big improvements of the new definition over the old one is that the old one is not limited to /wiki/. It includes all of the chinese and serbian dialects that have their own folder names and were not appearing, as a result, in the old pageview counts.

James F (thanks James!) pointed out that there are other wikis that do this - see the list at https://meta.wikimedia.org/wiki/Wikipedias_in_multiple_writing_systems#With_Automatic_Conversion_System.

New pageview definition covers every language presented in the previous link. This has been checked using the language tab on country specific wikipedia website (third tab on the top the page, with a dropdown list).

Note: The definition also include zh.wikipedia.org/zh-hans and zh.wikipedia.org/zh-hant event if those two languages are not present in the chinese wikipedia website language tab. There was no hit on those folders on the week 2015-03-[16-22]

Artifical increase in statistics of Romanian Wikipedia[edit]

Here one can find that Romanian Wikipedia added +53% of its popularity (number of page views) last month.

But here one can find most visited pages on Romanian Wikipedia this month

  1. 1 (183 576 views)
  2. 2 (183 470 views)
  3. 6 (183 463 views)
  4. 5 (183 462 views)
  5. 8 (183 456 views)
  6. 4 (183 453 views)
  7. 7 (183 449 views)
  8. 3 (183 446 views)
  9. 9 (183 435 views)
  10. Zero (dezambiguizare) (183 432 views)

Strange. Is not it?

How to fight the artifical increase in the most important Wikipedia statistics? --Perohanych (talk) 04:22, 10 April 2015 (UTC)

Massviews – how to request by API using a pagepile list[edit]

API/REST_v1 offers some ways to get Pageviews data. What about the following features using API calls:

  1. I miss to call massviews defined by a PagePile list.
  2. A wmflabs call of massview offers a simple list day by day that can be downloaded as csv. I'ld prefer to get the data in csv format, e.g. by application/csv or text or something else. (I want to analyze the data inside an own application and want to avoid to store the data manually to a csv file and then load by app.)
  3. What's the best way to get the monthly or yearly summarized views? Of course, I can read the csv file and add 365 values of a year. On the other side, the database has to add 24 values of each day – it should be able to post one number as sum of 24*365 values.

I'ld be happy if someone could tell me a more direct way. Thanks in advance, Juetho (talk) 15:43, 28 December 2016 (UTC)

In the meanwhile, I found massviews URL structure. It works if I call the URL directly from the browser. But I'm not able to call it by an own .NET application using HttpWebRequest class and GetResponse method – always getting "System.Net.WebException: Der Remoteserver hat einen Fehler zurückgegeben: (403) Unzulässig." HTTP status codes says: "... the server is refusing to respond to it. The user might be logged in but does not have the necessary permissions for the resource." How can I tell the wmflabs server by my application that I'm allowed to call these data? -- Juetho (talk) 09:28, 1 January 2017 (UTC)

This question is moved in a more uptodate way to Talk:Pageviews Analysis. -- Juetho (talk) 10:05, 1 January 2017 (UTC)

Is the data comparable with Webalizer?[edit]

We try to compare page views running on our WMDE server with this wiki page views tool. Is this data comparable? Webalizer provides the following description for the metric 'Pages':

"Pages are, well, pages! Generally, any HTML document, or anything that generates an HTML document, would be considered a page. This does not include the other stuff that goes into a document, such as graphic images, audio clips, etc... This number represents the number of 'pages' requested only, and does not include the other 'stuff' that is in the page. What actually constitutes a 'page' can vary from server to server. The default action is to treat anything with the extension '.htm', '.html' or '.cgi' as a page. A lot of sites will probably define other extensions, such as '.phtml', '.php3' and '.pl' as pages as well. Some people consider this number as the number of 'pure' hits... I'm not sure if I totally agree with that viewpoint. Some other programs (and people :) refer to this as 'Pageviews'."

--Stefan Schneider (WMDE) (talk) 14:01, 20 July 2017 (UTC)