Talk:Pageviews Analysis/Archives/2024/1

From Meta, a Wikimedia project coordination wiki

Topviews 2023 für german-language Wikipedia

Hello all and a very happy new year! As I wanted to write a blog article about the top 10 german-language Wikipedia articles for 2023, I was hoping to get the numbers out of topview – but apparently the monthly numbers haven't been compiled and analysed for 2023, yet. Does anyone know if and when this task will be undertaken? I was thinking about adding the numbers manually, but worry about making a mistake. And then there is the issue of false positives, so I was hoping to get the numbers for 2023 from the topview-tool. I am greatful for any information. Kind regards! Franziska Kelch (WMDE) (talk) 11:06, 3 January 2024 (UTC)

@Franziska Kelch (WMDE): Statistics from the logs are here: https://archive.org/details/2023-top_2k_user_pageviews For all local Wikipedias. They are unfiltered, unedited.
https://pageviews.wmcloud.org/ has filtered data or something, and those results will be sometime in the first 2 weeks of the year. Dušan Kreheľ (talk) 12:06, 3 January 2024 (UTC)
@Dušan Kreheľ Thank you so very much! It is most appreciated! One more question...Am I right to assume that the numbers for Pornhub and the Suits-Index article are false positives? Thank you again, Franziska Franziska Kelch (WMDE) (talk) 12:12, 3 January 2024 (UTC)
@Franziska Kelch (WMDE): Sorry, I won't answer that. As an amateur, I dealt with the processing of large data for pageviews. Dušan Kreheľ (talk) 12:35, 3 January 2024 (UTC)
I see your point. Thanks again. Franziska Kelch (WMDE) (talk) 13:18, 3 January 2024 (UTC)
@Franziska Kelch (WMDE) die Liste im Kurier enthält einige klare false positives:
  • Pornhub war z.B. bereits im Ranking 2022 ausgenommen [1]
  • Die Abrufzahlen von Suits-Index sehen auch manipuliert aus (jahrelang quasi 0, dann ohne Grund massiv nach oben geschossen) [2]
  • Bei Kleopatra VII. sind die Aufzuzahlen zwar nicht manipuliert, aber die meisten Aufrufe wohl nur Testanfragen [3][4]
  • TV Mainfranken hat anscheinend vor zwei Jahren begonnen, die Abzuzahlen zu manipulieren [5], die vorher jahrelang quasi 0 waren
  • ZDF & ARD sind auch unnatürlich hoch, hier aber wie bei Kleopatra nicht aufgrund von Manipulation, sondern weil deren Wikipedia-Artikel häufig durch YouTube verlinkt werden [6]
Realistisch dürfte im Vergleich mit den Listen der letzten Jahre im Artikelnamensraum folgende Top10 für 2023 sein:
  1. Nekrolog 2023
  2. Deutschland
  3. Robert Oppenheimer
  4. ChatGPT
  5. Oppenheimer
  6. Gazastreifen
  7. Israel
  8. Chronologie des russischen Überfalls auf die Ukraine
  9. Periodensystem
  10. Ricarda Lang
Johannnes89 (talk) 18:19, 3 January 2024 (UTC)
Das halte ich nicht alles für klar. Ailura (talk) 20:33, 3 January 2024 (UTC)
I think we need to wait for the tool to update in the coming days to filter out false positives before taking this manual list for granted :) Xia (talk) 11:55, 4 January 2024 (UTC)
@Franziska Kelch (WMDE) It is now published. Apologies for the delay. I don't think myself or anyone at WMF has done much filtering for German Wikipedia, specifically, but I did hide Pornhub and Suits-Index as obvious false positives. All the best and happy new year, MusikAnimal (WMF) (talk) 03:24, 5 January 2024 (UTC)

Why was Pornhub eliminated?--Ailura (talk) 17:30, 11 January 2024 (UTC)

It is not genuine traffic. See https://pageviews.wmcloud.org/topviews/faq/#false_positive for more information. MusikAnimal (WMF) (talk) 22:18, 12 January 2024 (UTC)

Yearly Topviews Analysis has some bug

This page and this page has error. Please inspect. Thanks, Hooman Mallahzadeh (talk) 13:34, 4 January 2024 (UTC)

@Hooman Mallahzadeh: This is not a mistake in the true sense of the word. Data is not accessible via API. The data is not processed or what. The data will be accessible during the first two weeks of the new year. Dušan Kreheľ (talk) 17:05, 4 January 2024 (UTC)
@Hooman Mallahzadeh It is now published. Apologies for the delay, MusikAnimal (WMF) (talk) 03:24, 5 January 2024 (UTC)

Mediaviews for a whole category

Tracked in Phabricator:
Task T245698

Is it possible to show mediaviews for a full commons category? Kristbaum (talk) 22:17, 28 January 2024 (UTC)

This is phab:T245698. This is all completely doable, just will require a lot of work. I hope to get a start on it at the Hackathon coming up this May. MusikAnimal (WMF) (talk) 06:52, 31 January 2024 (UTC)

Massviews by "search" namespace filter

The namespace filter is (to my knowledge) not exposed to mw:Help:CirrusSearch syntax, and also not shown in the Massviews tool interface. By inspecting network requests, I'm happy to see that adds namespace=0 as a condition. However, this is not said in the interface, and, on the results page, the title links to Special:Search without that filter, thus giving the impression to people that the total count is for all namespaces.

On the results page, it links to [7] which matches 12,000 pages, whereas the Massviews tool (correctly) considered the 6,000 main space articles only.

My suggestion would be to fix the title link as first step. Perhaps as a feature request it'd be nice to be able to control it, though personally it's working fine as-is (main space is what I use it for). Thanks for this awesome tool! Krinkle (talk) 13:28, 11 January 2024 (UTC)

Acknowledging this, and commenting so this thread doesn't get archived :)
I plan to get this in as part of a series of updates to come over the next few months. MusikAnimal (WMF) (talk) 16:19, 14 February 2024 (UTC)

How to use massviews to search for all English Wikipedia pages which use references from a source?

Hi all

This is a question rather than a bug report but this seems like a sensible place to ask it. I want to find all the pages on English Wikipedia (and later from other Wikipedias) which use references from a specific source, starting with FAO.

I've managed to work out with help that I can use this query for all articles on English Wikipedia which includes fao.org in the wikitext.

However I've realised this will miss out any references sources from FAO which don't include the URL, e.g if the reference was generated from an ISBN. I think this can be captured if it would be possible to add to the previous search to look for either fao.org or publisher=FAO on the page.

Can someone tell me how to add this to the existing query?

Thanks very much

John Cummings (talk) 08:04, 30 January 2024 (UTC)

Hey MusikAnimal (WMF), Kaldari, Mforns (WMF) do you have any ideas for this? I basically want to work out a way of querying for the combined results of insource:"fao.org" and insource:"publisher=FAO" where its showing the results for if either is present, not for if both are present. Thanks :) John Cummings (talk) 10:13, 1 February 2024 (UTC)
@John Cummings: Send u me over email the page titles (duplicate names, i will remove) and the date interval. Dušan Kreheľ (talk) 16:23, 6 February 2024 (UTC)
Hi, thanks very much for the offer, but I already know how to do that. I need to combine the results into one query in this tool specifically because I need it to be run by people with lower technical skills who can just click one button and get the answer. John Cummings (talk) 00:15, 7 February 2024 (UTC)
Hi @John Cummings! Thanks for reaching out! It seems the Massviews utility uses CirrusSearch to find the pages, and this engine provides limited regular expression capabilities, see: https://www.mediawiki.org/wiki/Help:CirrusSearch#Insource. I've tried to use the query insource:/(fao.org|publisher=FAO)/, which should return all pages containing fao.org or publisher=FAO. It seemed to work, however I don't know if the results are what you expect! Mforns (WMF) (talk) 16:33, 8 February 2024 (UTC)
Hi Mforns (WMF) thanks so much :) Yes this is the same number I got from doing it a very long way round. I'll write this up in the documentation, if you have any suggestions of where to put it please let me know. One issue I have is that because of the special characters you can't acually link to this query in a link, (link) it breaks in both a link on Wiki and also in things like Whastapp, even just copying and pasting the links doesn't work. I think Massviews is making some kind of URLs that are broken in some way, I started a Phabricator ticket about it here. If you have any ideas of how to avoid this I'd greatly appreaciate it. Thanks again, John Cummings (talk) 00:51, 9 February 2024 (UTC)
Update, aparently there was a aspecial character in there causing problems, aparently using insource:fao insource:/(fao.org|publisher=FAO)/ will work and not make the URL break. https://pageviews.wmcloud.org/massviews/?platform=all-access&agent=user&source=search&range=latest-20&project=en.wikipedia.org&sort=views&direction=1&view=list&target=insource%3Afao+insource%3A%2F%28fao.org%7Cpublisher%3DFAO%29%2F.. Thanks, John Cummings (talk) 06:10, 9 February 2024 (UTC)

Number of observers

The number of observers, if less than 30, is not displayed. Xedin (?!) 13:54, 12 February 2024 (UTC)

If there are fewer than 30 page watchers, the exact number is hidden (except for admins). This is something imposed by MediaWiki, not Pageviews Analysis. MusikAnimal (WMF) (talk) 17:43, 13 February 2024 (UTC)

February 17th and 18th down to zero on Wiki.pt

Hey guys. The last two viewing days (February 17th and 18th) are not indicated on the Portuguese Wikipedia. All the articles I consult appear at zero. See here, eg. Regards, Sturm (talk) 03:11, 19 February 2024 (UTC)

It's missing in English too, seems a generalized problem. Igordebraga (talk) 03:27, 19 February 2024 (UTC)
Also in Hebrew Wikiedia. היידן (talk) 04:10, 19 February 2024 (UTC)
I've filled phab:T357910 Framawiki (talk) 15:23, 19 February 2024 (UTC)
@Framawiki: Thanks for the ticket, now marked as resolved, but... Its not working again, 21st February. Sturm (talk) 03:03, 21 February 2024 (UTC)
@Framawiki: Reported: T358132. Dušan Kreheľ (talk) 18:04, 21 February 2024 (UTC)
@Framawiki: I see, it's fixed. Dušan Kreheľ (talk) 09:30, 22 February 2024 (UTC)

Article WP (fr) J. Minot (created February, 17) : Pageviews doesn't work

La requête Statistiques de consultation (Analyse des pages vues) retourne le message d'erreur suivant : J. Minot: Erreur lors de la requête Pageviews API - Not Found Boncoincoin (talk) 08:16, 19 February 2024 (UTC)Boncoincoin

See message just above :) i've filled phab:T357910. Framawiki (talk) 15:24, 19 February 2024 (UTC)
It's OK now. Many thanks. Boncoincoin (talk) 16:22, 19 February 2024 (UTC)Boncoincoin