Wikipedia talk:WikiProject Check Wikipedia

Page contents not supported in other languages.
From Wikipedia, the free encyclopedia
 Check Wikipedia  Toolforge   List of Errors   Discussion

dewiki Last dump 2018-11-01[edit]

 Resolved Moin Moin together and Moin Bamyers99, when I open the mainpage of CheckWikipedia I can easily see, that the last full dump for dewiki is now five years old. From the last time I asked for a new, I remember difficults in the setup from this. So I wanted to asked, if it will be possible to setup a newer full dump for dewiki because of nearly 500.000 new articles in it and many changed templates. Bamyers99 what do you think, is it possible? King regards --Crazy1880 (talk) 08:52, 22 October 2023 (UTC)[reply]

@Crazy1880: I will look into running the dump scan after the November 1st regular dump scans are complete. --Bamyers99 (talk) 23:42, 22 October 2023 (UTC)[reply]
Thanks a lot, it will help ;) Regards --Crazy1880 (talk) 17:03, 23 October 2023 (UTC)[reply]
Moin Bamyers99, do you have a status for me? Regards --Crazy1880 (talk) 06:07, 8 November 2023 (UTC)[reply]
@Crazy1880: Based on last months dump scans, it takes until the 12th for the regular dump scans to finish. I will start the dewiki dump scan when those are finished. --Bamyers99 (talk) 14:09, 8 November 2023 (UTC)[reply]
Moin Bamyers99, thank, got ya, it helps me to understand how it works ;) Regards --Crazy1880 (talk) 17:53, 8 November 2023 (UTC)[reply]
@Crazy1880: The dump scan is done. I modified the dump scan so that it added the errors it found to the daily scans for rechecking. That way any dump scan found errors that had already been fixed would not be reported again. --Bamyers99 (talk) 20:00, 15 November 2023 (UTC)[reply]
Moin Bamyers99, really thanks a lot for the update, it will help, i saw that the last days already. ;) King regards --Crazy1880 (talk) 05:53, 16 November 2023 (UTC)[reply]

error #43 on frwiki[edit]

Hello
For some days we have on frwiki some errors #43, all related to {clade} model. (today's list).
When looking, I don't see anything wrong. Could you tell me what I'm missing ?
Thanks. Croquemort Nestor (talk) 09:38, 15 November 2023 (UTC)[reply]

Is it related to checkwiki.pl - limit number template end not found errors ? --NicoV (Talk on frwiki) 09:43, 15 November 2023 (UTC)[reply]
@Croquemort Nestor: I had limited the nested template check to 10 templates deep to reduce the processing for articles with many unterminated templates (caused by a malfunctioning bot). I have increased the limit to 25 nested templates deep. --Bamyers99 (talk) 19:55, 15 November 2023 (UTC)[reply]
Good morning. No change : we still have 15 errors #43 related to {clade} today. Croquemort Nestor (talk) 06:34, 16 November 2023 (UTC)[reply]
Hi Bamyers99. Maybe, it shouldn't report an error if there are more templates than the limit ? --NicoV (Talk on frwiki) 07:01, 16 November 2023 (UTC)[reply]
Fixed. Thanks. Croquemort Nestor (talk) 06:08, 17 November 2023 (UTC)[reply]

#11[edit]

For error id #11, HTML entities, it'd be neat if the list was on all Wikis (SKWiki lacks it, for example), together with the HTML entities. I've never seen some of these, I don't know what they are and what to look for in the code, so a full list of both HTML/Unicode would be quite nice. KormiSK (talk) 16:28, 19 November 2023 (UTC)[reply]

@KormiSK: The descriptions are configured on the translation page. Search for error_011_desc_skwiki=. Feel free to change the description. --Bamyers99 (talk) 23:24, 19 November 2023 (UTC)[reply]

#41[edit]

Errors such as #41, HTML text style element <big> (description on SKWiki only partial), do not say what the new solution should be. This error links to WP:HTML5#Big, which does not exist, so no alternative is actually given. This tag is also normally used by both Source and Visual editors, so I'm not sure what the alternate to this tag even is. KormiSK (talk) 16:28, 19 November 2023 (UTC)[reply]

@KormiSK: Example: <span style="font-size: 150%;">...</span> --Bamyers99 (talk) 23:34, 19 November 2023 (UTC)[reply]

#48[edit]

Errors id #48, Title linked in text, shows previews of pages with this error, but capitalizes the first letter in the shown example. That shouldn't be the case; the error recognizes things like [[title]], but shows them as [[Title]], which is misleading especially when trying to do it with PyWikiBot, for example. Might just learn how to use AWB instead, but it'd be neat in general if the preview matched the actual source text (in all errors, not just this one). KormiSK (talk) 16:28, 19 November 2023 (UTC)[reply]

@KormiSK: Fixed. The next dump scan will show the correct lettercase. --Bamyers99 (talk) 23:54, 19 November 2023 (UTC)[reply]
KormiSK, my bot is approved to run on those. I can fix them if you want. — Qwerfjkltalk 18:38, 10 December 2023 (UTC)[reply]
@Qwerfjkl: If your bot is globally approved, then yeah sure, why not. Although I've mostly fixed it on SKWiki myself recently, but you can definitely do that and ideally regularly as necessary. It'd be appreciated. :) KormiSK (talk) 21:30, 10 December 2023 (UTC)[reply]
KormiSK, sorry, it's only approved on enwiki. You could run the code if you want, it's not that hard to do. — Qwerfjkltalk 21:37, 10 December 2023 (UTC)[reply]

Time out[edit]

Hello ! There is a "Webservice request timed out" message since this morning - french time. Croquemort Nestor (talk) 08:15, 27 November 2023 (UTC)[reply]

Moin Moin Bamyers99, in the german too. Regards --Crazy1880 (talk) 09:29, 27 November 2023 (UTC)[reply]
@Croquemort Nestor, Crazy1880, and Msz2001: It has been restarted and is back up. --Bamyers99 (talk) 14:05, 27 November 2023 (UTC)[reply]
Thanks ! Croquemort Nestor (talk) 14:09, 27 November 2023 (UTC)[reply]
Thanks a lot ;) Regards --Crazy1880 (talk) 16:46, 27 November 2023 (UTC)[reply]

Removal of whitespace between lines[edit]

@Rich Smith: removed style="margin-bottom:1ex;" from br tags on http://en.wikipedia.org/w/index.php?title=The_Four_Burglars&diff=prev&oldid=1187696117 with the comment "v2.05 - Fix errors for CW project (Tag with incorrect syntax)".

Is this in your project guidelines? If so, that makes blocks of text harder to read; using normal paragraphs, on the other hand, results in too much whitespace. Can this guideline be changed?

Thanks,
cmɢʟeeτaʟκ 11:30, 2 December 2023 (UTC)[reply]

@Cmglee: Could use a List - RichT|C|E-Mail 20:12, 2 December 2023 (UTC)[reply]
A list indents text too much, unnecessarily lengthening the caption. Please compare the following. cmɢʟeeτaʟκ 01:01, 3 December 2023 (UTC)[reply]
No whitespace With margin-bottom Using list
Explanation of the Four Burglars card trick with a deck of 12 cards (solid cards are face-up; hatched cards are of the hatch colour, face-down)
1. The four jacks (blue) are revealed, one having three dummy cards hidden beneath (green).
2. The jacks and dummy cards are gathered into a pile.
3. The pile is turned over and placed atop the deck.
4. The top card is placed at the bottom, the second about a third up, the third about two-thirds up and the fourth on top.
5. The top four cards are revealed to be jacks.
Explanation of the Four Burglars card trick with a deck of 12 cards (solid cards are face-up; hatched cards are of the hatch colour, face-down)
1. The four jacks (blue) are revealed, one having three dummy cards hidden beneath (green).
2. The jacks and dummy cards are gathered into a pile.
3. The pile is turned over and placed atop the deck.
4. The top card is placed at the bottom, the second about a third up, the third about two-thirds up and the fourth on top.
5. The top four cards are revealed to be jacks.
Explanation of the Four Burglars card trick with a deck of 12 cards (solid cards are face-up; hatched cards are of the hatch colour, face-down)
  1. The four jacks (blue) are revealed, one having three dummy cards hidden beneath (green).
  2. The jacks and dummy cards are gathered into a pile.
  3. The pile is turned over and placed atop the deck.
  4. The top card is placed at the bottom, the second about a third up, the third about two-thirds up and the fourth on top.
  5. The top four cards are revealed to be jacks.

Is there a way to see the source code behind the detection rules?[edit]

Is there a way to see the source code behind the detection rules? --Pawngpawng (talk) 21:41, 7 December 2023 (UTC)[reply]

@Pawngpawng: checkwiki.pl (GitHub) --Bamyers99 (talk) 21:57, 7 December 2023 (UTC)[reply]

False positives for error #59?[edit]

Reposted with minor modifications from Wikipedia:Bots/Requests for approval/VulpesBot 3

It appears that the task that identifies error #59, br tag at the end of template parameter, has some false positives. Having never encountered this error flag before, I went to the Checkwiki page for error 59 and clicked on a page at random, 2015–16 Ulster Rugby season. That page has an error flagged as |lineup1 = '''Ulster lineup''':<br />, but the actual code on the page, which looks fine to me, is:

|lineup1 = '''Ulster lineup''':<br />
1. Callum Black, 2. Rory Best (c), 3. Wiehahn Herbst,<br />
4. Dan Tuohy, 5. Franco van der Merwe,<br />
6. Iain Henderson, 7. Chris Henry, 8. Nick Williams,<br />
9. Ruan Pienaar, 10. Paddy Jackson,<br />
11. Craig Gilroy, 12. Stuart McCloskey, 13. Darren Cave, 14. Andrew Trimble,<br />
15. Louis Ludik.<br />
Replacements:<br />
16. Rob Herring (for Van der Merwe 66'), 17. Kyle McCall (for Black 58'), 18. Ricky Lutton (for Herbst 70'),<br />
19. Robbie Diack (for Williams 66'), 20. Roger Wilson (for Henry 49'),<br />
21. Paul Marshall (for Pienaar 75'), 22. Ian Humphreys (for Jackson 70'), 23. Peter Nelson (for Ludik 73').
}}

The above appears to be valid, and renders fine. Removing the first br tag would change the rendered output in an undesirable way.

I then clicked on 2019 Women's PSA World Tour Finals, which has a br tag after "Qualification" that does not render unwanted whitespace. If the bot removed the br in that template, it would be a cosmetic edit, which is generally frowned upon.

Given that I was 0-for-2 in choosing articles listed in the report that demonstrated the usefulness of removing the identified br tag, and 1-for-2 in finding a removal that would make the page worse, should this flag be reevaluated and limited to cases in which it is actually a problem? – Jonesey95 (talk) 14:39, 31 December 2023 (UTC)[reply]

@Jonesey95: The 2015–16 Ulster Rugby season error #59 is currently showing lineup1 =...McCloskey''.<br /> in the Notice column. A live check of the article is showing the same thing. The extraneous br is at the end of the template value in a different template than the one shown above. It is being correctly reported. --Bamyers99 (talk) 16:37, 31 December 2023 (UTC)[reply]
I do not understand. Which "different template" has an extraneous br tag at the end of its value? I search through the whole article and did not find one. Also, how can lineup1 =...McCloskey''.<br /> in the Notice column be construed as "correctly reported" when removing that br tag would cause a problem be a cosmetic edit? How is it being "correctly reported" when the br tag shown in the report is not at the end of a template parameter value? Please help me understand. (Edited to add: I found it by searching again, but removal of this tag would be a cosmetic edit, I believe.)– Jonesey95 (talk) 18:06, 31 December 2023 (UTC)[reply]
Update: I clicked on ten more random articles in the list, and I found zero problems. Removal of the trailing br, in each case, would be a cosmetic edit. Are there any actual cases in which this trailing br is a problem? – Jonesey95 (talk) 18:16, 31 December 2023 (UTC)[reply]
I did not write the bot or choose the errata that it reports. I only maintain the bot. CW Error #59 says it is not cosmetic, is a Syntax error and Causes whitespace errors. --Bamyers99 (talk) 18:35, 31 December 2023 (UTC)[reply]
Should I change the documentation then? I see no evidence that it is not cosmetic. If there is anyone else who should be pinged and might be able to explain why it is marked as "not cosmetic", please let me know. Thanks. – Jonesey95 (talk) 19:01, 31 December 2023 (UTC)[reply]
If there's any update on this please ping me as well. Dr vulpes (Talk) 20:17, 31 December 2023 (UTC)[reply]
@Jonesey95 and @Bamyers99 I looked into this and updated my bot request over at Wikipedia:Bots/Requests for approval/VulpesBot 3. Dr vulpes (Talk) 21:06, 31 December 2023 (UTC)[reply]
NicoV, do you have any information about the history of error #59? I have yet to find a case in which it causes a problem. – Jonesey95 (talk) 22:28, 31 December 2023 (UTC)[reply]
Hi @Jonesey95. No real information about the history of this error. When I joined the project, there were already a lot of errors that were coded, probably by the original developer (Stefan Kühn). I believe the intention is that the template should handle the line break if it is needed, not added by each usage of the template.
I looked at 2015–16 Ulster Rugby season and I don't understand why it is flagged : the line break is not at the end of the template parameter, but in the middle of it (as expected, WPCleaner doesn't report any problem, only if you had the line break at the end of the last line). Was there a change of code at some point that made this check reports too many cases ? @Bamyers99, any idea ? The code should only report line breaks when they are the end of the template parameter value. NicoV (Talk on frwiki) 11:03, 3 January 2024 (UTC)[reply]
I have marked #59 as "cosmetic" until there is confirmation that it causes any problems. So far, editors here have looked at about 40 cases and have found none in which extra white space is created. – Jonesey95 (talk) 17:48, 3 January 2024 (UTC)[reply]

#6 with AWB[edit]

I'd really like to tackle task 6 using AWB but am unsure how. It doesn't fall under the genfixes. Is there a regex floating around? How would I do it? —Panamitsu (talk) 10:55, 3 January 2024 (UTC)[reply]

The guidelines should help you figure out appropriate replacements for some of the errors. It looks like a few hundred of them have underscores instead of spaces. Dashes should be replaced with hyphens (in, for example, "DEFAULTSORT:Cross for the War of Independence 1821–29"). The "×" character should be replaced with "x" (in, for example, "DEFAULTSORT:FIS Nordic World Ski Championships 2023 - Team large hill/4 × 5 km"). Those should be a trivial regex replacements. Once you have done those, the refreshed report should be more interesting. When I tested a sample of 200 pages, 156 of them had either _ or – characters. Fixing 3/4 of the report with three regexes would be great progress. – Jonesey95 (talk) 18:14, 3 January 2024 (UTC)[reply]

Excluding categories with {{DISPLAYTITLE:…}}[edit]

Please exclude categories containing {{DISPLAYTITLE:…}}, such as de:Kategorie:vPvB-Stoff, from this task.. The readability of the source is better if the common term (starting with a lowercase letter) is used. Leyo 09:22, 1 June 2023 (UTC)[reply]

The same would hold from for categories transcluding Template:Lowercase title, such as Category:pH indicators. --Leyo 14:40, 11 January 2024 (UTC)[reply]
@Leyo: I don't see the point in checkwiki reporting this issue. Other projects that use Wikipedia dumps should be coded to handle a lowercase category first letter. You can turn the reporting of this off by changing error_018_prio_dewiki=3 to error_018_prio_dewiki=0 in Wikipedia:WikiProjekt Syntaxkorrektur/Übersetzung. --Bamyers99 (talk) 15:40, 15 January 2024 (UTC)[reply]
Well, it's not about turning off the lowercase category first letter check completely. And as shown, there are also cases in en.wikipedia. --Leyo 22:15, 15 January 2024 (UTC)[reply]
@Leyo: #18 is turned off for enwiki. You can add the articles to the Error 018 whitelist. That will stop #18 from being reported for them. --Bamyers99 (talk) 22:36, 15 January 2024 (UTC)[reply]
Thank you. However, it would be more useful to add categories (e.g. de:Kategorie:iOS-Spiel) to the whitelist instead of all affected articles. --Leyo 09:19, 19 January 2024 (UTC)[reply]
@Leyo: The categories can now be added to error_018_templates_dewiki= in Wikipedia:WikiProjekt Syntaxkorrektur/Übersetzung. Even though it is named templates, that is just a standard parameter to get variable data into Checkwiki. They are used as excluded categories in Checkwiki for #18. END needs to be after the last category. You can add additional categories. I could not find an article to test on, so it is untested. --Bamyers99 (talk) 18:28, 19 January 2024 (UTC)[reply]
Thanks a lot for your implementation and the addition of categories to de:Wikipedia:WikiProjekt Syntaxkorrektur/Übersetzung. I have modified de:Die Sims (Computerspiel) so that it may be used as a test case. --Leyo 22:02, 20 January 2024 (UTC)[reply]

no new dump[edit]

Hello ! There are no fresh errors diplayed for two days on fr-wiki, and a strange date for last dump ("2022-05-20 (613 days old")
Thanks for your help. Croquemort Nestor (talk) 05:20, 23 January 2024 (UTC)[reply]

@Croquemort Nestor: The daily scan only scanned a few articles for an unknown reason. I have restarted the recent changes scanner. The dumps do not get processed for daily scanned projects anymore. The dumps for the large wikis take to long to process. I have implemented a program to process the large wiki dumps on request. I have started a dump scan for frwiki. --Bamyers99 (talk) 14:39, 23 January 2024 (UTC)[reply]
Thanks ! Croquemort Nestor (talk) 16:07, 23 January 2024 (UTC)[reply]
Good morning. I'm not sure the script ran fully properly. After two days of inactivity there should have been many errors reported. But it is not the case. For example error # 10 is 10-15 per day and today there are only 5, which looks strange.
Also the table list of errors has not been updated with correct quantities.
Best regards. Croquemort Nestor (talk) 06:24, 24 January 2024 (UTC)[reply]
@Croquemort Nestor: The missed days will not get re-scanned. See the next section for an explanation of the process. I will schedule another dump scan after February 1st. --Bamyers99 (talk) 14:01, 24 January 2024 (UTC)[reply]

Daily scan dewiki[edit]

Moin Moin Bamyers99, the daily scan didn't run yesterday and today, could you have a look at it and run it again? Big thanks --Crazy1880 (talk) 05:46, 23 January 2024 (UTC)[reply]

@Crazy1880: The daily scan only scanned a few articles for an unknown reason. I have restarted the recent changes scanner. --Bamyers99 (talk) 14:39, 23 January 2024 (UTC)[reply]
Moin Bamyers99, I don't know how long the scan will run, but until now, there is no change!? Regards --Crazy1880 (talk) 18:48, 23 January 2024 (UTC)[reply]
@Crazy1880: The process is as follows: 1) Every 6 minutes the recent changes scanner looks at the last 600 changes for dewiki and saves the article titles. 2) Once a day, the Checkwiki processor scans the saved articles for errors. For some unknown reason the recent changes scanner was not working correctly the past 2 days. Article changes were not saved and therefore were not checked by Checkwiki. I can schedule a dump scan after the February 1st dump scans are done, to catch the missed errors. --Bamyers99 (talk) 19:29, 23 January 2024 (UTC)[reply]
Moin Bamyers99, thanks for the explanings, that help to understand it. The scan is running now. Your offer for the dump scan is good, but didn't it took a long time? Regards --Crazy1880 (talk) 10:28, 28 January 2024 (UTC)[reply]

Possible new CheckWiki error: URL with invalid character in domain name[edit]

I haven't investigated in depth, but T350190 suggests a Linter check for invalid characters in domain names. It would probably be better to use CheckWiki to monitor this error. – Jonesey95 (talk) 17:04, 17 March 2024 (UTC)[reply]

@Jonesey95: CheckWiki is in maintenance mode, ie. bug fixes, keeping it running. There are no maintainers adding new rules. I can't speak for WPCleaner which is maintained by NicoV (talk · contribs). --Bamyers99 (talk) 20:02, 17 March 2024 (UTC)[reply]
I can always add such a rule in WPCleaner, independently of it being added to CheckWiki or Linter. If I understand correctly, it's just looking for some unauthorized characters in the part of the URL that's between http:// (or equivalent) and the following / : is it just that ? Linter has the advantage of working on the rendered page, so it's better at detecting problems caused through templates for example. NicoV (Talk on frwiki) 21:02, 17 March 2024 (UTC)[reply]
There is a suggested query at https://quarry.wmcloud.org/query/77756 linked from the Phab ticket. I don't know if it is valid. – Jonesey95 (talk) 17:41, 18 March 2024 (UTC)[reply]

Link points to French wikipedia[edit]

The link to WPCleaner in this line:

Data for the list is compiled from the following locations: WPCleaner • AutoWikiBrowser • Auto-Formatter.

points to the French wikipedia. Is that intentional? 76.14.122.5 (talk) 04:13, 18 March 2024 (UTC)[reply]

Probably not, but which page are you talking about ? NicoV (Talk on frwiki) 07:50, 18 March 2024 (UTC)[reply]
https://en.wikipedia.org/wiki/Wikipedia:WikiProject_Check_Wikipedia/List_of_errors 76.14.122.5 (talk) 20:02, 18 March 2024 (UTC)[reply]
Ah yes, I never took the time to translate into English the list of errors... NicoV (Talk on frwiki) 06:20, 19 March 2024 (UTC)[reply]