User talk:InternetArchiveBot/Archives/2023

From Meta, a Wikimedia project coordination wiki

Not a dead link

In this edit the bot states that the link is dead (see here), but that is not the case. Is there an error somewhere? SportsOlympic (talk) 23:13, 1 January 2023 (UTC)

Same problem here. This link is not dead, but IAB tags it as such: [1]. --Edelseider (talk) 07:19, 3 January 2023 (UTC)
@SportsOlympic and Edelseider:
If you view the user page, you will see that you are given some ability to do things ... https://iabot.toolforge.org/ There are MANAGE URL and REPORT A PROBLEM components.  — billinghurst sDrewth 00:53, 4 January 2023 (UTC)
I have tested and reported SportsOlympic's as LIVE. @Edelseider: For your link, I get This site can’t be reached. inventaire-strasbourg.grandest.fr took too long to respond.  — billinghurst sDrewth 01:01, 4 January 2023 (UTC)
@Billinghurst: thanks for your reply. My link opens instantly whenever I click on it. How could that be? --Edelseider (talk) 07:05, 4 January 2023 (UTC)
@Edelseider: Talk to your provider about the routing, etc. Run some routing checks from around the place. You are too close to yourself, and can be on the good side of firewalls, server-side issues, and the like. It is failing from Wikimedia to you, and from me to you, which are two completely different places in two different parts of the world, so it is unlikely to be the broader internet and more likely something close to your site.  — billinghurst sDrewth 08:58, 4 January 2023 (UTC)
@Billinghurst: thanks again for your reply. To prevent any further misunderstanding: when I wrote "my link", I did not mean "the link to my own website", but simply "the link that I am referring to" - I am completely foreign to https://inventaire-strasbourg.grandest.fr/gertrude-diffusion/. I suppose that this routing issue will resolve itself over time. Best wishes, --Edelseider (talk) 09:14, 4 January 2023 (UTC)
This section was archived on a request by: Harej (talk) 21:14, 18 January 2023 (UTC)

HTTPS

Hi! Would it be possible to modify the bot so that if the original source citation is http://example.com, the site is now https://example.com, and only the https form is in the Internet Archive, the bot also checks the https form? I found one article where the bot had marked a source dead, but I found it in the archive just by replacing http with https. --HenriHa (talk) 08:05, 6 January 2023 (UTC)

HenriHa, while that is a good idea to do where it is possible, to have the bot do this, in every instance where it could, would result in the bot making about double the requests to the Wayback Machine. So unfortunately it would not scale well. Harej (talk) 22:07, 11 January 2023 (UTC)
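The trade-off described above can be sketched in a few lines. This is only an illustration, not the bot's code: the function name `scheme_variants` is hypothetical, and the sketch simply lists the archive lookups that would be needed if both forms were tried. Every http or https citation yields two candidates, which is where the roughly doubled request count comes from.

```python
from urllib.parse import urlsplit


def scheme_variants(url):
    """Return the cited URL followed by its http/https counterpart.

    Hypothetical helper for illustration; it does not query any archive.
    """
    parts = urlsplit(url)
    if parts.scheme not in ("http", "https"):
        # Other schemes have no obvious counterpart, so only one lookup.
        return [url]
    other = "https" if parts.scheme == "http" else "http"
    # Two candidates per URL means two archive lookups instead of one.
    return [url, parts._replace(scheme=other).geturl()]


if __name__ == "__main__":
    print(scheme_variants("http://example.com"))
```

One way to limit the cost would be to try the second form only after the first lookup finds nothing, although that still doubles the requests for every link that has no archive at all.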
This section was archived on a request by: Harej (talk) 21:14, 18 January 2023 (UTC)

about.com usurped, now blacklisted



This domain is now used only for marketing spam through redirects: every subdomain now redirects to an unreliable data source, with no consideration for anything trustworthy. I have blacklisted the domain, as all new links are clearly spam. We need to manage the existing links. Please mark it as usurped. Thanks.  — billinghurst sDrewth 07:50, 7 January 2023 (UTC)

billinghurst, we have marked about.com and its approximately 1,000 subdomains as permadead. It may take a while for the bot to go through and fix all of the URLs. Harej (talk) 22:21, 11 January 2023 (UTC)
@Harej: Yep <sad face>. Thanks to the team of people here for fixing. As mentioned at enWP, I have suspended the blacklisting to allow for clean up, and just need to be pinged to reimpose that blacklisting. I am guessing that there will be some new listings arriving during that time, but ... meh!  — billinghurst sDrewth 22:28, 11 January 2023 (UTC)
The links should be usurped with steps at WP:USURPURL. I'm doing it on Enwiki with WaybackMedic (which is most of them), the other language editions will have to be via IABot as normal dead links, no usurpation. -- GreenC (talk) 22:55, 11 January 2023 (UTC)
This section was archived on a request by: Harej (talk) 21:15, 18 January 2023 (UTC)

fi-wiki: Selkäkipu, source removed

The bot removed a source from the article Selkäkipu. Is this a mistake? [2] --HenriHa (talk) 00:40, 10 January 2023 (UTC)

Hi HenriHa. I'm not managing IABot, but I can tell that it's not a mistake. That source (named ":85") is being used 3 times in the article, instead of writing the entire "{{Lehtiviite|Tekijä=Michael A. Adams, Manos Stefanakis, Patricia Dolan|Otsikko=Healing of a painful intervertebral disc should not be confused with reversing disc degeneration: Implications for physical therapies for discogenic back pain|Julkaisu=Clinical Biomechanics|Ajankohta=2010-12|Vuosikerta=25|Numero=10|Sivut=961–971|Doi=10.1016/j.clinbiomech.2010.07.016|Issn=0268-0033|www=https://pubmed.ncbi.nlm.nih.gov/20739107/}}" three times, one is enough. A more detailed explanation can be found at en:Help:Footnotes#Footnotes: using a source more than once. ~StyyxTalk? 22:20, 11 January 2023 (UTC)
This section was archived on a request by: Harej (talk) 21:24, 18 January 2023 (UTC)

Khmer Wikipedia no need InternetArchiveBot

Khmer Wikipedia no need InternetArchiveBot broken link unknown. 27.109.114.127 03:26, 11 January 2023 (UTC)

Can you clarify your question? I am not sure what you are asking. Harej (talk) 22:24, 11 January 2023 (UTC)
This section was archived on a request by: Harej (talk) 21:26, 18 January 2023 (UTC)

Please stop

Your bot makes strange changes to articles, for example in the Russian Wikipedia: it inserts extra empty pages and extra spaces into <code>{{Публикация...}}</code>. As a result, the text (article code) becomes less readable. Please stop this. Thanks. P.S. Your bot also creates archives for the Google Books site, where the text of the book cannot be read. — Grumbler eburg (talk) 20:13, 11 January 2023 (UTC)

Grumbler eburg, the syntax issue should now be fixed as of version 2.0.9.3. As for Google Books we have recently removed those archives as well. Harej (talk) 22:26, 11 January 2023 (UTC)
This section was archived on a request by: Harej (talk) 21:26, 18 January 2023 (UTC)

Is there a "force snapshot" feature planned?

I don't know if this question is relevant to a possible feature of InternetArchiveBot.

There have been several instances where I find a dead link, and I find no corresponding page archived on the Wayback Machine. Or worse, the Wayback Machine archived the 404 page of the target.

I was wondering if there was a way, during IABot's scans, to have IABot cause the Wayback Machine to make a snapshot of a page that isn't dead yet, if a snapshot doesn't already exist? Anachronist (talk) 16:53, 7 January 2023 (UTC)

Anachronist, whenever an external link is posted on Wikipedia or any Wikimedia wiki, it is picked up by the Wayback Machine for archiving by an automated process. Although that process is underway, unfortunately it does not always work. Harej (talk) 22:23, 11 January 2023 (UTC)
@Harej: Thanks, it is good to know that the Wayback Machine picks up new external links. I was referring, however, to external links that already exist, and wondered if there was some sort of bot that goes through these and triggers the Wayback Machine to snapshot it if the link isn't dead, and the link has no corresponding snapshot yet. Anachronist (talk) 01:16, 12 January 2023 (UTC)
Anachronist, the bot technically has this feature, but it has been disabled since it makes the bot extremely slow (and it is already pretty slow). That said, what we could do is produce a dump of external links on Wikipedia and then run that through the Wayback Machine as a one-time job to cover those links that were added before we started working off of the recent changes feed. I will look to see how feasible that is. Harej (talk) 21:24, 18 January 2023 (UTC)
@Harej: That sounds like a good idea. The only catch would be to have some way to avoid archiving links that are already 404. I see a lot of those 404 pages archived on the Wayback Machine already. Certainly links already tagged as "dead" on a Wikipedia page should not be snapshotted, but maybe the list of links could be run through a script that checks to see if the target website returns code 404 in the reply header, instead of 200 or 301. (Also the list should be sorted first to remove any duplicates.) Anachronist (talk) 00:08, 19 January 2023 (UTC)
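The filtering step suggested above could look roughly like the following. This is a minimal sketch under stated assumptions, not an existing IABot feature: `select_for_snapshot` and `head_status` are hypothetical names, the accepted status codes (200 and 301) are taken from the comment above, and the status lookup is passed in as a function so the selection logic can be tried without any network access.

```python
import http.client
from urllib.parse import urlsplit

# Status codes treated as "worth snapshotting", per the suggestion above.
LIVE_STATUSES = {200, 301}


def head_status(url, timeout=10):
    """Send one HEAD request and return the status code without following redirects.

    Illustrative only; a real job would also need error handling and rate limiting.
    """
    parts = urlsplit(url)
    if parts.scheme == "https":
        conn = http.client.HTTPSConnection(parts.netloc, timeout=timeout)
    else:
        conn = http.client.HTTPConnection(parts.netloc, timeout=timeout)
    try:
        path = parts.path or "/"
        if parts.query:
            path += "?" + parts.query
        conn.request("HEAD", path)
        return conn.getresponse().status
    finally:
        conn.close()


def select_for_snapshot(urls, fetch_status=head_status):
    """Drop duplicates, then keep URLs whose status is 200 or 301."""
    seen = set()
    keep = []
    for url in urls:
        if url in seen:
            continue  # duplicate entry in the link dump
        seen.add(url)
        if fetch_status(url) in LIVE_STATUSES:
            keep.append(url)
    return keep
```

Passing a stub such as `{"http://a.example/": 200}.get` as `fetch_status` exercises the deduplication and filtering without contacting any site.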
This section was archived on a request by: Harej (talk) 19:57, 25 January 2023 (UTC)

Syntax error in March 2021

Tracked in Phabricator:
Task T327345

Hello bot authors, on the Catalan Wikipedia, in https://ca.wikipedia.org/w/index.php?title=Perit&diff=26767761&oldid=25406639 the bot interrupted an nbsp sequence. Maybe the error in the bot was fixed long ago and only the resulting error in the article has remained until now? --Himbeerbläuling (talk) 07:53, 16 January 2023 (UTC) (The affected part can be found with a browser phrase search for "p {{W".) --Himbeerbläuling (talk) 07:55, 16 January 2023 (UTC)

Himbeerbläuling, thank you for your report. We are tracking the issue on Phabricator. (Good find!) Harej (talk) 21:40, 18 January 2023 (UTC)
This section was archived on a request by: Harej (talk) 19:59, 25 January 2023 (UTC)

503 Service Unavailable

Reason behind this issue? Is it based on IABOT itself? If so, will it be fixed? --► Sincerely: SolaVirum 10:48, 27 January 2023 (UTC)

@Toghrul R:, any ideas? --► Sincerely: SolaVirum 11:37, 27 January 2023 (UTC)
@Solavirum, see the existing section: § IABot throwing 503 error czar 16:49, 29 January 2023 (UTC)
This section was archived on a request by: czar 16:49, 29 January 2023 (UTC)

Hungarian Wikipedia

Hello bot authors, on Hungarian Wikipedia, if you find only a link between brackets (e.g. hu:Halálozások 2017-ben)

instead of this form

[http://szabadpecs.hu/metal/item/1374-elhunyt-dr-stark-andras-pecsi-pszichiater-filmklubvezeto] {{Wayback|url=http://szabadpecs.hu/metal/item/1374-elhunyt-dr-stark-andras-pecsi-pszichiater-filmklubvezeto |date=20171210180602 }}

please use this one

[https://web.archive.org/web/20171210180602/http://szabadpecs.hu/metal/item/1374-elhunyt-dr-stark-andras-pecsi-pszichiater-filmklubvezeto]

Your current form does not fit in the table. Csurla (talk) 12:09, 17 January 2023 (UTC)
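The relationship between the two forms above can be shown with a small sketch. It assumes only what the examples show: the `url` and `date` values of {{Wayback}} map onto a web.archive.org address of the form `/web/<date>/<url>`. The function name `wayback_link` is hypothetical and is not part of the bot.

```python
def wayback_link(url, date):
    """Build the direct Wayback Machine address from the {{Wayback}} arguments.

    Hypothetical helper; `date` is the 14-digit timestamp used in the template.
    """
    return f"https://web.archive.org/web/{date}/{url.strip()}"


if __name__ == "__main__":
    original = ("http://szabadpecs.hu/metal/item/"
                "1374-elhunyt-dr-stark-andras-pecsi-pszichiater-filmklubvezeto")
    # Wrapped in single brackets, this is the bare-link form requested above.
    print("[" + wayback_link(original, "20171210180602") + "]")
```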

Csurla, unfortunately the bot can't distinguish between links in tables and otherwise. So to prevent the bot from using {{Wayback}} for standalone deadlinks, we would have to turn that feature off for the entire wiki. If you are proposing that you would need to show that the community prefers this. Harej (talk) 21:49, 18 January 2023 (UTC)

This bot is also active on dewiki, but similar articles are not edited there (eg: de:Nekrolog Dezember 2019).

How is it possible? I want the same thing on huwiki. Csurla (talk) 14:52, 24 January 2023 (UTC)

Csurla, on German Wikipedia they have the bot set to only modify links in references. This operating mode can be set for Hungarian Wikipedia if the community reaches a consensus to do that. Harej (talk) 21:11, 25 January 2023 (UTC)
This section was archived on a request by: Harej (talk) 17:57, 1 February 2023 (UTC)

add new archive link for nymphalidae butterflies

would like to add an archive link to the distribution of nymphalidae butterflies, specifically the "afrotropical butterflies: nymphalidae". the archive link is https://web.archive.org/web/20160304000050/http://atbutterflies.com/downloads/nymphalidae_limenitidini.doc but i don't know how -tynjee (talk) 13:21, 21 January 2023 (UTC)

-tynjee, what article would you like to add that link to? Harej (talk) 16:18, 23 January 2023 (UTC)
@Harej: 'almost every' article about butterflies that's part of the nymphalidae family but in africa -tynjee (talk) 02:17, 24 January 2023 (UTC)
-tynjee, I added the archive link to those pages where that URL already appeared. Those pages are: Neptis serena, Neptis sextilla, Neptis strigata, Neptis swynnertoni, Neptis troundi, Neptis vindo, Neptis vingerhoedti, Neptis woodwardi. Adding the archive link to additional pages when the link didn't already appear there to begin with would be out of scope for this bot. Harej (talk) 19:56, 25 January 2023 (UTC)
This section was archived on a request by: Harej (talk) 17:57, 1 February 2023 (UTC)

All batch jobs stalled

The five currently-running batch jobs (11994, 11995, 11996, 11997, and 11998) are all currently stalled at zero progress. It appears that the most recent batch-job edit was this one nearly two weeks ago. Is this related to the error noted in the section above which cropped up yesterday? Whoop whoop pull up Bitching Betty ⚧️ Averted crashes 17:16, 25 January 2023 (UTC)

Whoop whoop pull up, the job queue is partially down as we migrate it to a new system. It should be back at the moment, albeit at reduced capacity. Harej (talk) 21:19, 25 January 2023 (UTC)
This section was archived on a request by: Harej (talk) 17:58, 1 February 2023 (UTC)

Bot is marking sites dead that are not

https://en.wikivoyage.org/w/index.php?title=Indianapolis&diff=prev&oldid=4605715 https://megabus.com/ is not dead. —Justin (koavf)TCM 11:14, 26 January 2023 (UTC)

Koavf, I am having trouble loading that website on my home connection, but it seems to work when I use a VPN. But this differentiated experience would explain why the bot detected it as offline even though it works for you and probably others. In any event, for that specific article, https://us.megabus.com may be a more appropriate link (and one that loads without problem on my connection). Harej (talk) 19:40, 26 January 2023 (UTC)
Agreed that your proposed URI is better, and I did replace it. Thanks. —Justin (koavf)TCM 20:06, 26 January 2023 (UTC)
This section was archived on a request by: Harej (talk) 17:58, 1 February 2023 (UTC)

IABot throwing 503 error

IABot fix-links tool is throwing 503 Service Unavailable error when run on en WP articles. Cheers! WikiWikiWayne (talk) 09:28, 25 January 2023 (UTC)

WikiWikiWayne, the management interface was down earlier but should be back up. Let us know if this is not the case. Harej (talk) 21:14, 25 January 2023 (UTC)
Thanks James! Cheers! WikiWikiWayne (talk) 21:30, 25 January 2023 (UTC)
James – The IABot Management Interface is again throwing the "503 Service Unavailable" page error. It was only fixed briefly. Cheers! WikiWikiWayne (talk) 18:56, 27 January 2023 (UTC)
Getting this error now: "DB ERROR : QUERY: CREATE DATABASE IF NOT EXISTS s51059__cyberbot; ERROR - : Error encountered while creating the database. Exiting..." just from visiting the website. Paper9oll (talk) 13:04, 28 January 2023 (UTC)
@Cyberpower678 and Harej: - IABot Management Interface now throwing new results error: "DB ERROR : QUERY: CREATE DATABASE IF NOT EXISTS s51059__cyberbot; ERROR - : Error encountered while creating the database. Exiting..." - Cheers! WikiWikiWayne (talk) 15:01, 28 January 2023 (UTC)
WikiWikiWayne, there has been an ongoing operational outage and that error will display while this is ongoing. Everything should be back online within the next several hours. Harej (talk) 21:13, 1 February 2023 (UTC)
This section was archived on a request by: —CYBERPOWER (Chat) 03:26, 2 February 2023 (UTC)

False positive on rueWIKI

Igor Kercsa (talk) 21:00, 26 January 2023 (UTC)

One year ago I stopped the bot because of this. Now it is happening again, and you have blocked the opportunity for users to stop it. Have mercy on us.--Igor Kercsa (talk) 08:53, 28 January 2023 (UTC)

@Igor Kercsa: Sorry about that. That was a mistake on our part. Twirpx has been whitelisted. The website blocked all Wikimedia IPs from accessing it.—CYBERPOWER (Chat) 16:17, 3 February 2023 (UTC)
This section was archived on a request by: Harej (talk) 14:57, 10 February 2023 (UTC)

Bot flooded the Wuhan article in Tagalog Wikipedia with spaces

Hi, it seems that there is a bug in your bot. Check the article tl:Wuhan. It flooded with empty spaces in the past months. I reverted it to the version without the flooding. Please fix it or turn-off the bot if this cannot be fixed immediately. Thanks. --Jojit (talk) 01:24, 27 January 2023 (UTC)

Thank you for the report, Jojit fb. It appears to be an obscure bug that happens when template parameters are separated with nonstandard space characters such as the non-breaking space, &nbsp;. We are tracking the bug on Phabricator, but in the meantime, if you erase the whitespace characters in the template call using these characters, it should avoid this problem. See: edit removing characters; subsequent normal edit. Harej (talk) 22:17, 1 February 2023 (UTC)
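Editors who want to apply the workaround above first need to find the affected template calls. The following is a rough sketch of such a check, not part of the bot: the name `templates_with_nbsp` is hypothetical, and the pattern matches only innermost `{{...}}` calls, so a template that wraps other templates is not reported as a whole.

```python
import re

# Innermost template calls only: no braces allowed between {{ and }}.
INNER_TEMPLATE = re.compile(r"\{\{[^{}]*\}\}")

# The literal U+00A0 character and the HTML entity form.
NBSP_FORMS = ("\u00a0", "&nbsp;")


def templates_with_nbsp(wikitext):
    """Return the innermost template calls that contain a non-breaking space."""
    return [
        match.group(0)
        for match in INNER_TEMPLATE.finditer(wikitext)
        if any(form in match.group(0) for form in NBSP_FORMS)
    ]
```

The sketch only reports candidates. Whether a given non-breaking space is safe to remove (for example inside a title) is left to the editor.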
This section was archived on a request by: Harej (talk) 14:58, 10 February 2023 (UTC)

False positives

Yesterday I ran through a number of edits by InternetArchiveBot to some of the pages on my watchlist at the English Wikivoyage, trying to update the links whenever possible. On three occasions, the links marked as dead seemed to work fine, so I just removed the tags (1, 2, 3), thinking that the websites might just have been temporarily offline when the bot checked. However, in a matter of hours, the bot put the tags back (1, 2, 3), despite links working fine the last time I checked again, after the tags were restored. As far as I could notice, what these three websites had in common is that they opened with pop-ups of sorts at the time of checking, so that might have to do with these false positives.

Any possibility to fix this? Vidimian (talk) 14:26, 28 January 2023 (UTC)

@Vidimian: the first and third links are dead from where I'm sitting. The second one is caused by a geo-restriction rule on this CloudFlare enabled site. I have whitelisted atus.konya.bel.tr.—CYBERPOWER (Chat) 16:20, 3 February 2023 (UTC)
Thanks. It's interesting that I can access the first and third links alright as I'm typing this. I'm not techie enough to understand clearly, but out of your words I guess they are not accessible outside Turkey. Perhaps I should look for a workaround by substitute websites that are equally informative about the subjects on hand and compliant with the external links policy of Wikivoyage. Vidimian (talk) 00:36, 4 February 2023 (UTC)
This section was archived on a request by: Harej (talk) 14:59, 10 February 2023 (UTC)

Please check edit

Hi there! Could you please review this edit and make any appropriate changes to the article? Thanks! GoingBatty (talk) 06:31, 1 February 2023 (UTC)

@GoingBatty: This is a case of GIGO. Can't really fix the bot for this one. But I can fix the URL.—CYBERPOWER (Chat) 16:22, 3 February 2023 (UTC)
This section was archived on a request by: Harej (talk) 14:59, 10 February 2023 (UTC)

Bot cause issues for already added correct archive links in Wiki Farsi

Hi! The bot is harming all of my nicely created links. When I work with old information or newspapers from unstable sources, I usually add the archive link myself. Your bot keeps following me, checking my links, and finding a way to mess with my already working links or their archived links. Please check this link as one of many examples and let me know if I should provide more information. https://fa.wikipedia.org/w/index.php?title=%D8%AA%DB%8C%D9%85_%D9%85%D9%84%DB%8C_%D9%88%D8%A7%D9%84%DB%8C%D8%A8%D8%A7%D9%84_%D8%B2%D9%86%D8%A7%D9%86_%D8%A7%DB%8C%D8%B1%D8%A7%D9%86&diff=35498746&oldid=35462802 Aerospc (talk) 14:44, 1 February 2023 (UTC)

@Aerospc: Archive URLs need to go to the archive-url fields on the cite templates. Their original URL counterparts need to go in the 'url' field. This is so the citation is rendered correctly. The bot is simply enforcing this. If you look at the rendered page, you will see that the original URL does not render.—CYBERPOWER (Chat) 16:27, 3 February 2023 (UTC)
This section was archived on a request by: Harej (talk) 15:00, 10 February 2023 (UTC)

503 Service Unavailable

What should we do, generally, when the bot's page throws this error? Does it need to be reset? (old thread) czar 22:37, 11 February 2023 (UTC)

Looks like the answer is to restart the management interface webservice if this happens again czar 23:06, 11 February 2023 (UTC)
This section was archived on a request by: czar 23:06, 11 February 2023 (UTC)

Interface down for maintenance or crashed?

I was running en:List of awards and nominations received by Blackpink yesterday for 30+ minutes, which is longer than expected and isn't normal in my experience. (After the makeshift fixes, it would run for 10 minutes at most, which is still long compared to the good old days, but at least it was running.) After that I was shown a white screen (no archiving was applied to the article, as seen in the article's contribs), and I haven't been able to visit the interface website since then; it has been down since yesterday. Is this a bug caused by the makeshift fixes applied previously, or did I crash it by running a huge article through the interface website? Paper9oll (talk) 16:05, 13 February 2023 (UTC)

As of 14 February, I'm getting this error DB ERROR : QUERY: CREATE DATABASE IF NOT EXISTS s51059__cyberbot; ERROR - : Error encountered while creating the database. Exiting... when visiting the website, this is the same error as reported previously with User talk:InternetArchiveBot/Archives/2023#IABot throwing 503 error and tracked with task T327851. Paper9oll (talk) 10:31, 14 February 2023 (UTC)
This section was archived on a request by: Paper9oll (talk) 05:53, 18 February 2023 (UTC)

Allowing GhostArchive?

Can GhostArchive be added to the code as a second alternative to the current default, the Wayback Machine, and the existing alternative, Archive.today? GhostArchive is able to properly archive troublesome websites that use lots of JavaScript, which both the Wayback Machine and Archive.today have trouble with, at times or all of the time for certain websites; for example, they archive only the header and footer while the body content is whitespace because it relies on JavaScript. Paper9oll (talk) 09:37, 12 January 2023 (UTC)

Paper9oll, while InternetArchiveBot primarily queries archive.org, we do look at other archive providers as well. Pinging GreenC: would GhostArchive be a good fit for Wayback Medic? Harej (talk) 21:31, 18 January 2023 (UTC)

Ghost is good for end-users who want to archive a troublesome site that has a lot of JS. The bot can't determine what a troublesome site is, so it would be up to end-users to create the archive via the Ghost website, then add the archive URL into Wikipedia. Once in Wiki, IABot will detect it and add it to its database, so if it ever encounters that URL elsewhere it will be inserted into Wikipedia. The user who is most active adding Ghost archives is Rlink2 on enwiki. I'll add some users find Ghost controversial because it's small and run by 1 person of unknown origin. -- GreenC (talk) 20:41, 25 January 2023 (UTC)

@Harej and GreenC Apologies for the late reply; I was waiting for the interface to come back up, which it finally did today. I'm actually referring to the "Modify URL Data" page, where adding a GhostArchive archived URL is disallowed, which in turn means that no changes are made. This does not happen for archive.today URLs that are manually added there. The bot also doesn't actually add GhostArchive archived URLs that are already in the article to the database, but instead overwrites them with useless Wayback Machine ones (useless because they basically don't work, as the Wayback Machine can't handle the amount of JS). The only way to prevent this is to use the Cbignore template, which isn't sustainable: I can't be the only person adding the Cbignore template to one article while other articles (I won't know which have it and which don't) use the same troublesome site without the Cbignore template, causing inconsistent archive URLs from article to article for the same citation. Paper9oll (talk) 15:09, 2 February 2023 (UTC)
Thank you for clarifying, Paper9oll. We are tracking the issue on Phabricator and you can follow there for updates. Harej (talk) 21:21, 22 February 2023 (UTC)
This section was archived on a request by: Harej (talk) 21:21, 22 February 2023 (UTC)

work in uk-wiki

The edit. Firstly, replacing of archived links to books and scientific articles with original+archived links should not be done at all: these pages can not change, so it is useless cluttering of the text and inserting of links which eventually will be dead. Secondly, even this unwanted work is made incorrectly: the bot inserts empty parameter "назва", and it suppresses display of the parameter "заголовок". Sneeuwschaap (talk) 01:14, 4 February 2023 (UTC)

Sneeuwschaap, thank you for your report. There are two parts to this. First, there is the moving of the archive link. This is expected—archive links go in the "archive link" parameter, rather than the chapter link. That should be reserved for the original URL. The second part is the hiding of the source title. That was a misconfiguration on the bot's part – it was configured to recognize заголовок but not Заголовок (with capital З). This has now been fixed and a test confirms this. Harej (talk) 21:54, 15 February 2023 (UTC)
Thank you for the second part. Regarding the first, I think that such approach is an empty formality which gives no advantages and a modest disadvantage (cluttering of the text). Sneeuwschaap (talk) 22:28, 15 February 2023 (UTC)
This section was archived on a request by: Harej (talk) 21:23, 22 February 2023 (UTC)

Unable to use IA Bot

Hello. I was trying to use the Internet Archive Bot in meta wiki. So, I clicked the run bot on a single page option but it shows that the action I request to perform needs the analyzepage permission. Can you help me please? The Abnormal Guy (talk) 11:59, 5 February 2023 (UTC)

The Abnormal Guy, you won't be able to use the tool on Meta-Wiki until you have made a total of ten edits. Harej (talk) 21:26, 22 February 2023 (UTC)
This section was archived on a request by: Harej (talk) 21:11, 1 March 2023 (UTC)

Template parameter numbering

Apparently the bot doesn't parse unnamed parameters to templates correctly, see this edit. 70.172.194.25 10:00, 6 February 2023 (UTC)

Thank you for your report; we are tracking this issue on Phabricator. Harej (talk) 21:33, 22 February 2023 (UTC)
This section was archived on a request by: Harej (talk) 21:11, 1 March 2023 (UTC)

Missing language codes on en.wikt

Hello. This diff by InternetArchiveBot resulted in this version of the page having an error message due to a missing language code, which I have just corrected in this diff. Would it please be possible to take these into account going forward? Happy to advise if you have any questions. Theknightwho (talk) 15:12, 10 February 2023 (UTC)

In the particular diff you cited, it shouldn't even be using {{quote-web}} at all; instead it should be using {{cite-web}}, which doesn't require a language code and so the issue you mentioned would have been entirely avoided. (See next section.) 70.172.194.25 01:52, 14 February 2023 (UTC)

Following up in subsequent section. Harej (talk) 21:11, 1 March 2023 (UTC)

This section was archived on a request by: Harej (talk) 21:11, 1 March 2023 (UTC)

It didn't work on archiving?

Hi, I am trying to archive links on en:MARTA rail, but the bot isn't working. CastJared (talk) 20:34, 13 February 2023 (UTC)

CastJared, can you describe what happens when you try to use the bot? Harej (talk) 21:43, 22 February 2023 (UTC)
This section was archived on a request by: Harej (talk) 21:27, 1 March 2023 (UTC)

Reporting a false positive

I would like to report a false positive: the bot tried rescuing an archived link that was already working fine. Conifer archive link. Conifer, like Ghost archive, is one of the YouTube video archiving sites for Wikipedia citations. I reverted the bot's edit and am notifying the devs about it here. Qwerty284651 (talk) 18:37, 19 February 2023 (UTC)

Qwerty284651, thank you for letting us know. It looks to be the new name of Webrecorder, an archive provider that is supported, but under its old name and domain. We will update the bot accordingly. Harej (talk) 21:48, 22 February 2023 (UTC)
@Harej:, thank you for responding swiftly. Also, I found a similar phabricator located here. Qwerty284651 (talk) 00:45, 23 February 2023 (UTC)
This section was archived on a request by: Harej (talk) 21:28, 1 March 2023 (UTC)

Error in Turkish Wikipedia

In Turkish Wikipedia, if we add an off-site link, this bot writes a sentence next to the link saying "Archived on November 15, 2013 at the Wayback Machine". This happens in the actual text, not in the bibliography. I have come across this many times, for example:

https://tr.wikipedia.org/wiki/IV._A%C4%9Fa_Han

https://tr.wikipedia.org/wiki/III._A%C4%9Fa_Han

https://tr.wikipedia.org/w/index.php?title=%C3%82det&oldid=28742182

https://tr.wikipedia.org/w/index.php?title=Yerel_toplum&oldid=28739553 Fathylmz (talk) 11:19, 20 February 2023 (UTC)

Fathylmz, the bot is configured this way per local policy to not replace URLs with archive links, but rather to add them alongside. To prevent this, the bot would have to be disabled from editing outside of the references section, and this requires community consensus. Harej (talk) 21:56, 22 February 2023 (UTC)
This section was archived on a request by: Harej (talk) 21:28, 1 March 2023 (UTC)

Bot is defaulting to the wrong template on en.wikt

I've realised that InternetArchiveBot is using {{quote-web}} in places where {{cite-web}} should be used. The former requires a language code, because it's used for attestations. However, it shouldn't be used in reference sections, as that's what the latter is for. {{cite-web}} does not require a language code. This diff is an example of the wrong template being applied.

Could the bot please be fixed in the following way?

  1. Use {{quote-web}} only when the wikitext line begins with the regex #+\*+ (i.e. #*, ##*, ###*, ##** etc). That is the only context in which it is allowed to be used.
  2. Otherwise, use {{cite-web}}.

Theknightwho (talk) 14:59, 11 February 2023 (UTC)

{{quote-web}} should probably be the default for Citations namespace, though, even when the line doesn't start with #. 70.172.194.25 01:51, 14 February 2023 (UTC)
Agreed - I had forgotten about that. For completeness:
  • In the main namespace, use the behaviour I describe above.
  • In the Citations namespace, always use {{quote-web}} unless enclosed in ref tags.
  • In any other namespaces, always use {{cite-web}} (likely only going to come up in the Reconstruction namespace, where academic citations are common, but quotations never occur as this would cause the term to be moved into the main namespace). Theknightwho (talk) 19:30, 21 February 2023 (UTC)
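The rules proposed above can be written down as a small decision function. This is a sketch of the proposal only, not of how the bot is actually configured: the function name `choose_template` and its `namespace` and `in_ref` parameters are hypothetical, and only the regex `#+\*+` comes from the request itself.

```python
import re

# A quotation line starts with one or more '#' followed by one or more '*'.
QUOTE_LINE = re.compile(r"#+\*+")


def choose_template(line, namespace="main", in_ref=False):
    """Pick the template name according to the rules proposed in this thread."""
    if namespace == "main":
        # re.match anchors at the start of the line, as the proposal requires.
        return "quote-web" if QUOTE_LINE.match(line) else "cite-web"
    if namespace == "Citations":
        # Always quote-web here, unless the link is enclosed in ref tags.
        return "cite-web" if in_ref else "quote-web"
    # Any other namespace (for example Reconstruction).
    return "cite-web"


if __name__ == "__main__":
    for sample in ("#* quotation line", "##** nested quotation", "* reference line"):
        print(repr(sample), "->", choose_template(sample))
```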
Theknightwho, the bot only operates in the main namespace, and it is now configured to use cite-web and not quote-web. See test edit confirming this. Harej (talk) 21:43, 22 February 2023 (UTC)
@Harej: Thanks. It still seems to be adding the wrong template, however (diff). I spotted this due to the error. Theknightwho (talk)
Theknightwho, the configuration change is being overridden by this configuration page. Specifically "webpage" needs to be changed to the preferred template. This needs to be done by an administrator. This will also change the default template used by the "Cite" button in the visual editor. Harej (talk) 21:25, 1 March 2023 (UTC)
@Harej: Thanks - I'll do it. Theknightwho (talk) 21:28, 1 March 2023 (UTC)
This section was archived on a request by: Harej (talk) 21:04, 8 March 2023 (UTC)

Bot keeps tagging links as dead even though they've been reported as false positives

This is happening on en.Wikivoyage, the targeted links are to sub-pages of the site of the Bulgarian National Railways: https://en.wikivoyage.org/w/index.php?title=Bulgaria&curid=5063&diff=4624656&oldid=4623337&diffmode=source It's possible that the site blocks bots, or that something in the nature of those pages confuses the bot. I've reported the URLs via the "false positives" thing on Toolforge, but without any effect - the bot tagged the same links just now. Daggerstab (talk) 13:10, 25 February 2023 (UTC)

Daggerstab, thank you for your report. We think this false positive was caused by the website being georestricted. Your submission as a false positive should have prevented the bot from continuing to act on these links, but it didn't, so that's a bug. We are tracking it on Phabricator. Harej (talk) 21:47, 1 March 2023 (UTC)
This section was archived on a request by: Harej (talk) 21:06, 8 March 2023 (UTC)

Bot adding unnecessary archivedate parameter on en.wikt

See diff. On the English Wiktionary, {{quote-web}} and {{cite-web}} both automatically generate archivedate using the Internet Archive URL. Could you set the bot so that the archivedate parameter is not used when archiveurl begins with https://web.archive.org? Ioaxxere (talk) 00:32, 28 February 2023 (UTC)

Ioaxxere, there needs to be an archive date parameter in the template call because the bot can't selectively drop "archivedate" – it always either does it across the board, for all archive providers (including those that do not have dates in the URL), or not at all. Harej (talk) 21:54, 1 March 2023 (UTC)
This section was archived on a request by: Harej (talk) 21:07, 8 March 2023 (UTC)
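For readers following the thread above: the request relies on the fact that a Wayback Machine URL carries its snapshot timestamp in the path. The sketch below shows how a date could be read from such a URL. It is only an illustration under the assumption that the URL has a 14-digit timestamp right after /web/; the function name wayback_archive_date is hypothetical, and this is not the code used by the en.wikt templates or by the bot.

```python
import re
from datetime import datetime
from typing import Optional

# Assumed URL shape: https://web.archive.org/web/<14-digit timestamp>/<original URL>
# An optional lowercase modifier (for example "id_") may follow the timestamp.
_WAYBACK_RE = re.compile(r"^https?://web\.archive\.org/web/(\d{14})[a-z_]*/")


def wayback_archive_date(archive_url: str) -> Optional[str]:
    """Return the snapshot date as YYYY-MM-DD, or None if no timestamp is found."""
    match = _WAYBACK_RE.match(archive_url)
    if match is None:
        # Other archive providers may not encode a date in the URL at all.
        return None
    try:
        stamp = datetime.strptime(match.group(1), "%Y%m%d%H%M%S")
    except ValueError:
        # Fourteen digits that do not form a valid date and time.
        return None
    return stamp.date().isoformat()
```

A URL from another provider returns None here, which matches the point made in the reply: not every archive provider puts a date in the URL, so a date parameter may still be needed for those.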

Why replace warp.da.ndl.go.jp with web.archive.org?

In en:Special:Diff/1142300795 for en:Japanese community of Düsseldorf, InternetArchiveBot replaced "archive-url=" from warp.da.ndl.go.jp to web.archive.org without explaining why. The archive WARP (https://warp.da.ndl.go.jp/?_lang=en) is a legitimate archive operated by the en:National Diet Library of Japan, and the particular archive page https://warp.da.ndl.go.jp/info:ndljp/pid/9597364/www.dus.emb-japan.go.jp/profile/japanisch/j_wirtschaft/j_DUS.htm is valid and visible. If such substitutions are really necessary, please make the edit summary more specific, such as "Archive with English headers are preferred", "Third party archives are preferred over governmental sites" etc. The action of rewriting "url-status=live" to "url-status=bot: unknown" for a live link is also puzzling. -- Wotheina (talk) 13:28, 2 March 2023 (UTC)

Presumably the actual situation is that the bot had never heard of the archive so thought it was an invalid URL, not that it has an opinion on its merits. * Pppery * it has begun 03:51, 4 March 2023 (UTC)
Wotheina, as Pppery explained the bot hasn't been configured to support that archive provider. We will work on adding support; you can track progress on Phabricator. Harej (talk) 21:27, 8 March 2023 (UTC)
@Harej: Thanks (BTW the correct URL on Phabricator is https://phabricator.wikimedia.org/T331570 ). For such cases, please add to the edit summary something like "Removed unknown archive". Regarding the "url-status=" field here, I suggest not touching it, because "url-status=bot: unknown" obstructs displaying the live (non-archived) link for no good reason, adding an extra burden to verify dubious edits such as en:Special:Diff/1143488072 in en:Japanese Mexicans. --Wotheina (talk) 06:57, 9 March 2023 (UTC)
This section was archived on a request by: Harej (talk) 20:14, 15 March 2023 (UTC)

Parameter question

Is it standard practice, when archiving a ref, to add the generic "Archived copy" as the title? (eg: here). Along with being inaccurate, it also creates a "script warning" (in preview mode) and a notice (in the ref list) such as: "Script warning: One or more {{cite web}} templates have maintenance messages". Are there any other options for this? Thanks Thewolfchild (talk) 15:48, 8 March 2023 (UTC)

Thewolfchild, this isn't an error, it's just a maintenance message to keep track of references that are titled "Archived copy". It does not mean there is anything wrong with the reference. Harej (talk) 21:32, 8 March 2023 (UTC)
@Harej: I didn't say it was/or created an "error", nor did I say there was "anything wrong with the reference". I asked if it was standard practice for the bot to add "Archived copy" to the title parameter. I wanted to know if there are other options for this action. Also, to what purpose does it keep track of these refs? Thanks again Thewolfchild (talk) 20:22, 15 March 2023 (UTC)
Thewolfchild, it is. This setting is set in the "cite defaults" page, a configuration page only accessible to the tool operators. I believe User talk:Trappist the monk requested the tracking. Harej (talk) 20:45, 15 March 2023 (UTC)
Thanks for the reply, I'll drop Ttm a note and see what more info I can find. Thanks again Thewolfchild (talk) 04:03, 17 March 2023 (UTC)
This section was archived on a request by: Harej (talk) 21:21, 17 March 2023 (UTC)

IABot causing massive havoc to multiple articles due to double archiving error

IABot (the bot itself and also through the interface) is doing double archiving as seen here, here, here, here, here, here, and here, even though multiple citations in those articles were already archived with filled {{archive-date=|archive-url=|url-status}} as seen in the diffs. Not only the linked articles were affected; more than 50 other articles were also affected, based on my quick scan of en:Special:Contributions/InternetArchiveBot. Also, as seen in the diffs, it seems that IABot is adding a {{Webarchive ...}} template instead of appending to the existing {{Cite web ...}} or its other variations, which is abnormal. Paper9oll (talk) 15:45, 9 March 2023 (UTC)

Paper9oll, this was caused by a brief operational problem which has since been addressed. Can you confirm? Harej (talk) 19:54, 15 March 2023 (UTC)
@Harej Yes, this has been resolved. Paper9oll (talk) 14:57, 16 March 2023 (UTC)
This section was archived on a request by: Harej (talk) 21:21, 17 March 2023 (UTC)

Archive date in Farsi Wikipedia

Recent changes made the references red, the error is "Check date values in: |archive-date". Salome mi (talk) 11:23, 25 February 2023 (UTC)

Salome mi, please link to a page where this is happening. Harej (talk) 21:30, 1 March 2023 (UTC)
Sure, I already fixed it though: https://fa.wikipedia.org/w/index.php?title=%DA%A9%D8%B1%D8%B4_%28%D8%B1%D9%88%D8%AF%29&diff=36633143&oldid=36271374&diffmode=source Salome mi (talk) 21:47, 1 March 2023 (UTC)
Salome mi, can you confirm that, in citation templates on Farsi Wikipedia, the dates should use English formatting (e.g. "12 October 2022") instead of Farsi (e.g. "۱۲ اكتبر ۲۰۲۲")? Harej (talk) 21:17, 8 March 2023 (UTC)
This section was archived on a request by: Harej (talk) 01:42, 22 March 2023 (UTC)

Which pages does the Wayback Machine archive?

Hello, I apologize for writing in Czech. I would like to ask whether you could answer a question from Czech Wikipedia on the page wikipedia:cs:Wikipedie:Pod lípou (technika)/Archiv 2022-1#Archivace URL opět (pinging @Draceane). I don't know whether this is something you are familiar with, but I am at least trying and asking. We would like to know whether you know by which criteria the Wayback Machine saves external links from Wikipedia and its sister projects. Does it save only links from articles, or also from talk pages, or from other namespaces as well? Does it also pick up links from templates, for example?

And would you know why it sometimes saves external links from articles and sometimes does not? For example, wikipedia:cs:Kateř Tureček: some of its external links were not saved in the Wikipedia Eventstream collection.

Or the page https://www.bbc.com/news/election-2019-50770798 is not in the Wikipedia Eventstream collection, yet it appears in the article wikipedia:cs:Diagram rozdělení voleb. I can track down further examples.

Thank you for your answer, if you know. Marek Genius (talk) 18:12, 1 March 2023 (UTC)

Marek Genius, I am not sure if this answers your question, please let me know if it doesn't, but the Wayback Machine is crawling every external link posted to every Wikimedia wiki. It tries to save external links where it can, but if a site goes down before the bot gets to it, or something prevents any of the crawlers from getting to it, then it won't archive the link. It may also be possible for something to not end up in EventStream for whatever reason. Harej (talk) 22:03, 1 March 2023 (UTC)
@Harej: Does it apply for User namespace, too? — Draceane talkcontrib. 16:13, 15 March 2023 (UTC)
Draceane, not automatically, but you can request specific pages through the management interface. Harej (talk) 20:00, 15 March 2023 (UTC)
This section was archived on a request by: Harej (talk) 18:56, 22 March 2023 (UTC)

Add 1 book for Verifiability

I wonder where edits like this are documented and configured. I don't think we originally approved this feature in our wiki, and the IABot interface doesn't seem to provide a way to switch it off without disabling the bot altogether. Book references don't necessarily need a URL, especially if this URL is to a limited preview (subscription item) such as in this example. @GreenC --Pikne 12:15, 28 February 2023 (UTC)

Pikne, it is an option we have enabled for some wikis to add links to previews of books/print resources, in addition to fixing dead links. I can see this task wasn't approved on Estonian Wikipedia so it has been shut off there. Harej (talk) 21:09, 8 March 2023 (UTC)
This task appears to be still active: et:Special:Diff/6353198. Pikne 20:27, 21 March 2023 (UTC)
Pikne, that thread has been shut off as well (the second half of what I described above). Harej (talk) 01:43, 22 March 2023 (UTC)
This section was archived on a request by: Harej (talk) 01:56, 26 March 2023 (UTC)

Empty ref name

The bot is causing havoc (1, 2) and making a mess of references: when "optimizing" empty-named (<ref name="">) references, it deletes the content of all same-named references without even checking whether that content is the same. Any ideas how to avoid such bot activity? Be112 (talk) 20:10, 11 March 2023 (UTC)

Thank you for your report Be112. We are tracking the issue on Phabricator. Harej (talk) 20:24, 15 March 2023 (UTC)
@Harej: You're welcome. The task name given probably does not fully convey the problem, which is much wider. It would probably be better to name it: When two (or more) different references, all named "" (empty string), exist in an article, the bot removes all of them except the first one, leaving an error at each of the later ones. In the given example there are 3 such references and only 2 of them were deleted (the first occurrence, "''[[Аляксандр Уладзіміравіч Піскуноў...", was left intact). Be112 (talk) 00:07, 18 March 2023 (UTC)
This section was archived on a request by: Harej (talk) 01:56, 26 March 2023 (UTC)
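To make the reported problem concrete, here is a minimal Python sketch of one way to decide which named references are safe to treat as the same reference. It assumes that an empty name (<ref name="">) should be handled like no name at all and never merged; the function name mergeable_ref_names and the regular expression are illustrative only and are not taken from the bot's code.

```python
import re
from collections import Counter
from typing import Set

# Matches an opening <ref ...> tag with a name attribute, in double quotes,
# single quotes, or unquoted. This is a simplification of real wikitext parsing.
_REF_NAME_RE = re.compile(
    r"<ref\s+[^>]*?name\s*=\s*(?:\"([^\"]*)\"|'([^']*)'|([^\s/>]+))",
    re.IGNORECASE,
)


def mergeable_ref_names(wikitext: str) -> Set[str]:
    """Return the non-empty ref names that occur more than once in the text."""
    counts = Counter()
    for match in _REF_NAME_RE.finditer(wikitext):
        # Exactly one of the three groups is set; pick it.
        name = next((g for g in match.groups() if g is not None), "").strip()
        if name:
            # Empty names are skipped, so they can never be grouped together.
            counts[name] += 1
    return {name for name, seen in counts.items() if seen > 1}
```

With this approach, two references that both carry name="" stay separate, while genuinely repeated names are still reported as candidates for reuse. Whether their content is actually identical would still need to be checked before merging.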

Dpreview

dpreview.com is going down. We've got about 1700 links to the site. Would it be possible to make sure they are all archived? Geni (talk) 01:39, 22 March 2023 (UTC)

Geni, I can confirm that work is being done to preserve this site. Harej (talk) 01:41, 22 March 2023 (UTC)
This section was archived on a request by: Harej (talk) 15:05, 4 April 2023 (UTC)

parameter duplication

The bot ignores an existing primary-named template parameter and adds a duplicate under an alternative name. For example, here it adds |accessdate=13 сакавіка 2023|archiveurl=https://web.archive.org/web/20100324052855/http://starlife.com.ua/posts/kto-ona-vse-ob-alyosha-alesha-predstavitele-ukrainyi-na-evrovidenii-2010-biografiya-foto--4112.html|archivedate=24 сакавіка 2010|deadurl=yes to a "cite web" template where archive-date=2010-03-24 is already present, which sometimes leads to duplicated values in the output. Although the accompanying template error has been resolved, ignoring an existing parameter (neither changing nor deleting it) while adding a same-meaning parameter is bad bot behaviour. Be112 (talk) 18:49, 26 March 2023 (UTC)

Be112, we identified the template configuration problem causing this, and it should now be fixed. Harej (talk) 20:43, 27 March 2023 (UTC)
This section was archived on a request by: Harej (talk) 15:05, 4 April 2023 (UTC)
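The behaviour asked for in this thread can be sketched as an alias check before a parameter is added. The Python below is a minimal illustration, assuming a small hand-written alias table; the names ALIAS_GROUPS and set_param are hypothetical, and real wikis define their own alias lists in their citation template configuration.

```python
from typing import Dict, FrozenSet, List

# Hypothetical alias table: names in the same group are assumed to mean the
# same thing to the citation template.
ALIAS_GROUPS: List[FrozenSet[str]] = [
    frozenset({"archive-url", "archiveurl"}),
    frozenset({"archive-date", "archivedate"}),
    frozenset({"access-date", "accessdate"}),
]


def _alias_group(name: str) -> FrozenSet[str]:
    """Return the alias group containing name, or a group of just that name."""
    for group in ALIAS_GROUPS:
        if name in group:
            return group
    return frozenset({name})


def set_param(params: Dict[str, str], name: str, value: str) -> Dict[str, str]:
    """Return a copy of params with the value set, reusing any existing alias."""
    group = _alias_group(name)
    updated = dict(params)
    for existing in params:
        if existing in group:
            # An equivalent parameter is already present: update it in place
            # instead of adding a second, differently spelled one.
            updated[existing] = value
            return updated
    updated[name] = value
    return updated
```

Updating the spelling that is already on the page is a design choice made for this sketch; leaving the existing value untouched would avoid the duplication just as well.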

WP-uk

Turning images into gibberish on WP-uk, e.g. uk:Емінак Kwamikagami (talk) 15:58, 4 April 2023 (UTC)

This edit unnecessarily added archive version of museum website!!! Or is there any policy to add archive versions by default? రుద్రుడు (talk) 16:29, 4 April 2023 (UTC)
రుద్రుడు, these edits were made over a year ago. The bug occurring in these edits has since been solved. Harej (talk) 20:44, 5 April 2023 (UTC)
Kwamikagami, see my response in the line above. Harej (talk) 21:05, 5 April 2023 (UTC)
Thanks! Kwamikagami (talk) 21:09, 5 April 2023 (UTC)
This section was archived on a request by: Harej (talk) 21:36, 5 April 2023 (UTC)

Trying to archive

Hi, I keep on getting trapped on Google Chrome, which is "This site can’t be reached". Is there any problem? CastJared (talk) 17:59, 29 March 2023 (UTC)

CastJared, it is possible the management interface was down while you were trying to load it, but it should be online now. Harej (talk) 20:23, 5 April 2023 (UTC)
This section was archived on a request by: Harej (talk) 20:05, 12 April 2023 (UTC)

Question

Hello,
Why isn't IABot active for suwiki? Ariandi Lie (talk) 13:02, 30 March 2023 (UTC)

Ariandi Lie, only because we have not gotten to it yet. If you would like, we can work on setting the bot up on that wiki. Harej (talk) 20:29, 5 April 2023 (UTC)
Ariandi Lie, the bot has begun editing; please review the bot's contributions. Harej (talk) 21:35, 5 April 2023 (UTC)
This section was archived on a request by: Harej (talk) 20:05, 12 April 2023 (UTC)

AIbot

I try to use the AIbot but am met by this text: "The action you are trying to perform requires the analyzepage permission. This permission is obtainable with the following groups: basicuser, user, admin, root, bot". After that I cannot get anywhere where I can confirm use of the bot. BabbaQ (talk) 20:16, 30 March 2023 (UTC)

BabbaQ, make sure the correct wiki is selected from the dropdown on the top-right. You may have a wiki selected where you do not have sufficient edits. Harej (talk) 20:32, 5 April 2023 (UTC)
This section was archived on a request by: Harej (talk) 20:05, 12 April 2023 (UTC)

email confirmation not receiving

I updated my email ID (it ends with listru) in the Email preferences section of User preferences, but I have never received the email to confirm it. I checked the spam folder as well. రుద్రుడు (talk) 02:27, 31 March 2023 (UTC)

రుద్రుడు: the system responsible for sending emails is not currently working. Harej (talk) 20:33, 5 April 2023 (UTC)
This section was archived on a request by: Harej (talk) 20:05, 12 April 2023 (UTC)

Error 503 when trying to access IABot site

Hi, when I try to access the IABot site I receive an Error 503 (service unavailable). Just thought I'd report it here because I hadn't seen it reported here yet. XtraJovial (talk) 17:14, 6 April 2023 (UTC)

This section was archived on a request by: Harej (talk) 20:24, 12 April 2023 (UTC)

"Bad title: The page you entered is invalid or doesn't exist. Please check your spelling and try again."

For the last few days, I have been receiving this error every time I try to run the Bot. Any ideas on what is going on? « Gonzo fan2007 (talk) @ 17:09, 10 April 2023 (UTC)

Gonzo_fan2007, what text have you been inputting? Harej (talk) 20:22, 12 April 2023 (UTC)
Harej I have been trying to submit a number of different articles: I tried Bud Jorgensen, Bob Harlan and I even tried my sandbox at User:Gonzo_fan2007/sandbox. I honestly haven't been able to get the bot to run any article for a while. « Gonzo fan2007 (talk) @ 14:30, 13 April 2023 (UTC)
I just tried to do List of Green Bay Packers stadiums and get the same error message. Honestly, I can't get the bot to run a page for 3 days now. « Gonzo fan2007 (talk) @ 23:16, 13 April 2023 (UTC)
Have you checked that you are running on the correct wiki? There's a wiki dropdown on the menubar in the top right.—CYBERPOWER (Chat) 23:26, 13 April 2023 (UTC)
Fml, ya that was the problem. Not sure how it got switched! « Gonzo fan2007 (talk) @ 20:52, 14 April 2023 (UTC)
This section was archived on a request by: Harej (talk) 18:25, 17 April 2023 (UTC)

What is the easiest way to run on more than 100 articles

Is there any easy way to run on more than 100 articles as a single batch? రుద్రుడు (talk) 05:24, 30 March 2023 (UTC)

రుద్రుడు, you can submit a bot job to submit at least 500 articles. Make sure the correct wiki is selected from the drop-down box on the top right. Harej (talk) 20:26, 5 April 2023 (UTC)
User:Harej I thought there might be an easy way when I came across "submit a bot job". Typing 50 or more titles will be tedious. I tried using my watchlist URL, but it was not helpful. Now, there seem to be three ways: [1] make a list in a text file and submit it as a batch job; [2] copy and paste from the raw watchlist editor (alas, that requires adding all the pages I edit to my watchlist); [3] make a feature request to upload or parse the downloadable JSON file provided by xtools.
రుద్రుడు, if you go to Special:EditWatchlist/raw, that is probably your best option for now. Harej (talk) 20:17, 12 April 2023 (UTC)
This section was archived on a request by: Harej (talk) 19:16, 2 May 2023 (UTC)

"Add archives to all non-dead references" for jobs

Is there a way to make multi-page job batches behave like the "Add archives to all non-dead references" for single page runs is enabled? Apocheir (talk) 02:34, 6 April 2023 (UTC)

Apocheir, there is not. Multi-page job batches are made from the bot's account, and thus are subject to the wiki's bot policy and the terms of the bot's approval. Therefore there are no options to customize those jobs. Single-page analysis is run under your own account so you have more leeway to customize. Harej (talk) 20:21, 12 April 2023 (UTC)
This section was archived on a request by: Harej (talk) 19:16, 2 May 2023 (UTC)

Original links broken and "url-access=subscription" parameter question

On this edit the bot turned a slash in the original URL into a "%2F" on several references which apparently broke those original URLs, and since the references were behind a subscription paywall and had the "url-access=subscription" parameter the resulting archive links weren't actually useful in any way. Is there a way to either prevent the bot from modifying the original URLs or to skip references that have the "url-access=subscription" parameter since it's just going to archive the login screen for those instances? Aoidh (talk) 00:33, 9 April 2023 (UTC)

Thank you for your report, Aoidh. We have submitted a patch to IABot and this should be fixed soon. Harej (talk) 20:52, 12 April 2023 (UTC)
I see it's been resolved, thank you very much for taking the time to address this. - Aoidh (talk) 20:17, 17 April 2023 (UTC)
This section was archived on a request by: Harej (talk) 19:16, 2 May 2023 (UTC)
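The "%2F" symptom reported in this thread is what generic percent-encoding does to a slash when the slash is not declared safe. The Python sketch below only illustrates that mechanism with the standard library; it makes no claim about what caused the bug in IABot or how the patch fixed it, and the helper names naive_encode and preserve_url are hypothetical.

```python
from urllib.parse import quote

# Characters that act as delimiters inside a URL (RFC 3986 reserved set).
RESERVED = ":/?#[]@!$&'()*+,;="


def naive_encode(url: str) -> str:
    """Encode everything except ':' so that slashes become %2F (the symptom)."""
    return quote(url, safe=":")


def preserve_url(url: str) -> str:
    """Encode only unsafe characters, keeping delimiters and existing escapes."""
    # "%" is kept safe so that sequences such as %20 are not encoded twice.
    return quote(url, safe=RESERVED + "%")
```

Under these assumptions, naive_encode turns every "/" into "%2F" and changes what the URL points to, while preserve_url leaves the path structure and any existing escape sequences alone and only escapes characters such as spaces.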

schedule run info on various wikis

Where can I find info about IABot scheduled runs, for example on the whole kn wiki? I am planning to run a few hundred articles at once. It would be pointless if IABot was already run there recently or a couple of months ago. రుద్రుడు (talk) 07:53, 9 April 2023 (UTC)

రుద్రుడు, you can see the outstanding run status of the bot on the run pages. The bot doesn't run on a schedule. Rather, the bot cycles through an entire wiki in alphabetical order, and then starts again when it is done. Harej (talk) 20:54, 12 April 2023 (UTC)
This section was archived on a request by: Harej (talk) 19:16, 2 May 2023 (UTC)

"url-status=bot: unknown"

In en:Special:Diff/1149091736 for en:Jugemu, InternetArchiveBot wrongly rewrote "url-status=live" to "url-status=bot: unknown". This differs from the previously filed phabricator.wikimedia.org/T332221 "When incorrectly flipping unrecognized archive to archive.org, it converts link status from live to "bot unknown"" (report), because the new case happened with web.archive.org, a recognized archive. Again, I ask that the bot be made not to touch the "url-status=" field. Wotheina (talk) 07:00, 10 April 2023 (UTC)

Wotheina, if you change the status back, the bot will respect it from there. When the bot is editing an existing template, it can't know what the original context was, so it has to be later clarified. Harej (talk) 20:59, 12 April 2023 (UTC)
This section was archived on a request by: Harej (talk) 19:16, 2 May 2023 (UTC)

Wrong bot edits

Hello, I would like to report two cases where the bot is wrong:

  1. Wikipedia:it:Speciale:Diff/132884554: the original URL to the Microsoft website is dead, unlike what the bot thinks (moreover, the archive URL, strangely, works but in my opinion that's no reason to change it)
  2. Wikipedia:it:Speciale:Diff/132884599: as discussed here, the archived version on the Internet Wayback Machine is bugged (a JavaScript script automatically redirects the user to a 404 page after a few seconds), while the archived version at Ghost Archive is clean

Thank you in advance Luca Ghio (talk) 12:44, 15 April 2023 (UTC)

Luca Ghio, the bot has issues processing the "short form" Ghost Archive URL, so the longer form that spells out the total URL should be used instead and the bot will leave it alone. Harej (talk) 21:07, 3 May 2023 (UTC)
This section was archived on a request by: Harej (talk) 01:43, 11 May 2023 (UTC)

Question: queue bot on lynx

I am on Debian testing using lynx. When I tried to submit a few pages using "queue bot...", it failed with a network read error. Is it technically possible to use lynx for running queue bot? రుద్రుడు (talk) 08:56, 20 April 2023 (UTC)

రుద్రుడు, the IABot Management Interface does not support lynx. Harej (talk) 21:08, 3 May 2023 (UTC)
This section was archived on a request by: Harej (talk) 01:43, 11 May 2023 (UTC)

{Cite ___} parameters

Good work, bot. Including (apparently) scraping edit history to provide missing "access-date"s!

But: In Wikipedia, parameters of templates, such as {cite ___}, are supposed to have one space before each "|"(pipe), no space after any "|"(pipe), no space before or after any "="(equals_sign), and no space before any "}"(closing brace). (I am very comfortable with this format. For one, it puts the forced line-breaks in better places.) You create some of these the opposite way. I think every bot should add or change parameters in the preferred style that is used in most examples and human edits.

Also, I think these parameters, which you often add, would be added by humans in this natural order:
|url= |access-date= |url-status= |archive-url= |archive-date=

Thus, looking at an edit in which you inserted (appended) parameters thusly (alphabetically??):
|access-date = 11 September 2015|archive-date = 31 January 2015|archive-url = https://web.archive.org/web/20150131192706/http://test.com/folder/file.htm|url-status = dead,

I think you should have inserted them in this style and this order:
|access-date=11 September 2015 |url-status=dead |archive-date=31 January 2015 |archive-url=https://web.archive.org/web/20150131192706/http://test.com/folder/file.htm

I don't know how others feel about it. Many existing manual refs are styled and ordered as I have shown. Having variations is not a great thing; it makes reading tedious. These automatic edits are doing them differently than humans are likely to. I don't get to mass-produce my preferences and mistakes for better or for worse. I am sad to see a robot casually doing just that. I feel more obliged to nudge a bot than human editors, even very active ones, because of its numerous edits. I appreciate that a human(s) is behind these bots.

A next level would be to adjust (correct) the spacing of all existing parameters to the same style. (Someday maybe a project will standardize all ref parameters or all template parameters. But even this little reformatting of existing markup, mass-applied, might be controversial.)

A next level would be to change the order of existing parameters. One of my prefs is to always place |url= just before |access-date= (and everything else before that). Another pref is to always order |last= |first= |author-link= |last1= |first1= |author-link1= |last2= |first2= |author-link2= (and similar). (But I have no complete order in mind for all 99 parameters -- should |title= always precede |author=, or vice-versa?, etc.) (Someday maybe a project will standardize all ref parameters or all template parameters. But this degree of reordering existing markup might be controversial.) A876 (talk) 23:52, 20 April 2023 (UTC)

A876, the bot is programmed to respect the space formatting used in the page, so if you want the bot to use a certain style, have that style used consistently throughout the page. Rearranging parameter order is completely out of scope. Harej (talk) 21:13, 3 May 2023 (UTC)
This section was archived on a request by: Harej (talk) 01:43, 11 May 2023 (UTC)
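For illustration only, the spacing style and parameter order proposed in this thread (one space before each pipe, no spaces around "=", and the order |url= |access-date= |url-status= |archive-url= |archive-date=) can be written down as a short Python sketch. The names PREFERRED_ORDER and format_params are hypothetical, and this does not describe how the bot formats its edits; as stated in the reply, the bot follows the spacing already used on the page and does not reorder parameters.

```python
from typing import Dict, List

# Order taken from the list proposed in the thread; anything else keeps its
# original relative position after these.
PREFERRED_ORDER: List[str] = [
    "url", "access-date", "url-status", "archive-url", "archive-date",
]


def format_params(params: Dict[str, str]) -> str:
    """Render parameters as ' |name=value' in the proposed order and spacing."""
    ordered = [name for name in PREFERRED_ORDER if name in params]
    ordered += [name for name in params if name not in PREFERRED_ORDER]
    # One space before each pipe, none after it, none around the equals sign.
    return "".join(f" |{name}={params[name].strip()}" for name in ordered)
```

Applied to the four parameters quoted in the thread, this yields them as access-date, url-status, archive-url, archive-date, each preceded by a single space and a pipe.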

503 Service Unavailable

Hello, there is an issue when trying to use the IABot tool. It returns a 503 Service Unavailable error message. CastJared (talk) 11:48, 24 April 2023 (UTC)

Yes, unfortunately the problem persists. --Nyxaros (talk) 21:01, 25 April 2023 (UTC)
CastJared, Nyxaros, the interface should be up now. Harej (talk) 21:15, 3 May 2023 (UTC)
This section was archived on a request by: Harej (talk) 01:43, 11 May 2023 (UTC)

Error in pt.wikipedia

Hello.

The bot is replacing %20 in the URLs with spaces, making them invalid. Example: diff.

Regards. --Stegop (talk) 20:47, 28 April 2023 (UTC)

Another issue: here the bot added the "title" parameter but the synonymous "título" was already present. --Stegop (talk) 15:33, 30 April 2023 (UTC)
Stegop, a user enabled the bot for Portuguese Wikipedia without authorization. It has been turned off. Harej (talk) 21:21, 3 May 2023 (UTC)
This section was archived on a request by: Harej (talk) 01:43, 11 May 2023 (UTC)

Overzealous archiving at the Zeelandic test wiki

Hi, can you pleeeease disable your bot for this page? It just keeps on archiving a page that's alive and well at its present location. And what's more: it contains a script that automatically refreshes itself. It would be quite a rotten thing if you settled for an obsolete version of that page.
What I think your bot does is visit the page and thereby trigger the counter. This happens when the page is visited and the last refresh was at least six days ago. Scanning the entire database for entries takes quite a while, longer than your bot can wait. When its patience runs out, it draws the mistaken conclusion that the page is unavailable and goes looking for an archived version. But having to revert this manually over and over is quite galling. So, can you please stop it? Steinbach (formerly Caesarion) 21:12, 29 April 2023 (UTC)

Steinbach, according to our scan log, the website appeared to have gone down at a point. However, we manually scanned the page you linked, and the bot recognizes the link as being alive again. This should solve the problem. Harej (talk) 21:32, 3 May 2023 (UTC)
This section was archived on a request by: Harej (talk) 01:43, 11 May 2023 (UTC)

failed to archive

On te wiki Page, iabot failed to add archive "Office of the Registrar General & Census Commissioner, India - Village amenities of 2011" రుద్రుడు (talk) 13:46, 3 May 2023 (UTC)

రుద్రుడు: This edit should take care of it. Harej (talk) 21:49, 3 May 2023 (UTC)
This section was archived on a request by: Harej (talk) 01:43, 11 May 2023 (UTC)

Twitter

Is the bot archiving tweets? If not, please can it? I note that Elon Musk said recently that inactive accounts will be deleted; and some sources say this refers to any account with no activity for just 30 days. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 17:41, 14 May 2023 (UTC)

I've responded in a different location. Harej (talk) 21:30, 11 July 2023 (UTC)
This section was archived on a request by:  — billinghurst sDrewth 01:51, 12 July 2023 (UTC)

Is there a queue where users can place article requests?

Is there a queue where users can request that IABot process citations on the page? Recently, a set of glossary pages at the French Ministry of Justice went dark, and there are some pages (such as en:Glossary of French criminal law, and others) that rely heavily on the now missing pages. Can I pass you a set of article names to operate on, to add the archive links? How about just create User:InternetArchiveBot/Requests or similar, and I'll add the link as a bullet item, or something like that? Mathglot (talk) 04:13, 9 July 2023 (UTC)

Hi @Mathglot, as I don't expect the devs to respond soon: you can already do that here. Ennomien (talk) 10:41, 9 July 2023 (UTC)
@Ennomien: thank you so much! Mathglot (talk) 22:16, 9 July 2023 (UTC)
This section was archived on a request by:  — billinghurst sDrewth 01:45, 12 July 2023 (UTC)

Bad bot. Unsupervised bot.

1. With https://en.wikipedia.org/w/index.php?title=Non-specific_effect_of_vaccines&diff=prev&oldid=1163112453, the bot deleted my marking of https://www.who.int/immunization/sage/meetings/2014/april/3_NSE_Epidemiology_review_Report_to_SAGE_14_Mar_FINAL.pdf as a dead link. It IS a dead link. The other work that's part of that edit seems good.-RudolfoMD (talk) 03:08, 11 July 2023 (UTC)

@RudolfoMD: To address this one at enWP. The second ref had the same name as the first ref name=(2), so they were concatenated. If the second ref was yours, you should have applied a different name to it, eg. name="WHO (2)". The bot did nothing wrong. Fix your referencing to have a unique name for a unique reference.

2. The bot seems to be rather poorly supervised? Namely, it seems like most of the reports on this page have gone unaddressed. And the problem is presumably much worse than it seems, because SpBot archives all sections tagged with { {Section resolved|1=~ ~ ~ ~} } after 7 days. What should be done? RudolfoMD (talk) 03:08, 11 July 2023 (UTC)

The archiving bot only archives sections manually marked as resolved, so if they are not resolved, they should still be on this page. So I don't see the issue, that is simply a delay, an inaction, rather than an incorrect action.  — billinghurst sDrewth 03:40, 11 July 2023 (UTC)
This section was archived on a request by:  — billinghurst sDrewth 01:45, 12 July 2023 (UTC)

Question

Why is "Queue bot to run on multiple pages" access disabled for idwiki and suwiki? Ariandi Lie Talk with me 16:25, 7 May 2023 (UTC)

Ariandi Lie, the bot queues have now been enabled for those wikis. Harej (talk) 20:32, 11 July 2023 (UTC)
Thank you. Ariandi Lie Talk with me 02:11, 12 July 2023 (UTC)
This section was archived on a request by: Harej (talk) 20:03, 12 July 2023 (UTC)

Norwegian parameters

The bot seems to be ignoring Norwegian archive parameters, such as dødlenke, arkivurl and arkivdato. It adds url-status, archive-date, and archive-url where these exist. This causes error messages. See this diff and previous changes for an example. Is it possible to get it to accept the Norwegian parameters? Ranveig (talk) 09:52, 29 March 2023 (UTC)

This issue seems to be similar to the above one, "parameter duplication", which seems to have been resolved. Link here if that discussion gets archived. Ranveig (talk) 08:44, 5 April 2023 (UTC)
Ranveig, we fixed the citation template configuration to point to nn:Modul:Citation/CS1-nb/Configuration and that should solve the problem. Harej (talk) 20:22, 5 April 2023 (UTC)
Thank you so much for looking into this, as I couldn't figure out the problem! The bot now seems to have ignored "archiveurl" and "archivedate" a couple of times, but I have added those to the configuration, which should hopefully work. --Ranveig (talk) 04:13, 6 April 2023 (UTC)
The bot is now behaving well in relation to the Norwegian templates, but is still adding archivedate and archiveurl when archive-date and archive-url exist, mainly to template:Citation. Example. Ranveig (talk) 06:39, 7 April 2023 (UTC)
Ranveig, I notice that Template:Citation invokes Citation/CS1; if you change it to call Citation/CS1-nb instead that should fix it. Harej (talk) 20:08, 12 April 2023 (UTC)
Thanks Harej. I tried doing that, but the system is set up so that English templates like Citation, Cite web and so on use parameters in English, Norwegian-nb templates like Kilde... use parameters in that language, and Norwegian-nn like Kjelde... use parameters in that language. The Citation template simply gave too many unknown parameters when I tried using it with a different module. --Ranveig (talk) 05:33, 13 April 2023 (UTC)
Ranveig, is there a reason you couldn't merge those two modules together and have templates call from the same Lua module? CS1 is coded in a language-neutral way and this would reduce the complexity of this problem a lot. Harej (talk) 20:38, 2 May 2023 (UTC)
Harej, I have spent some time merging the modules to the best of my ability, and it seems to be working fairly well. I'd be happy for InternetArchiveBot to have another go as I monitor the error categories (and deal with old messes). Ranveig (talk) 07:12, 5 May 2023 (UTC)

I did a test run and it's still causing problems, ignoring "archive-date" and adding additional "archivedate" data. Examples: [3], [4]. Ranveig (talk) 07:21, 19 May 2023 (UTC)

Ranveig, this seems to be a bug, so we have opened a ticket. Thank you for your effort so far. Harej (talk) 20:24, 11 July 2023 (UTC)
This section was archived on a request by: Harej (talk) 20:44, 18 July 2023 (UTC)

Archive problems with sources behind paywall and a GoogleBook

InternetArchiveBot added useless archive links twice, reverted here and here.

  • The archived url for the ancestry link returns an error message, "Hrm. Wayback Machine has not archived that URL."
  • The archived url for the google book reference says only <meta property="

Can the program be modified to ignore paywalled sites such as Ancestry? Not sure what went wrong with the GoogleBook, but maybe you could look into that, too?

Thanks. Grand'mere Eugene (talk) 01:07, 9 May 2023 (UTC)

Grand'mere Eugene, regarding the first archive, we think it was most likely a valid archive at the point the archive was added to the bot's database, but was then taken offline by the Internet Archive. We have invalidated that particular archive.
The second one is more interesting. We have escalated that with the Wayback Machine team. Harej (talk) 21:07, 11 July 2023 (UTC)
This section was archived on a request by: Harej (talk) 20:44, 18 July 2023 (UTC)

Replacing 'webarchive'

Messy edit at da:Eidsborg diff. Two links have been handled:

  1. {{Citation}} with link to live text, with trailing Template:Webarchive was altered to archive data embedded into 'Citation' and deleted 'webarchive'
  2. {{Citation}} with dead link, also with trailing Template:Webarchive was altered to archive data embedded into 'Citation' and deleted 'webarchive'

There are two problems here. First, both links are reported with 'status=dead', but the first link is live. Secondly, adding the trailing 'webarchive' was an intentional edit (diff). I have reverted the bot edit, but also removed the defunct url from #2. Sechinsic (talk) 12:09, 10 May 2023 (UTC)

Bot continues to delete webarchive template and also marks live urls dead. Diff. List of live urls marked dead:

Please stop this. Sechinsic (talk) 19:54, 14 May 2023 (UTC)

Sechinsic, the community of Danish Wikipedia has set up citation templates to have built-in web archive parameters, and the bot accordingly implements this. If you want the bot to do something else, you need to take it up with your community. The second problem is more with the web archive template. By default, if there is a web archive template, it's because the underlying link is assumed broken, unless there is a specific indicator stating whether the link is alive or dead. So the problem is the lack of explicit indicator, not with the bot. And this is avoided by adding the web archives within the citation template, as the bot is programmed to do. If you think the templates should work differently, this is something you need to discuss with your community. The bot is configured based on the wiki's templates. Harej (talk) 21:16, 11 July 2023 (UTC)
This section was archived on a request by: Harej (talk) 20:44, 18 July 2023 (UTC)

Report false positives fails

I just reported the above urls via the IABot Management Interface. I click 'Submit' and the page reloads to show formatted html-links, and yet another submit-button. I click on 'submit', and nothing happens. And the 'Summary of user activity' just shows '0'. Sechinsic (talk) 20:19, 14 May 2023 (UTC)

Sechinsic, more information on how to use that feature is available at InternetArchiveBot/Documentation/Submitting bug reports#Report false positive. Harej (talk) 21:32, 11 July 2023 (UTC)
This section was archived on a request by: Harej (talk) 20:44, 18 July 2023 (UTC)

Link to "Internet Archive" for pages about novels

Two pages on wiki.it, Le strade di polvere and Il caso Courrier, lack this type of edit: “Add 1 book for Wikipedia:Verifiability”. Here is an example of what I mean: [5]. Could the Bot do it? Thanks.--151.19.238.36 12:13, 15 May 2023 (UTC)

It has and does try. If there are no links, it's either because the books don't exist at IA, the algorithm can't make a match due to some problem with the metadata, or the books are not in template (cite libro). -- GreenC (talk) 23:41, 11 July 2023 (UTC)
This section was archived on a request by: Harej (talk) 20:44, 18 July 2023 (UTC)

Iabot.org

Iabot.org domain is not working, please fix it, thanks Notrealname1234 (talk) 15:13, 17 May 2023 (UTC)

Notrealname1234, should be up at this point. The interface is prone to occasionally go out these days, unfortunately, but is typically brought back shortly after. Harej (talk) 23:02, 11 July 2023 (UTC)
This section was archived on a request by: Harej (talk) 20:44, 18 July 2023 (UTC)

Issues with InternetArchiveBot on it.wiki

Hello dear bot developer, two issues which are not exactly a "fatal problem", though they cause manual work to be done on it.wiki:

  1. the bot adds "dataarchivio" (archive date) but the templates Cita web, Cita news and so on are written to automatically extract the archive date from the archive.org url string. Thus that function on it.wiki is redundant.
  2. the bot often marks as urlmorto = no (dead url = no) urls that technically aren't dead but link to an article that, since the date of its publishing, has been moved behind a paywall and is not - fully or partly - readable. Thus the archived url is often the only way to read it fully.

Thanks for your attention. -- Blackcat 08:37, 21 May 2023 (UTC)

Blackcat, number one is likely not going to be addressed so long as the addition of the date does not harm the article (Italian Wikipedia is the only wiki to infer the date like this). Number two, can you provide an example? Harej (talk) 20:07, 12 July 2023 (UTC)
Yes, @Harej:, sure! See this article of the Daily Telegraph, and its archived version: while technically the article's URL is not 'dead', it is nonetheless unusable because the article is behind a paywall, whereas the archived version was still available for free. In this case I write that the url is dead. -- Blackcat 21:16, 12 July 2023 (UTC)
Setting aside the difficult policy issues of intentionally bypassing paywalls systematically.. how would the bot know when a page is behind a paywall and the archive version is not? That's technically challenging because every site is different. BTW this is similar to (and maybe even the same as) the soft-404 problem, where a page is reported as status 200 working, but actually is not working eg. redirects to the home page, contains some other content etc.. it's a classic problem with not many good solutions right now. There are some engineers working on this for IABot during the Google Summer of Code, it remains to be seen how well it works, past attempts have not worked out very well. Maybe the new AI technologies will help? -- GreenC (talk) 21:50, 12 July 2023 (UTC)
Ok thanks :) Btw... "no crime without law" :-) If an article was free and freely archived in 2013, then protected by paywall in 2017, it's not Archive.org's fault if the publisher had changed its mind. What was free until that date still continues to be free, I guess -- Blackcat 12:09, 15 July 2023 (UTC)
This section was archived on a request by: Harej (talk) 20:44, 18 July 2023 (UTC)
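
Note on point 1 above: a minimal sketch of how an archive date can be read from an archive.org URL string, as the it.wiki templates are described as doing. It assumes the common Wayback Machine form web.archive.org/web/<14-digit timestamp>/<original URL>; the function name is hypothetical, and this is not the actual code of Cita web or of the bot.

```python
import re
from datetime import datetime

# Assumption: Wayback URLs carry a 14-digit YYYYMMDDhhmmss timestamp,
# optionally followed by a two-letter modifier such as "id_" or "if_".
_WAYBACK_RE = re.compile(r"^https?://web\.archive\.org/web/(\d{14})(?:[a-z]{2}_)?/")


def wayback_archive_date(archive_url):
    """Return the snapshot date encoded in a Wayback URL, or None."""
    match = _WAYBACK_RE.match(archive_url)
    if match is None:
        return None  # not a Wayback URL in the assumed form
    try:
        return datetime.strptime(match.group(1), "%Y%m%d%H%M%S").date()
    except ValueError:
        return None  # 14 digits that do not form a valid timestamp
```

Under these assumptions an explicit archive date repeats information already present in the URL, which is why the extra parameter is redundant but harmless.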

ShareMap: interwiki // sharemap.org

We have 961 uses of //sharemap.org that seem to need to be managed. Plus, and I am not sure whether there is anything that you can or want to do here, the interwiki to ShareMap: Anyway, it is usurped per special:diff/25085500#Usurped domains and the interwiki will be redirected to a local page with an explanation (over a thousand uses.)  — billinghurst sDrewth 06:30, 29 May 2023 (UTC)

I added sharemap.org and comixpedia.org to the enwiki usurpation process [6] run by Waybackmedic. IABot has no support for usurpation unfortunately. We could mark the domain as dead or blacklisted, but it might end up adding an archive URL to the usurped site. It might be the only option available and hope for the best. -- GreenC (talk) 23:59, 11 July 2023 (UTC)
This section was archived on a request by: Harej (talk) 20:44, 18 July 2023 (UTC)

bot job on tewiki in queue since 28th may

https://iabot.toolforge.org/index.php?page=viewjob&id=13916  in queue since 28th may. Please. హరుడు (talk) 01:25, 6 June 2023 (UTC)

The job has been killed. -- GreenC (talk) 00:04, 12 July 2023 (UTC)
Sorry this appears related to User_talk:InternetArchiveBot#Bot's_been_stalled_for_nearly_two_months?. I should not have killed your job. Recommend recreating the job and will have to wait for the problem causing the stall to be fixed. -- GreenC (talk) 01:16, 12 July 2023 (UTC)
@GreenC I should not have killed your job. Ooops.... No issues..... Anyhow my other month old jobs 13955 — 13958 will start soon..... ;) హరుడు (talk) 03:42, 12 July 2023 (UTC)
This section was archived on a request by: Harej (talk) 20:44, 18 July 2023 (UTC)

Need a help

Hi, I need help in keeping the sources of my contributions in the Russian and English Wikibooks. (Many of the links have already disappeared and I had to remove them from the pages.)
https://pageviews.wmcloud.org/userviews/?project=ru.wikibooks.org&platform=all-access&agent=user&namespace=0&redirects=0&range=latest-1300&sort=views&direction=1&view=list&user=Виктор%20Пинчук
https://pageviews.wmcloud.org/userviews/?project=en.wikibooks.org&platform=all-access&agent=user&namespace=0&redirects=0&range=latest-1300&sort=views&direction=1&view=list&user=Виктор%20Пинчук Виктор Пинчук (talk) 16:37, 7 May 2023 (UTC)

Виктор Пинчук, we are not currently deployed on either English Wikibooks or Russian Wikibooks. Harej (talk) 20:56, 11 July 2023 (UTC)
So there is no way to save links in the Russian and English versions of the Wikibooks? — Виктор Пинчук (talk) 03:46, 12 July 2023 (UTC)
@Виктор Пинчук: If either community has not had the discussion about saving links on those wikis, and reached a consensus that it is what they want to do, then naturally, no. Start your conversations at the wikis, and if there is a consensus, then come and make a request that such a service be provided by the bot operators.

Go to InternetArchiveBot for the meta information page, which is different detail than the bot user page.  — billinghurst sDrewth 07:01, 12 July 2023 (UTC)

Is this a recommendation to get my own bot? That's a little difficult for me... Виктор Пинчук (talk) 10:29, 12 July 2023 (UTC)
Виктор Пинчук, no, it is a recommendation to read the documentation for https://iabot.toolforge.org Harej (talk) 20:04, 12 July 2023 (UTC)
This section was archived on a request by: Harej (talk) 14:53, 19 July 2023 (UTC)

Jobs finished but still showing as running

I think the bot has gotten stuck on batch submissions and some jobs may need to be killed. Each of these:

has completed but is marked as still running. I tried to kill each of them, but that only gave me an error 500. Could you kill them for me?

Also, sometimes when I submit a batch job, a duplicate submission is created, even though I just pressed the submit button once. Eastmain (talk) 12:03, 19 May 2023 (UTC)

I wasn't able to kill these jobs before, but a few minutes ago I successfully killed all three. Eastmain (talk) 07:34, 6 July 2023 (UTC)
There's some kind of problem, see below: User_talk:InternetArchiveBot#Bot's_been_stalled_for_nearly_two_months? -- GreenC (talk) 00:49, 12 July 2023 (UTC)
This section was archived on a request by: Harej (talk) 14:53, 19 July 2023 (UTC)

Bot Job Issue

Hello, I appear to be having an issue with the bot. Last night I tried to queue a batch submission job, but I kept getting errors for it. I tried it again this morning and it still didn't work. I checked my account and it says these jobs are still active, but they aren't doing anything and are making no progress. The job IDs in question are 13,872-13,875 and 13,880. I tried killing the duplicate jobs but they return an error. Sorry if I caused anything wrong, the bot has been having issues for me for a while. Captain Galaxy (talk) 12:28, 22 May 2023 (UTC)

Captain Galaxy, the bot queue appears to have gotten very long and we are investigating that. If you are getting the "checksum error" that is a transient problem that comes when you try to use the bot with multiple browser tabs. It should work when you try again. Harej (talk) 20:20, 12 July 2023 (UTC)
This section was archived on a request by: Harej (talk) 14:53, 19 July 2023 (UTC)

False positive

I don't know if it's more ridiculous that the bot keeps tagging some working external links as dead or that some user is restoring them every day. That said, please help solve the following issues:

  1. https://it.wikivoyage.org/w/index.php?title=Aeroporto_di_Pisa-San_Giusto&action=history
  2. https://it.wikivoyage.org/w/index.php?title=Aeroporto_di_Firenze-Peretola&action=history
  3. https://it.wikivoyage.org/w/index.php?title=Aeroporto_Internazionale_di_Delhi&action=history

Thanks, Andyrom75 (talk) 13:52, 23 May 2023 (UTC)

@Cyberpower678, @Harej, @GreenC, while waiting for your answer I've tried to use this tool to report this working external link but unfortunately it says that it has already been reported. Notwithstanding this, if I use this tool on voy:it:Aeroporto_di_Firenze-Peretola the bot keeps wrongly considering the working external link as dead.
Your support is required. Andyrom75 (talk) 14:28, 30 May 2023 (UTC)
@Cyberpower678, @Harej, @GreenC, it seems that "1" & "3" are now not recognized as dead links, but on "2" the problem still persists. Could you help (and maybe reply to me with a ping)? Thanks, --Andyrom75 (talk) 12:15, 9 June 2023 (UTC)
User:Andyrom75: Until whatever is causing this is fixed, you can keep the bot off any link with Cbignore. -- GreenC (talk) 23:45, 11 July 2023 (UTC)
GreenC, thanks for your reply. I didn't know about it. So, can I just create this new, empty template on it:voy, and InternetArchiveBot is already programmed to ignore the associated external link, right? --Andyrom75 (talk) 07:33, 12 July 2023 (UTC)
Yes. Like in the examples, the template goes right after the [] or after the {{}}, the bot will see and knows to skip those, as long as the cbignore is there. -- GreenC (talk) 13:18, 12 July 2023 (UTC)
This section was archived on a request by: Harej (talk) 14:53, 19 July 2023 (UTC)

Access request

Hello! I've been a Wikipedia editor since '04, but I have only 768 global edits. On iabot's metainfo page it mentions 1000+ edits to go from basicuser to user ... May I request an upgrade? ^_^;;

Hobart (talk) 02:02, 24 May 2023 (UTC)

Sure, done. -- GreenC (talk) 00:46, 12 July 2023 (UTC)
This section was archived on a request by: Harej (talk) 14:53, 19 July 2023 (UTC)

Errors on the page "1950–1951 Baghdad bombings"

I believe the bot caused the errors "Cite error: A list-defined reference with the name "BlackMorris92" has been invoked, but is not defined in the <references> tag" and "Cite error: A list-defined reference with the name "Mossad1" has been invoked, but is not defined in the <references> tag" on the page 1950-1951 Baghdad bombings. History person 2 (talk) 12:48, 6 June 2023 (UTC)

Thank you for your report History person 2. We are tracking this issue: task T321941. Harej (talk) 20:27, 12 July 2023 (UTC)
This section was archived on a request by: Harej (talk) 14:53, 19 July 2023 (UTC)

Archiving problem of the bot in Turkish Wikipedia

Hello, for a long time, the archives made by this bot continue to appear as text on the pages, inside the tables. Although this issue was on the agenda before, I don't remember if it was not solved or the problem was repeated. Example. You can examine the table. As you can see from the example, the archives made by InternetArchiveBot turn into text instead of remaining as a reference. Nevmit (talk) 11:14, 13 June 2023 (UTC)

Nevmit, for this page in particular I would recommend blocking the bot from editing by adding {{bots|deny=InternetArchiveBot}} to that page. Harej (talk) 20:31, 12 July 2023 (UTC)
This section was archived on a request by: Harej (talk) 14:53, 19 July 2023 (UTC)
Thanks for your suggestion. But this suggestion neither fixes the current bugs nor prevents future ones. I think there should be a way for InternetArchiveBot to work without errors. Nevmit (talk) 19:45, 20 July 2023 (UTC)

Meddling with quotation templates

[7]

Bot added explicit |1= and |2= parameters when there was no need, and also erased the argument of |2=. --Biolongvistul (talk) 22:14, 17 June 2023 (UTC)

Thank you for your report Biolongvistul. The "1=" and "2=" appears correct but the dropping of the parameter value is not. I have filed a report here: task T341127. Harej (talk) 20:34, 12 July 2023 (UTC)
This section was archived on a request by: Harej (talk) 14:53, 19 July 2023 (UTC)

http/https

Hello, sometimes a URL still exists, but on https: rather than http: [8]. I think this case is easy to check and repair. JAn Dudík (talk) 11:57, 25 June 2023 (UTC)

This is a common sort of problem, but probably outside the scope of IABot to check every dead http, that would be way too many GET requests at the scale of the bot severely slowing it down. Maybe an optional switch in the interface to tell the bot to convert to https. I opened a feature request: https://phabricator.wikimedia.org/T341643 .. GreenC (talk) 00:16, 12 July 2023 (UTC)
This section was archived on a request by: Harej (talk) 14:53, 19 July 2023 (UTC)
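
For reference, a minimal sketch of the rewrite step behind the requested "convert to https" switch. It only builds the candidate URL; as noted above, confirming that the candidate actually responds would take an extra request per link, and that is left out here. The function name is hypothetical and this is not part of IABot.

```python
from urllib.parse import urlsplit


def https_variant(url):
    """Return the https:// form of an http:// URL, or None if not applicable.

    Assumption: only the scheme changes. A URL with an explicit port
    (for example :80) would need extra handling that is not covered here.
    """
    parts = urlsplit(url)
    if parts.scheme != "http":
        return None  # already https, or some other scheme
    return parts._replace(scheme="https").geturl()
```

A caller would try the returned URL once and keep it only if it serves the expected page, which is where the extra cost comes from.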

Documentation about bot's settings

Hello! Can someone provide a documentation of any kind explaining with examples what each of the bot's settings does in this page? For many months I've thought about modifying them now for my homewiki but I'm unsure what some of them serve for. I'd be really glad if someone could provide some extra info. - Klein Muçi (talk) 09:52, 28 June 2023 (UTC)

Klein Muçi, documentation is available here: InternetArchiveBot/Documentation/Configuring bot behavior. Harej (talk) 20:35, 12 July 2023 (UTC)
This section was archived on a request by: Harej (talk) 14:53, 19 July 2023 (UTC)

Not recognising full url

Here, the bot marks a URL as dead while it is still active. It's strange that it places the template in the middle of the URL, as if it didn't recognise the full URL. Can someone look into how that happened? Ennomien (talk) 17:02, 29 June 2023 (UTC)

Not sure what is going on but recommend removing the double ticks around the URL, or better expand to a proper templated citation to avoid bugs. -- GreenC (talk) 00:20, 12 July 2023 (UTC)
This section was archived on a request by: Harej (talk) 14:53, 19 July 2023 (UTC)

Wrong URL

Wrong URL was added here. I have removed it. 2406:3400:518:3D50:47:EC48:AD48:2603 08:16, 2 July 2023 (UTC)

I reported this to archive.org [9] there is some confusing metadata with wrong ISBN info. -- GreenC (talk) 01:00, 12 July 2023 (UTC)
This section was archived on a request by: Harej (talk) 14:53, 19 July 2023 (UTC)

nybrovikings.com

IABot does not catch the redirects that old news articles have. The site now redirects old URLs like http://nybrovikings.com/uppdatering-kring-ungdom-och-juniorverksamheten/ to http://nybrovikings.com/ and not to the article, which makes the references useless. Is it possible to add archive links for these links? If some links are missing in the archive and get marked as dead I can probably fix them manually. The HTTP response seems to be 301 Moved Permanently. New working news articles look like: https://www.nybrovikings.com/article/b21arwx-38k4d/view and no archive links are needed for the working URLs. If you need more information contact me at my Swedish Talkpage /Machatjkala (talk) 18:32, 2 July 2023 (UTC)

IABot is unable to detect when a URL is good vs. when it redirects to the home page. It should be able to, actually. This is a problem with the dead link checker. Phab ticket: https://phabricator.wikimedia.org/T341645 -- GreenC (talk) 01:13, 12 July 2023 (UTC)
This section was archived on a request by: Harej (talk) 14:53, 19 July 2023 (UTC)
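
To illustrate the case reported in this thread, here is a rough heuristic for flagging a redirect that lands on the site's home page. It assumes the caller already knows the final URL after following redirects; the function names are hypothetical and this is not the logic of IABot's dead link checker.

```python
from urllib.parse import urlsplit


def _bare_host(url):
    """Host name without a leading 'www.', lower-cased by urlsplit."""
    host = urlsplit(url).hostname or ""
    return host[4:] if host.startswith("www.") else host


def looks_like_homepage_redirect(original_url, final_url):
    """True if a deep link on a site ended up at that same site's root page."""
    original, final = urlsplit(original_url), urlsplit(final_url)
    same_site = _bare_host(original_url) == _bare_host(final_url)
    original_is_deep = original.path.strip("/") != ""
    final_is_root = final.path.strip("/") == "" and not final.query
    return same_site and original_is_deep and final_is_root
```

With the URLs from the report, the old article link ending at http://nybrovikings.com/ is flagged, while a link that resolves to one of the new /article/... pages is not. Sites that redirect to a generic error page or a section index would need other signals.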

Bot's been stalled for nearly two months?

As of right now, IABot's most-recent completed batch run is job 13844, back in mid-May and over 250 jobs back. It seems to have been stuck on jobs 13846 and 13847 since then. What's gone wrong? Whoop whoop pull up Bitching Betty ⚧️ Averted crashes 19:59, 8 July 2023 (UTC)

I killed the 5 jobs that were stuck at 100% completion. This freed up space for 5 more jobs to start. If those 5 have the same problem we know there is a problem with 100% complete jobs not exiting properly. -- GreenC (talk) 00:30, 12 July 2023 (UTC)
The problem is unfortunately still there, completed jobs don't exit, holding the slot not allowing queued jobs to start. -- GreenC (talk) 00:34, 12 July 2023 (UTC)
This section was archived on a request by: Harej (talk) 14:53, 19 July 2023 (UTC)

Article size limit

Having IABot create archive links on a new featured article is a common use case. However, such articles often exceed the size limit, requiring a bot job to be used, and I don't think bot jobs create archive links for non-dead links, limiting their utility. Would it be possible to increase the size limit or resolve this issue some other user-friendly way? {{u|Sdkb}}talk 17:15, 10 July 2023 (UTC)

Sdkb, the article limit is unfortunately a hard limit that Toolforge imposes that we can't get around. A workaround to this is to split the contents of the page across two separate sandbox pages. Harej (talk) 20:37, 12 July 2023 (UTC)
@Harej, that's a very inconvenient workaround, especially given that the bot is often queued far in advance. What needs to happen at Toolforge to raise the limit, and where would we go to ask for that? {{u|Sdkb}}talk 20:52, 12 July 2023 (UTC)
Sdkb it is not a limit that can be raised. Really the only solution is for us to move it from Toolforge to Cloud VPS, where it will have access to more resources. We may pursue this in the future. Harej (talk) 20:15, 18 July 2023 (UTC)
That would be good, as otherwise the problem will remain unsolved. There should be some way to achieve the desired functionality — I'll leave it to others with more knowhow to figure out the means. {{u|Sdkb}}talk 03:53, 19 July 2023 (UTC)
No, this isn't resolved. The resolution identified is to move it from Toolforge to Cloud VPS. Mismarking this as resolved effectively hides that info by burying it with resolved issues. Unless there's a Phab ticket, but I don't see mention of one. RudolfoMD (talk) 19:33, 19 July 2023 (UTC)
@RudolfoMD, the Phab ticket (task T342168) is in the "Tracked" box at upper right. I do hope it'll be taken up rather than just disappearing into the backlog. Any thoughts on who could make that happen? Cheers, {{u|Sdkb}}talk 19:41, 19 July 2023 (UTC)
Sdkb, we hope to do it as soon as our Cloud VPS quota increase is approved. We need more capacity to deploy a new virtual machine. Please let me know if you have any further questions. Harej (talk) 20:13, 19 July 2023 (UTC)
This section was archived on a request by: Harej (talk) 20:13, 19 July 2023 (UTC)

Source URLs on commons

Is the bot archiving the |source= URLs of files on Commons? If not, please can it? Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 18:54, 31 May 2023 (UTC)

I used to do that with WaybackMedic and got a pile of grief on different occasions from users who say the archive URL is not the original source image and it should not be archived even if dead. So I stopped trying to help Commons, at least until they figure out what they want. -- GreenC (talk) 00:02, 12 July 2023 (UTC)

┌─────────────────────────────────┘

Late reply; sorry. There are two actions:

  1. Archive URL in Wayback Machine
  2. Update links on Commons

Even if [2] is not done, [1] can be, and that was my query.

But I have raised [2] at c:Commons:Village pump#Archiving of source URLs by bot. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 14:59, 19 July 2023 (UTC)

Pigsonthewing, any external link posted to Wikimedia projects in the last ~10 years has been submitted to the Wayback Machine. Adding such archive links to the source parameter may be trickier since it's not strictly a URL parameter; it allows arbitrary text. Harej (talk) 20:09, 19 July 2023 (UTC)
This section was archived on a request by: Harej (talk) 20:44, 1 August 2023 (UTC)

worldaerodata.com PermaLive?

worldaerodata.com has been usurped and now serves malware/phishing/spam for all URLs. Apparently it is marked as PermaLive. Could someone change it to PermaDead or what is the usual procedure in this case? Count Count (talk) 10:11, 19 July 2023 (UTC)

On Enwiki, these will get usurped by WaybackMedic. On other wikis there is no mechanism/support to do usurpations. I'll set it to permadead which is sort of a best we can do until/if IABot supports usurpation. -- GreenC (talk) 13:16, 19 July 2023 (UTC)
This section was archived on a request by: Harej (talk) 20:44, 1 August 2023 (UTC)

Bad bot. Unsupervised bot. (revived)

1. With https://en.wikipedia.org/w/index.php?title=Non-specific_effect_of_vaccines&diff=prev&oldid=1163112453, the bot deleted my marking of https://www.who.int/immunization/sage/meetings/2014/april/3_NSE_Epidemiology_review_Report_to_SAGE_14_Mar_FINAL.pdf as a dead link. It IS a dead link. The other work that's part of that edit seems good.-RudolfoMD (talk) 03:08, 11 July 2023 (UTC)

@RudolfoMD: To address this one at enWP. The second ref had the same name as the first ref name=(2), so they were concatenated. If the second ref was yours, you should have applied a different name to it, eg. name="WHO (2)". The bot did nothing wrong. Fix your referencing to have a unique name for a unique reference.
It's not my referencing. The bot did wrong. Again, the bot deleted my marking of https://www.who.int/immunization/sage/meetings/2014/april/3_NSE_Epidemiology_review_Report_to_SAGE_14_Mar_FINAL.pdf as a dead link. That was wrong, irrespective of the ref name reuse which I didn't introduce. And it did not concatenate. It discarded. Bug. Please don't mark this resolved, which billinghurst did at 01:45, 12 July 2023, unless it's actually resolved.
You have indeed identified a bug; we are tracking it here: task T342299. Thank you for your report. Harej (talk) 20:32, 19 July 2023 (UTC)
Thank you. RudolfoMD (talk) 01:34, 20 July 2023 (UTC)

2. The bot seems to be rather poorly supervised? Namely, it seems like most of the reports on this page have gone unaddressed. And the problem is presumably much worse than it seems, because SpBot archives all sections tagged with { {Section resolved|1=~ ~ ~ ~} } after 7 days. What should be done? RudolfoMD (talk) 03:08, 11 July 2023 (UTC)

The archiving bot only archives sections manually marked as resolved, so if they are not resolved, they should still be on this page. So I don't see the issue, that is simply a delay, an inaction, rather than an incorrect action.  — billinghurst sDrewth 03:40, 11 July 2023 (UTC)
Thanks for clarifying / correcting me.
I didn't get much of a chance to read or respond before my comment was archived because SpBot archives all sections tagged with { {Section resolved|1=~ ~ ~ ~} } after 7 days. Problematic - I hadn't seen it. Please be more patient. Change to 14 days? RudolfoMD (talk) 19:26, 19 July 2023 (UTC)
This section was archived on a request by: Harej (talk) 20:44, 1 August 2023 (UTC)

Twitter

Please respond. https://meta.wikimedia.org/w/index.php?title=User_talk:InternetArchiveBot&oldid=prev&diff=25321113 RudolfoMD (talk) 01:31, 20 July 2023 (UTC)

As it was a sensitive matter I corresponded with him over email. Harej (talk) 15:08, 20 July 2023 (UTC)
This section was archived on a request by: Harej (talk) 20:44, 1 August 2023 (UTC)

Vandalism in User:InternetArchiveBot

Hello. I come to inform you that an anonymous user has vandalized the user page of this bot on the Galician Wikipedia. I have tried to revert the vandalism, but I don't see previous revisions in the history. They should take a look at it to solve this problem. Mark Gasoline (talk) 13:35, 22 July 2023 (UTC)

I marked the page for deletion. Once deleted the meta user page of the bot will be displayed. Count Count (talk) 13:37, 22 July 2023 (UTC)
They should protect the page so that only admins can edit it, otherwise it will be vandalized again. Mark Gasoline (talk) 13:47, 22 July 2023 (UTC)
Mark Gasoline, you should make that request to the Galician Wikipedia administrators – we are unable to protect pages there. Harej (talk) 19:59, 25 July 2023 (UTC)
Ok, if they vandalize the page again I will notify an administrator. Greetings. Mark Gasoline (talk) 21:42, 25 July 2023 (UTC)
This section was archived on a request by: Harej (talk) 20:44, 1 August 2023 (UTC)

Central Library (Brooklyn Public Library)

I tried to run the bot on en:Central Library (Brooklyn Public Library) but it said it was "too big". The article has 337 references, which might be a bit excessive for an article on a library, but there are many articles that deservedly have more refs. Can the bot's cutoff be raised? Abductive (talk) 03:58, 31 July 2023 (UTC)

Abductive, we will be able to raise this limit once we move the bot frontend from Toolforge to a dedicated virtual machine, which we will be working on shortly. Harej (talk) 20:19, 1 August 2023 (UTC)
Great, I look forward to it. Thanks, Abductive (talk) 20:47, 1 August 2023 (UTC)
This section was archived on a request by: Harej (talk) 19:54, 8 August 2023 (UTC)

Adding 1.5 MB of text into a page

Please investigate what went wrong here. I have blocked the bot from editing this specific page for time being. Gikü (talk) 23:10, 31 July 2023 (UTC)

Thank you Gikü, that is an interesting bug. Blocking the bot from running on that page in the meantime is good. We are tracking the issue on Phabricator. Harej (talk) 20:44, 1 August 2023 (UTC)
This section was archived on a request by: Harej (talk) 19:54, 8 August 2023 (UTC)

Falsely tagging a link as permanently dead rather than just dead

In this edit, the bot claimed a link was permanently dead. It was dead, but the URL could be found on Archive.org. Any idea why the bot didn't pick that up, and if it might be similarly missing other not-actually-permanently-dead links? {{u|Sdkb}}talk 17:11, 26 July 2023 (UTC)

Sdkb, thank you for reporting that. Since you've found an archive we've now registered it as the archive for that URL. On occasion, the Internet Archive's availability API fails to return results when it should, and that may be what happened here. Harej (talk) 20:12, 1 August 2023 (UTC)
Thanks! Hopefully the API doesn't fail too often, or we'll have to change w:Template:Permanently dead link to read Possibly permanently dead link. Maybe talk with the Internet Archive folks to see if they can figure out what's happening and make sure it doesn't happen for other links in the future? {{u|Sdkb}}talk 20:20, 1 August 2023 (UTC)
Is there a way to reach out to the IA folks? {{u|Sdkb}}talk 20:40, 8 August 2023 (UTC)
Sdkb, we (the Internet Archive) are already aware of the issue; the problem is that it's such a rare bug we cannot reproduce it. We believe it is very rare. Harej (talk) 20:06, 9 August 2023 (UTC)
OK. I can report any future instances I spot to you if that would help track down the error. {{u|Sdkb}}talk 20:54, 9 August 2023 (UTC)
This section was archived on a request by: Harej (talk) 20:45, 16 August 2023 (UTC)
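
For anyone following up on this thread, a small sketch of how a reply from the Wayback Machine availability API can be read. It assumes the public endpoint at archive.org/wayback/available and its "archived_snapshots" / "closest" response shape; the helper names are hypothetical, and how IABot itself calls the API is not shown here.

```python
import json
from urllib.parse import urlencode

# Assumption: the public availability endpoint and its documented query parameters.
AVAILABILITY_ENDPOINT = "https://archive.org/wayback/available"


def availability_query(url, timestamp=None):
    """Build the query URL; timestamp is an optional YYYYMMDDhhmmss string."""
    params = {"url": url}
    if timestamp:
        params["timestamp"] = timestamp
    return AVAILABILITY_ENDPOINT + "?" + urlencode(params)


def closest_snapshot(response_text):
    """Return the closest snapshot URL from a JSON reply, or None."""
    data = json.loads(response_text)
    closest = data.get("archived_snapshots", {}).get("closest")
    if not closest or not closest.get("available"):
        return None  # reads the same whether no archive exists or the API missed one
    return closest.get("url")
```

The None branch is the point of interest: an empty "archived_snapshots" object looks identical whether the page was never archived or the lookup failed to return a result it should have, which is how a link can end up tagged as permanently dead while a snapshot exists.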

False Positive, Not accessible outside Sweden

This URL is not accessible outside Sweden. Please whitelist all URLs beginning with https://pubs.sub.su.se. Stockholm University publications are not accessible outside Sweden. I have whitelisted these over and over again, but it doesn't seem to help. Skivsamlare (talk) 17:38, 5 August 2023 (UTC)

We have added the domain to the permalive list. While you added URLs to the queue, they were not automatically processed, but now the whole domain has been. Harej (talk) 20:15, 9 August 2023 (UTC)
This section was archived on a request by: Harej (talk) 20:45, 16 August 2023 (UTC)

Reference with empty name replaced with invalid ref name on kn wiki

This diff replaced a reference that had an empty name with an invalid ref name. Frankly, I don't know what the bot should do in this situation; I'm just bringing it to your attention ;) రుద్రుడు చెచ్క్వికి (talk) 09:25, 7 August 2023 (UTC)

Thank you for your report. We are tracking this bug on Phabricator. In the meantime, as a workaround, removing "name=" from the reference will work to avoid this situation. Harej (talk) 20:21, 9 August 2023 (UTC)
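The workaround above could be applied to wikitext along these lines. This is only a sketch: the function name and regular expression are hypothetical, and it handles just the simple case of a quoted, empty name attribute on a ref tag.

```python
import re

# Matches an empty name attribute (name="" or name='') inside a <ref ...> tag.
# Group 1 keeps everything in the tag that precedes the empty attribute.
EMPTY_NAME = re.compile(
    r"""(<ref\b[^>]*?)\s+name\s*=\s*(["'])\2(?=[\s/>])""",
    re.IGNORECASE,
)


def strip_empty_ref_names(wikitext):
    """Remove empty name attributes from ref tags, leaving named refs alone."""
    return EMPTY_NAME.sub(r"\1", wikitext)
```

For example, `<ref name="">Source</ref>` becomes `<ref>Source</ref>`, while `<ref name="a">Source</ref>` is left unchanged.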
This section was archived on a request by: Harej (talk) 20:45, 16 August 2023 (UTC)

Bot incorrectly removing dead link tag

This bot has been removing (diff) a dead link tag on en:Montgomery County, Pennsylvania. The link in question 404s and is not archived by IA, so I tagged it as dead per en:Template:Dead link. It's possible I am misunderstanding proper usage of en:Template:Dead link, but I believe the bot should not be removing the dead link tag. Arjsd (talk) 02:30, 8 August 2023 (UTC)

Thank you for your report. We are tracking this bug on Phabricator. Harej (talk) 20:23, 9 August 2023 (UTC)
This section was archived on a request by: Harej (talk) 20:45, 16 August 2023 (UTC)

Permission error

Hi, I've used the tool once before but got a permission error when I tried to use it on a different computer. My username is Toobigtokale. Could you help me out?

This is the message I'm getting.

  The action you are trying to perform requires the analyzepage permission.
  This permission is obtainable with the following groups: basicuser, user, admin, root, bot

I've already confirmed my email. Toobigtokale (talk) 20:31, 9 August 2023 (UTC)

Nvm im a dummy, i was on the wrong wiki. My default was set to enwiki tho :/ Toobigtokale (talk) 20:34, 9 August 2023 (UTC)
This section was archived on a request by: Harej (talk) 20:12, 16 August 2023 (UTC)

Encyclopedia of Fantasy link at H. P. Lovecraft

For some reason, the bot marked the Encyclopedia of Fantasy link in the external links section of H. P. Lovecraft's article as being dead. It has done that twice. Is that the result of an error with how the bot reads the website? Susmuffin (talk) 00:55, 11 August 2023 (UTC)

Susmuffin, we looked into it and the web server is returning a 404 error for that page even though it shouldn't. We have set the URL as "permalive" so that it will treat it as a working link. Harej (talk) 20:19, 16 August 2023 (UTC)
This section was archived on a request by: Harej (talk) 17:55, 23 August 2023 (UTC)

Falsely dead link

For some reason, the bot marked a link to the site of the Comune di Milano in the article w:it:Magda Olivero as being dead, whereas it is still operational. Is that the result of an error with how the bot reads the website? Jeanambr (talk) 23:12, 11 August 2023 (UTC)

Jeanambr, I investigated and while the site does not load in the United States (where the bot is based), it does load when using an Italian VPN. So the bot is probably running into geo-restrictions. I have set the URL as "permalive" so that it will be treated as a working link. Harej (talk) 20:30, 16 August 2023 (UTC)
Thank you very much. Jeanambr (talk) 21:54, 16 August 2023 (UTC)
This section was archived on a request by: Harej (talk) 17:55, 23 August 2023 (UTC)

Usurped websites => now gambling spam

Where possible, these need to be marked as unsafe/usurped, thanks.

  • readbookonline.net
  • terimarejeki.com
  • zicohouse.org
  • pablototortp.online
  • xn--sltjmpl-eya6gya.com

Thanks.  — billinghurst sDrewth 13:27, 17 August 2023 (UTC)

All have been marked as permadead except for "terimarejeki.com" and "xn--sltjmpl-eya6gya.com" which don't appear in the IABot database. Harej (talk) 20:12, 23 August 2023 (UTC)
This section was archived on a request by: Harej (talk) 21:38, 30 August 2023 (UTC)

Links going to geoblocked pages

Hi! I run IABot a lot on pages, and it's fantastic. We currently have a domain (Eurosport.co.uk), where sometimes the archived url goes to a geoblocked link (such as https://web.archive.org/web/20220401230655/https://www.eurosport.com/geoblocking.shtml). I tried changing the domain through https://iabot.toolforge.org/index.php?page=manageurldomain but I just get an error message saying I can't do this. Any ideas on how to either prevent IABot from trying to archive these urls, or better yet, get it to pick a working archive? Best Wishes, Lee Vilenski (talkcontribs) 15:46, 22 August 2023 (UTC)

Eurosport.co.uk and its subdomains have now been marked as permalive, so the bot should leave them alone now. Harej (talk) 20:21, 23 August 2023 (UTC)
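As an illustration of what marking a domain and its subdomains as permalive implies for matching, here is a small sketch. The function name and the idea of holding the list as a plain set are assumptions for illustration only; IABot's actual database lookup is not shown in this discussion.

```python
from urllib.parse import urlsplit


def is_permalive(url, permalive_domains):
    """Return True if the URL's host is a listed domain or one of its subdomains.

    permalive_domains is a hypothetical set of lowercase domain names.
    """
    host = (urlsplit(url).hostname or "").lower()
    return any(host == d or host.endswith("." + d) for d in permalive_domains)
```

Matching on the dot boundary keeps an unrelated host such as noteurosport.co.uk from being treated as a subdomain of eurosport.co.uk.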
This section was archived on a request by: Harej (talk) 21:38, 30 August 2023 (UTC)

Permission error

I'm getting this error when I try to Run bot > Fix a single page.

"Permission error The action you are trying to perform requires the analyzepage permission. This permission is obtainable with the following groups: basicuser, user, admin, root, bot" 99% fad-free (talk) 13:02, 29 August 2023 (UTC)

Oh, never mind. I need ten edits on Mediawiki in addition to Wikipedia, I think. 99% fad-free (talk) 13:08, 29 August 2023 (UTC)
99% fad-free, make sure you have the correct wiki selected from the drop-down menu on the top right, in addition to having the minimum number of edits. Harej (talk) 21:39, 30 August 2023 (UTC)
That did it! Thank you @Harej. 99% fad-free (talk) 23:01, 30 August 2023 (UTC)
This section was archived on a request by: 99% fad-free (talk) 23:01, 30 August 2023 (UTC)

Incorrect tagging

The Bot comment says Rescuing 2 sources and tagging 0 as dead. But the two sources were tagged as dead, even though they are not dead. [10] Hawkeye7 (talk) 21:54, 29 August 2023 (UTC)

I am assuming you are referring to these links:
When I visited both of these URLs earlier I got a "bad request" error. However, just now, when visiting them I saw them come back up. So I think these resources have been going in and out. Interestingly, on the top of www.history.army.mil there is this notice: "Pardon our dust. Our website is undergoing maintenance and some content may be inaccessible or load incorrectly. Thank you for your patience." which may be why this is happening. In the meantime, I've flipped the status of those URLs in the database as being alive. Harej (talk) 20:18, 30 August 2023 (UTC)
This section was archived on a request by: Harej (talk) 18:43, 6 September 2023 (UTC)

Birds and aircraft

Please stop adding a book about birds to all the articles that are instead about aircraft and about another encyclopedia (example). Thanks :-) Pil56 (talk) 09:26, 12 August 2023 (UTC)

Pinging User:GreenC. Harej (talk) 18:16, 31 August 2023 (UTC)
I made a change that I think will address these sorts of edits in a general sense (non-specific to this book). I tested this book and it works. The bot will process itwiki in the next day or two, let's see what happens. -- GreenC (talk) 17:26, 10 September 2023 (UTC)
It ran and everything is OK. -- GreenC (talk) 20:53, 11 September 2023 (UTC)
This section was archived on a request by: GreenC (talk) 20:53, 11 September 2023 (UTC)

multiple month name issues on tewiki

  • If IABot is adding archive parameters, dates must be in English if the ref date format is in English only.
  • Date names must follow the rules of te wiki.
  • Example: edit
  • For date names, refer to the doc.

రుద్రుడు చెచ్క్వికి (talk) 17:47, 3 August 2023 (UTC)

The citation style is specific about how to write Telugu month names, but the bot seems to be ignoring it? Anyhow, please refer to the te month names as needed. రుద్రుడు చెచ్క్వికి (talk) 17:57, 3 August 2023 (UTC)
రుద్రుడు చెచ్క్వికి, for a given wiki, the bot only adds template parameters in one language. So it would have to be all English or all Telugu. Which would be better? Harej (talk) 20:09, 9 August 2023 (UTC)
The English months in the sources need not be converted into Telugu. They should only be in English. --యర్రా రామారావు (talk) 03:13, 10 August 2023 (UTC)
యర్రా రామారావు, if the bot is configured to use English dates, it will convert Telugu dates it comes across and translate them into English. Is that okay? Harej (talk) 20:11, 16 August 2023 (UTC)
This section was archived on a request by: Harej (talk) 18:57, 15 September 2023 (UTC)

Problem with translation

Hi. I have been translating some messages on translatewiki but I do not understand this message.

"The IABot Management Interface is undergoing needed maintenance the tool and/or the bot may interfere with. ..."

It seems to me that some words are missing. Could you check/fix? --MGA73 (talk) 16:30, 5 September 2023 (UTC)

Does "The IABot Management Interface has been disabled for ongoing maintenance" make more sense? Harej (talk) 20:12, 6 September 2023 (UTC)
This section was archived on a request by: Harej (talk) 18:57, 15 September 2023 (UTC)

Problem with IABot workers jamming on job completion

Lately IABot seems to've been having a problem where, on completing a batch job, the worker that's just completed the batch job apparently jams and fails to release itself for further bot jobs (current example: job #14726, although it may well've been resolved by the time anyone reads this). As a result, the job-progress bar shows 100% completion, yet the run status continues to be listed as "Running" and the bot doesn't move on to handling subsequent bot jobs. Could someone look into (and, if possible, fix) this? Whoop whoop pull up Bitching Betty ⚧️ Averted crashes 03:58, 24 August 2023 (UTC)

Also, are there a few workers still stuck on jobs further in the past? The number of active jobs that can be processed at the same time's seemed to have gone down from 4 to just 2 very recently. Whoop whoop pull up Bitching Betty ⚧️ Averted crashes 04:01, 24 August 2023 (UTC)
And now one of those two workers seems to've died outright. Whoop whoop pull up Bitching Betty ⚧️ Averted crashes 14:41, 30 August 2023 (UTC)
Whoop whoop pull up, thank you for your report. This is an issue we've been dealing with for a while unfortunately. You can track our work here. Harej (talk) 20:09, 30 August 2023 (UTC)
Y'welks! Any idea what's causing it? Whoop whoop pull up Bitching Betty ⚧️ Averted crashes 00:02, 31 August 2023 (UTC)
Whoop whoop pull up, what's likely happened is that the management interface has "outgrown" Toolforge, so we are in the process of moving it to a dedicated virtual machine so that it will run better. This is in process but we have had some delays. Harej (talk) 20:08, 6 September 2023 (UTC)

┌─────────────────────────────────┘
The problem appears to've gotten worse; there've now been some batch jobs that've frozen only partway through, without having finished. My jobs #14952 and #14955 and User:Eastmain's long-running job #14875 appear to've exhibited this behavior, and, when I killed 14952 and 14955 to free up their workers (requeueing the remaining portions of their batches for later jobs), the bot worker for one of those apparently died rather than being freed up, as killing those two jobs only allowed one queued job to begin (thus taking the number of active jobs that can be processed at any one time from 4 down to 3). Whoop whoop pull up Bitching Betty ⚧️ Averted crashes 16:15, 13 September 2023 (UTC)

...aaand the same seems to've happened when Eastmain killed 14875, given that it looks like we're now down to just two workers. Whoop whoop pull up Bitching Betty ⚧️ Averted crashes 19:47, 13 September 2023 (UTC)
This section was archived on a request by: Harej (talk) 17:02, 20 September 2023 (UTC)

Erroneous tagging

On two of the enwiki articles that happen to be on my watchlist, en:J. R. R. Tolkien‎ and en:C. S. Lewis‎, the bot has recently tagged external links to the Encyclopedia of Fantasy as being dead [11][12], when they are in fact live. I've reverted those edits; but I bring this up because the bot may have performed similar mistaggings on other articles that should be reverted and may need some sort of adjustment to avoid such mistagging. (I don't know how to interpret the reference to "Whoop whoop pull up - 14897" in the edit summaries, but if Whoop whoop pull up is the person to whose attention I should be bringing this matter, please let me know.) Deor (talk) 15:51, 8 September 2023 (UTC)

The presence of my username in the edit summary just means that I was the user who queued the batch job that those edits were made as part of. You'll want to be bringing up the bug with the bot's maintainers, not with me. Whoop whoop pull up Bitching Betty ⚧️ Averted crashes 20:12, 8 September 2023 (UTC)
This is one of those things. Determining if a link is dead is a dark art; not even Google, with the world's best programmers and resources, always gets it right. Mistakes will be made. The question is what the error rate is; assume greater than 0%. The tool user/initiator is ultimately responsible for their work. If you choose to use it, you are responsible for the edits it makes, in the sense that if it messes something up, the tool is not going to go back and fix the problem created on wiki. Of course the tool developer will try hard to make it accurate for future uses of the tool. So it's a collaborative effort: both tool maker and tool user are responsible in different ways. -- GreenC (talk) 18:00, 10 September 2023 (UTC)
This section was archived on a request by: Harej (talk) 17:02, 20 September 2023 (UTC)

Can't run bot

Hi, I tried to use the bot on en:Wii to fix a broken archive link, but it's telling me that I need the analyzepage permission. It says that I can get that permission with the User group, and I think I have that, but I can't run it, so what's going on? RteeeeKed (talk) 20:46, 8 September 2023 (UTC)

What's your home wiki, enwiki? -- GreenC (talk) 16:52, 10 September 2023 (UTC)
Yes, also someone fixed the link I wanted to fix already, but I still want to know why I can't use the bot. RteeeeKed (talk) 17:51, 10 September 2023 (UTC)
When I try to load your profile at iabot.org it says: Nonexistent user: The user you are trying to look up does not exist. They may have not yet used this interface. Have you logged into iabot.org before? -- GreenC (talk) 18:06, 10 September 2023 (UTC)
This section was archived on a request by: Harej (talk) 17:02, 20 September 2023 (UTC)

duplicated sources in Alan Turing

Could this be fixed? It's repeatedly adding duplicated sources. Please see the included links above. Thanks. - 2001:4451:B52:CF00:3810:D909:D24A:6D59 02:48, 10 September 2023 (UTC)

Not sure what the bug is but this should stop it: [13] -- GreenC (talk) 16:51, 10 September 2023 (UTC)
This section was archived on a request by: Harej (talk) 17:02, 20 September 2023 (UTC)

Can't access management interface

As of this morning, I can't access the IABot management interface, with all attempts to do so throwing a 504 Gateway Time-out error.

I'm assuming this is not the intended behavior? Whoop whoop pull up Bitching Betty ⚧️ Averted crashes 12:19, 11 September 2023 (UTC)

There have been ongoing outages. As previously discussed, we are in the middle of moving the management interface to a new system; it has been taking longer than expected. Harej (talk) 20:11, 13 September 2023 (UTC)
This section was archived on a request by: 17:02, 20 September 2023 (UTC)

Dead link, but not actually dead

https://nl.wikipedia.org/w/index.php?title=H%C5%99ebe%C4%8D&diff=next&oldid=64810196 It marked the link to https://www.obechrebec.cz/ as dead, but it works fine on my end. In fact, it loads within a few milliseconds even. IMHO, that's a major flaw. If it took 10 seconds to load, I'd understand, but a few ms should not be grounds for being marked as dead. Please fix this, so that I don't have to disable the bot on that article. Mondo (talk) 09:39, 14 September 2023 (UTC)

And over here as well: https://nl.wikipedia.org/w/index.php?title=Ky%C5%A1ice_%28okres_Kladno%29&diff=65969182&oldid=65896077
Archiving the first link was fine, but the second (under ‘Bedrijven’) was not: it loads within a few ms, so it's not dead at all. Again: please fix this, I'm getting more and more convinced that I should disable the bot on certain pages. Mondo (talk) 08:45, 17 September 2023 (UTC)
The problem with https://www.obechrebec.cz/ is that it has an invalid SSL certificate, and so the redirect is not carried out because the bot is stuck on this error. Even if the destination site has a valid certificate, the original domain does not. With the second edit you linked to, it's just moving template parameters around, not changing anything. The archive URLs were already there. Harej (talk) 20:35, 20 September 2023 (UTC)
My browser always displays a warning when there's an invalid SSL certificate; however, I received no such warning here. So I don't think that's the issue. But still, there should be a fix for it in the bot, else I'll disable the bot. Mondo (talk) 20:40, 20 September 2023 (UTC)
There is indeed an SSL error here; both User:Harej and I just encountered it while inspecting the URL. My recommendation is to replace the URL with the redirect target, https://www.hrebec.cz/, which has a valid, working SSL certificate. This will solve the problem here. —CYBERPOWER (Chat) 20:44, 20 September 2023 (UTC)
Then why does my browser not report the SSL error, even though it does so on other sites with SSL errors? Mondo (talk) 20:46, 20 September 2023 (UTC)
Presumably because you have already told the browser to trust the invalid certificate which is now saved in your browser as a trusted certificate. —CYBERPOWER (Chat) 20:47, 20 September 2023 (UTC)
I did no such thing. I didn't get the error in the first place. Mondo (talk) 20:49, 20 September 2023 (UTC)
I reported it to the bug tracker of the browser devs, let's see what they have to say about it. Mondo (talk) 20:55, 20 September 2023 (UTC)
This is standard behavior for every browser. —CYBERPOWER (Chat) 20:48, 20 September 2023 (UTC)
This section was archived on a request by: Harej (talk) 21:04, 27 September 2023 (UTC)

Bad data being fed to this bot

Bad data being fed to this bot. I reverted the InternetArchiveBot actions.

The link went to a gaming site. I alerted User Eastmain as well. -- Ancheta Wis (talk) 16:24, 14 September 2023 (UTC)

Ancheta Wis, which of these links is linking to a gaming site? Harej (talk) 20:39, 20 September 2023 (UTC)
All the links resurrected by InternetArchiveBot in the revision at 15:44, 14 September 2023 (Rescuing 5 sources and tagging 0 as dead.) #IABot (v2.0.9.5) (Eastmain - 14985) are properly dead. The bot is linking to a gaming site which usurped Fort Bliss Bugle. --Ancheta Wis (talk) 21:45, 20 September 2023 (UTC)
I looked at every archive URL the bot added, and they all load a historical copy of a news article from a defunct news service. Not a single one of them even remotely mentions gaming. —CYBERPOWER (Chat) 21:57, 20 September 2023 (UTC)
This section was archived on a request by: Harej (talk) 21:04, 27 September 2023 (UTC)

API Error: An unknown error occurred

Analyze a page: https://kn.wikipedia.org/wiki/%E0%B2%B8%E0%B3%8D%E0%B2%A4%E0%B3%8D%E0%B2%B0%E0%B3%80 Error: "API Error: An unknown error occurred" rudhrudu (talk) 15:30, 17 September 2023 (UTC)

rudhrudu, I tried running https://iabot.wmcloud.org/index.php?page=runbotsingle&action=analyzepage and it should work now. Harej (talk) 20:41, 20 September 2023 (UTC)
This section was archived on a request by: Harej (talk) 21:04, 27 September 2023 (UTC)

"Spam link" error when archiving Disney's My Son Pinocchio: Geppetto's Musical Tale

I have been attempting to archive the article Disney's My Son Pinocchio: Geppetto's Musical Tale but the bot tells me there are blacklisted links in it. I have checked all the links and not found any blacklist. Can someone please assist with this? SanAnMan (talk) 17:04, 19 September 2023 (UTC)

SanAnMan, the bot will not be able to edit the page so long as it links to filmreference.com. You would have to remove the link to that website for the bot to edit. Harej (talk) 20:53, 20 September 2023 (UTC)
Thank you, I've replaced the link with another RS and the bot worked perfectly. Appreciate the assistance! - SanAnMan (talk) 14:19, 21 September 2023 (UTC)
This section was archived on a request by: Harej (talk) 21:04, 27 September 2023 (UTC)

|title=Archived copy

Could I kindly ask the developers not to insert |title=Archived copy when processing enwikt (English Wiktionary) quote-* templates? This title is useless (conveys no information) and in many cases it actively interferes with other parameters or even leads to errors. Thanks! Benwing2 (talk) 06:40, 15 August 2023 (UTC)

Benwing2, for us to test our fix, please link us to an example of the bot inserting title parameters. Harej (talk) 20:37, 16 August 2023 (UTC)
Yes I think we need to see which templates are a problem (quote-* ?), where adding this title actually leads to errors and interferes with other parameters. A before and after diff would be good? -- GreenC (talk) 18:02, 10 September 2023 (UTC)
This section was archived on a request by: Harej (talk) 21:14, 9 October 2023 (UTC)

Bot creates link inside of link

The bot is still, at least sometimes, misplacing the webarchive template by inserting it into an existing external link, like this. The webarchive template needs to be placed after the external link is closed with a square bracket. Jonesey95 (talk) 13:21, 11 September 2023 (UTC)

I wonder if the trailing & in the URL is triggering the bug somehow. It's a benign "error" in the URL, only thing I see wrong with the data. -- GreenC (talk) 20:59, 11 September 2023 (UTC)
Here's another one, for troubleshooting purposes. The only characters in the link before the bot edited were alphanumeric and [/.:- ]. Jonesey95 (talk) 16:37, 15 September 2023 (UTC)
This is still happening (23 September 2023). There are no more "link in link" errors in article space. Please prevent the bot's code from making new ones. Jonesey95 (talk) 22:39, 23 September 2023 (UTC)
I have commented on the linked ticket. Harej (talk) 21:04, 27 September 2023 (UTC)
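The placement rule requested in this thread, template after the closing square bracket rather than inside the link, could be sketched as follows. The function name is hypothetical, and the sketch assumes a simple bracketed external link whose label contains no "]" character; it is not the bot's actual code.

```python
def append_after_external_link(wikitext, url, template):
    """Insert `template` after the closing bracket of the link that starts with `url`.

    Returns the text unchanged if no bracketed link for `url` is found.
    """
    start = wikitext.find("[" + url)
    if start == -1:
        return wikitext
    end = wikitext.find("]", start)
    if end == -1:
        return wikitext
    # Place the template after the bracket so it never lands inside the link.
    return wikitext[: end + 1] + " " + template + wikitext[end + 1 :]
```

Searching for the closing bracket from the start of the link, rather than inserting directly after the URL, is what keeps the template out of the link label.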
This section was archived on a request by: Harej (talk) 21:14, 9 October 2023 (UTC)

Please consider w:en:The Africa Report

The bot rescued(!) a dead link, but all it did was delete the tag. See this diff.

The site fails to open. Safari cannot find the server globalfinance.mu

It would be lovely if this link could be rescued. Timtrent (talk) 14:59, 26 September 2023 (UTC)

Thank you for your report Timtrent. We think this is a bug, so we have filed a report. Harej (talk) 20:47, 27 September 2023 (UTC)
This section was archived on a request by: Harej (talk) 21:14, 9 October 2023 (UTC)

No dead link

Hello @Cyberpower678:, the bot marks different pages on the website https://iwf.sport/ as dead links, for example the link to the following athlete's file:

https://iwf.sport/weightlifting_/athletes-bios/?athlete=hou-zhihui-1997-03-18&id=12208

But those links are live and the bot should not modify them (the website first asks if the person consulting is a living being, to avoid being bombarded by bots), for this reason the bot cannot open those links, but anyone can. Please modify this, so that the bot no longer modifies the links with the generic url

https://iwf.sport/weightlifting_/athletes-bios/

Thanks. Leonprimer (talk) 01:50, 27 September 2023 (UTC)

Leonprimer, as you've observed, the website checks for human traffic. For most websites that use Cloudflare DDoS protection, our bot has special permission to access those websites. However, if a given website has particularly strict settings, then not even our bot will be able to access this. To work around this, we have marked the iwf.sport domain as "permalive," so the bot will regard links to that website as being online. Harej (talk) 20:54, 27 September 2023 (UTC)
This section was archived on a request by: Harej (talk) 21:14, 9 October 2023 (UTC)

ntnews.com using language=te-IN? OR iabot is adding?

There are more than 2,500 pages with ntnews.com on te wiki. It seems that either IABot or the website is using the language code incorrectly. It should use only te, not te-IN; neither https://en.wikipedia.org/wiki/List_of_ISO_639-2_codes nor https://en.wikipedia.org/wiki/List_of_ISO_639-1_codes lists te-IN. Example 1: https://te.m.wikipedia.org/wiki/%E0%B0%AA%E0%B1%8D%E0%B0%B0%E0%B0%A4%E0%B1%8D%E0%B0%AF%E0%B1%87%E0%B0%95:MobileDiff/3990778 Example 2: https://te.m.wikipedia.org/wiki/%E0%B0%AA%E0%B1%8D%E0%B0%B0%E0%B0%A4%E0%B1%8D%E0%B0%AF%E0%B1%87%E0%B0%95:MobileDiff/3990777 Is IABot adding the language code? హరుడు (talk) 18:53, 28 September 2023 (UTC)

హరుడు, InternetArchiveBot hasn't edited either of these pages. Harej (talk) 20:08, 11 October 2023 (UTC)
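Whichever tool is writing the value, reducing a region-qualified tag such as te-IN to its primary language subtag is a small operation. The sketch below is an assumption about what a cleanup could look like, with a hypothetical function name, and is not something IABot is stated to do.

```python
def primary_language_subtag(tag):
    """Reduce a tag like 'te-IN' to its primary language subtag, 'te'."""
    return tag.strip().replace("_", "-").split("-", 1)[0].lower()
```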
This section was archived on a request by: Harej (talk) 01:01, 19 October 2023 (UTC)

Persistent 504 error

For a few days now, running "Analyze a page" has taken around 5 minutes or more before throwing "504 Gateway Time-out". Despite the 504 error, for unknown reasons, I would see in my contributions that certain runs were successful. However, today both a larger article like en:Blackpink, which has fewer than 300 references, and a smaller article like en:Roh Jeong-eui, which has only 11 references, throw the 504 error and do not show up as successful in my contributions. The only way I avoid the 504 error is by manually archiving every single source, adding the archived URLs to "Modify URL Data", and then running "Analyze a page", which completes within seconds. However, this clearly makes zero sense, because this tool is supposed to lessen editors' load, not increase it. Paper9oll (talk) 18:16, 29 September 2023 (UTC)

It looks like the archival runs for Blackpink, Roh Jeong-eui, and others have been made, but with a one-day delay. I'm not sure whether there are logs available on your end to cross-check why this happens so it can be fixed. Paper9oll (talk) 15:27, 30 September 2023 (UTC)
Paper9oll, we think we know what is happening, but not what's causing it. What's happening is that there are delays with the availability API, which checks for the availability of web archives, and when this system is delayed requests get backed up. This is where the delay is coming from, but we don't know what is causing this delay. We are still investigating. Harej (talk) 21:11, 11 October 2023 (UTC)
@Harej Thanks for the update. So far I have not re-encountered this issue. Paper9oll (talk) 08:57, 12 October 2023 (UTC)
This section was archived on a request by: Harej (talk) 01:01, 19 October 2023 (UTC)

Archiving

I tried to archive links for the article Pokémon but it appears that it is "too large for this tool." I also tried to submit a job request but it just analyzed the article and did not archive any links. Wingwatchers (talk) 04:38, 1 October 2023 (UTC)

Wingwatchers, when you submit a job request, it will not change all links, just the ones that are broken. That is because these edits are attributed to the bot and the bot is only authorized to fix broken links. To fix all links requires you to use the single-page tool but there is a size limit as you have found. We will increase this page size limit as soon as we address the issues in the section above. Harej (talk) 21:05, 11 October 2023 (UTC)
This section was archived on a request by: Harej (talk) 01:01, 19 October 2023 (UTC)

Technical problems with InternetArchiveBot

Hi, I am here to ask whether something can be done to correct InternetArchiveBot. I was checking this edit [14] on the page of Indi Hartwell, and I saw that the bot deleted the link to the archive website Ghost Archive and replaced it with an Internet Archive link, but the new archived link was broken and led nowhere, while the original one from Ghost Archive was working fine. I have also noticed that this is not the first time the bot has made this kind of edit: it replaces or "fixes" a Ghost Archive link, and instead of correcting it, it damages it or swaps the whole link for one that doesn't work. I have reverted the change for the reasons given here, but is there any fix that can be made so this stops happening? Greetings. TheBellaTwins1445 (talk) 22:53, 1 October 2023 (UTC)

Thank you for your report TheBellaTwins1445. You can follow updates on this report on Phabricator. We still need to add support for this archive provider. Harej (talk) 21:17, 11 October 2023 (UTC)
Thank you for the response, I will be checking on the updates for the case. Greetings. TheBellaTwins1445 (talk) 22:34, 11 October 2023 (UTC)
This section was archived on a request by: Harej (talk) 01:01, 19 October 2023 (UTC)

Is IABot having an error?

Hello. I haven't been able to use the bot for several days. Is there an error? Ariandi Lie Talk with me 16:52, 8 October 2023 (UTC)

@Ariandi Lie: https://guc.toolforge.org/?by=date&user=InternetArchiveBot Dušan Kreheľ (talk) 07:45, 9 October 2023 (UTC)
This section was archived on a request by: Harej (talk) 01:01, 19 October 2023 (UTC)

Spamming to last changes

In the last few days I have received hundreds of reports about this bot's activity. Please, could you configure it so that its activity is not reported to pages' followers? Thank you very much! Bojars (talk) 12:56, 11 October 2023 (UTC)

Bojars, make sure your recent changes filter is set up for "Human (not bot)" edits. The bot edits only show up without this filter. Harej (talk) 21:21, 11 October 2023 (UTC)
Harej, yes, I tried this, but (maybe I am wrong) after setting it up and reloading, the "Human (not bot)" setting was lost. --Bojars (talk) 06:53, 12 October 2023 (UTC)
This section was archived on a request by: Harej (talk) 01:01, 19 October 2023 (UTC)

Dealing with redirect

Hi, is it possible for the bot to recognise a specific redirect as equivalent to a dead link? My use case is the British Film Institute which this week has closed down all its individual webpages for movies, which now all redirect to an info page. For example, https://www2.bfi.org.uk/films-tv-people/4ce2b6e822633 (for the movie Action Stations) redirects to https://www.bfi.org.uk/this-page-no-longer-exists, as do all other per-film pages. In many cases the original page will be on the Internet Archive (in this case https://web.archive.org/web/20201030232522/https://www2.bfi.org.uk/films-tv-people/4ce2b6e822633), so the link could be replaced. Can the bot be configured to do this? Many thanks for any advice. Tobyhoward (talk) 09:45, 1 October 2023 (UTC)

This could be a good task for User:GreenC's bot. Harej (talk) 21:18, 11 October 2023 (UTC)
I can do this (Enwiki only). Opened a request at w:Wikipedia:Link_rot/URL_change_requests#bfi.org.uk_soft-404s please follow up there. -- GreenC (talk) 21:19, 11 October 2023 (UTC)
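The redirect-as-dead-link rule requested above could be sketched like this. It assumes the caller has already followed redirects and knows the final URL; the function name and the set of soft-404 targets are hypothetical, with the single target taken from the example in this thread.

```python
# Assumption: pages that a site redirects to when the original is gone.
SOFT_404_TARGETS = {"https://www.bfi.org.uk/this-page-no-longer-exists"}


def is_soft_404(original_url, final_url, soft_404_targets=SOFT_404_TARGETS):
    """Return True if a link redirected to a known "page no longer exists" page."""
    normalized = final_url.rstrip("/")
    # A link pointing directly at the info page is not a redirect, so it is not flagged.
    return original_url.rstrip("/") != normalized and normalized in soft_404_targets
```

A link flagged this way would then be treated like any other dead link, so an existing archived copy could be substituted.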

This request has been resolved mostly for now on enwiki. About 15k new archive URLs have been uploaded to IABot and marked dead, which will propagate to the other wikis. There is a lot more work to be done on Wikidata and other wikis, particularly with the BFI external link template used in a couple dozen wikis. It is beyond the scope of IABot, and probably me also. BFI has issues, more info at bottom of this discussion page. -- GreenC (talk) 16:09, 17 October 2023 (UTC)

This section was archived on a request by: Harej (talk) 16:03, 25 October 2023 (UTC)

IAB marked URL as dead, even though an archived copy was available

https://nl.wikipedia.org/w/index.php?title=Wind_Rose&diff=66078906&oldid=65892316

As you can see, it marked the URL to Masters of Rock as dead, which is kind of fair. But it didn't add the archived copy, even though it was available: http://web.archive.org/web/20190617195034/https://www.mastersofrock.cz/en/Wind-Rose

Seems like a bug to me. Mondo (talk) 09:12, 2 October 2023 (UTC)

Mondo, it's possible the bot did not add that archive link because it was not able to load it. I tried loading it in my browser just now and it would not load. If in the future that archival copy manages to load the bot will be able to add it as an archive link. Harej (talk) 21:27, 11 October 2023 (UTC)
Since I posted this, I have tried to load that archive link on several occasions in several browsers and it loads just fine. Is it so hard to believe the bot could have a bug? Mondo (talk) 11:11, 12 October 2023 (UTC)
Can confirm the Wayback Machine was having issues loading at the time we answered your request last week, but that isn't actually the point of the issue. What's going on here is that an editor borked the original URL on the template, which is very much dead, as it returns a 403, and that specific URL has no Wayback Machine copy. In other words, the Wayback snapshot you provided is technically different than the URL in the citation template, which is why the bot won't find it. I suggest stripping the accidental garbage from the end of the URL which appears to be a URL encoded snippet of the citation template itself. Right now the URL is https://www.mastersofrock.cz/en/Wind-Rose%20%7Ctitle=Wind%20Rose instead of https://www.mastersofrock.cz/en/Wind-Rose which actually loads, and is the original of the Wayback snapshot you suggested.
In short, there is no bot bug here, just a case of GIGO. —CYBERPOWER (Chat) 20:23, 18 October 2023 (UTC)
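
The cleanup suggested above can be sketched in Python as follows. This is an illustration only, not part of the bot. The helper name `strip_template_residue` is hypothetical, and it assumes that a pipe character (literal or encoded as `%7C`) never legitimately appears in the URL being cleaned.

```python
import re

# Matches optional whitespace (literal or %20) followed by a pipe
# (literal or %7C), i.e. the point where template syntax leaked into the URL.
_RESIDUE_START = re.compile(r"(?:%20|\s)*(?:%7C|\|)", re.IGNORECASE)


def strip_template_residue(url: str) -> str:
    """Cut a URL at the first leaked '|param=' fragment, if there is one."""
    return _RESIDUE_START.split(url, maxsplit=1)[0]
```

Applied to the URL currently in the citation, this returns https://www.mastersofrock.cz/en/Wind-Rose, which is the original of the Wayback snapshot. A URL without a pipe is returned unchanged.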
This section was archived on a request by: Harej (talk) 16:03, 25 October 2023 (UTC)

ruwiki

Hi, bot owner. Your bot makes many edits like this or this. Please stop it and revert these edits. 91.197.junr3170 (talk) 18:35, 3 October 2023 (UTC)

91.197.junr3170, can you describe what is wrong with these edits? Harej (talk) 21:22, 11 October 2023 (UTC)
ЛА-1978-№04 → %D0%9B%D0%90-1978-%E2%84%9604 91.197.junr3170 (talk) 19:40, 18 October 2023 (UTC)
91.197.junr3170, while percent-encoding is turned off for Russian Wikipedia, it is still required for URLs because of a limitation the bot has. A future version of the bot will address this. Harej (talk) 20:25, 18 October 2023 (UTC)
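
For context, the conversion reported above is ordinary UTF-8 percent-encoding. The Python sketch below reproduces it with the standard library; the two helper names are hypothetical, and this is not the bot's own code.

```python
from urllib.parse import quote, unquote


def to_percent_encoded(text: str) -> str:
    """Percent-encode every character outside the unreserved ASCII set (UTF-8)."""
    return quote(text, safe="")


def to_readable(text: str) -> str:
    """Decode percent-escapes back into readable Unicode text."""
    return unquote(text)
```

`to_percent_encoded("ЛА-1978-№04")` yields the encoded form quoted in the report, and `to_readable` reverses it, so the two spellings refer to the same resource.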
This section was archived on a request by: Harej (talk) 16:03, 25 October 2023 (UTC)

URL containing percent-encoded spaces not processed correctly

https://en.wikipedia.org/w/index.php?title=Bill_Robertson_(English_footballer)&curid=23002245&diff=1179729739&oldid=1164066991

Hello. The bot found an archive copy at https://web.archive.org/web/20160304000852/http://theblues.chelseafc.com/cgi-bin/playersearch.pl?Bill%20H%20ROBERTSON , but did not copy all of it to the archive-url parameter. Instead, it stopped at the first percent-encoded space and copied only https://web.archive.org/web/20160304000852/http://theblues.chelseafc.com/cgi-bin/playersearch.pl?Bill

I've undone it for now. Struway2 (talk) 07:55, 12 October 2023 (UTC)

Thank you for your report Struway2. We have filed a bug report on Phabricator. Harej (talk) 20:38, 18 October 2023 (UTC)
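
A truncation like the one reported here can be caught with a simple consistency check. The Python sketch below is only an illustration and says nothing about the cause of the bug. The function names are hypothetical, and it assumes the common Wayback URL shape with a 14-digit timestamp followed by the original URL.

```python
import re
from typing import Optional

# Assumption: standard Wayback form, https://web.archive.org/web/<14 digits>/<original URL>
_WAYBACK = re.compile(r"^https?://web\.archive\.org/web/(\d{14})/(.+)$")


def archived_target(archive_url: str) -> Optional[str]:
    """Return the original URL embedded in a Wayback URL, or None if the shape is unexpected."""
    match = _WAYBACK.match(archive_url)
    return match.group(2) if match else None


def archive_matches_source(archive_url: str, source_url: str) -> bool:
    """True only if the archive URL embeds exactly the cited source URL."""
    return archived_target(archive_url) == source_url
```

With the URLs from this report, the full archive URL passes the check against the cited source, while the truncated one ending in `playersearch.pl?Bill` fails it.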
This section was archived on a request by: Harej (talk) 16:03, 25 October 2023 (UTC)

Generated link marked as dead

Generated URL improperly marked as dead:

Glrx (talk) 16:47, 13 October 2023 (UTC)

Glrx, we have opened a ticket on Phabricator. Harej (talk) 20:47, 18 October 2023 (UTC)
This section was archived on a request by: Harej (talk) 16:03, 25 October 2023 (UTC)

InternetArchiveBot keeps adding blank archive save

Hi. Can InternetArchiveBot at English Wikipedia please stop adding this blank archive save to en:2020 NBL1 season? It is not a useful archive link, as it does not load the content of the page that was there in March 2020. I have checked all the saves of that page on the Wayback Machine and none of them are useful, as NBL1.com.au pages have traditionally not saved properly. Thanks. DaHuzyBru (talk) 02:01, 14 October 2023 (UTC)

DaHuzyBru, that archive URL has been removed from the database. Harej (talk) 20:50, 18 October 2023 (UTC)
This section was archived on a request by: Harej (talk) 16:03, 25 October 2023 (UTC)

Article size limit redux

Hi @Harej! I see that phab:T342168 has been resolved. Does that mean it's now possible to increase the size limit of articles that the bot can handle? {{u|Sdkb}}talk 03:19, 14 October 2023 (UTC)

Sdkb, we have just now removed the limit. Thank you for checking in! Harej (talk) 21:05, 18 October 2023 (UTC)
Fantastic; thanks! Cheers, {{u|Sdkb}}talk 22:27, 18 October 2023 (UTC)
This section was archived on a request by: Harej (talk) 16:03, 25 October 2023 (UTC)

Percent encoding of URLs

Hello. Do not convert URLs into these endless gibberish things, please. Sneeuwschaap (talk) 13:02, 14 October 2023 (UTC)

Sneeuwschaap, unfortunately, while percent-encoding of article text is turned off for Russian Wikipedia, it is still required for URLs because of a limitation the bot has. A future version of the bot will address this. Harej (talk) 21:06, 18 October 2023 (UTC)
This section was archived on a request by: Harej (talk) 16:03, 25 October 2023 (UTC)

On the Esperanto Wikipedia, the bot sometimes adds archive URLs when they are already present

This results in multiple pointers to the same archive page appearing, when there really only should be one.

Here is an example (which I later manually corrected): Rozalia Zamenhof Mayhair (talk) 06:12, 15 October 2023 (UTC)

Mayhair, we have adjusted the template configuration on our end to also accept English-language parameters. However, I'd like to note the "cite web" call on that page is using Esperanto-language parameters even though it only has English parameters. I would recommend updating that template call to use the Esperanto variant of that template. Harej (talk) 21:18, 18 October 2023 (UTC)
This section was archived on a request by: Harej (talk) 16:03, 25 October 2023 (UTC)

Time out

The bot has timed out today and yesterday. TwoScars (talk) 21:19, 15 October 2023 (UTC)

TwoScars, we've been trying to reproduce the underlying issue but unfortunately have not been able to. If you have any additional information about when the bot times out, such as loading a particular page, that would be useful. If it just occasionally goes offline without anything in particular prompting it to, we are still looking into that. Harej (talk) 01:00, 19 October 2023 (UTC)
I just got it to work. It took a while, but did not time out. The file was User:TwoScars/sandbox. Also got it to work on Mambourg Glass Company. TwoScars (talk) 16:41, 19 October 2023 (UTC)
This section was archived on a request by: Harej (talk) 16:03, 25 October 2023 (UTC)

Rollback or ..

Hi. The bot marked a link as dead in some articles in 2021. The link is not dead now; how can these edits be rolled back?

Example: "SEDS" link in NGC_6246

You can find a list of these articles; the first four characters of each name are the same: "NGC ". Bikar (talk) 11:28, 12 October 2023 (UTC)

Bikar, our scan logs don't go back to 2021 so we could not tell you the context there. I recommend keeping the archive links in place in case the website goes down again. If it went down a first time it could go down again. Harej (talk) 20:41, 18 October 2023 (UTC)
Can you explain to me why I can't archive the "SEDS" link via the bot? Example: NGC 7142 Bikar (talk) 20:46, 18 October 2023 (UTC)
Bikar, the link is outside of the references, so the single-page tool will not touch it. Harej (talk) 20:06, 25 October 2023 (UTC)
This section was archived on a request by: Harej (talk) 15:57, 2 November 2023 (UTC)

Bot down since this morning

Hello team! I went to run the archive bot for an article and found that the site is currently returning a 500 error. I double-checked on "Down for Everyone or Just Me" and it does not appear to be exclusive to me. Sock (talk) 17:04, 23 October 2023 (UTC)

Thank you for your report, Sock. We are still working on figuring out the underlying cause; we have been unable to exactly reproduce what causes it. Harej (talk) 20:08, 25 October 2023 (UTC)
This section was archived on a request by: Harej (talk) 15:57, 2 November 2023 (UTC)

How do I ask the bot not to edit pages