
User talk:InternetArchiveBot

From Meta, a Wikimedia project coordination wiki



Connect with the developers and other users

Telegram, IRC (irc.libera.chat #iabot)

Operation status

For the most up-to-date information, see the run pages or the Wiki Operations Summary on Airtable.

  • 🟢 InternetArchiveBot is currently running on 300+ Wikimedia wikis.
  • 🟢 We have moved the management interface to a new server. Please start using iabot.wmcloud.org instead of iabot.toolforge.org. Please let us know if anything broke during this process.
  • 🟡 Testing is stalled on Alemannisch Wikipedia (als), Asturian Wikipedia (ast), and Japanese Wikipedia (ja).
  • 🔴 Bot is approved but disabled indefinitely pending software improvements on German Wikipedia (de), French Wikipedia (fr), MediaWiki.org, Norwegian Nynorsk Wikipedia (nn), Polish Wikipedia (pl), and Portuguese Wikipedia (pt).

Last updated: 15:02, 3 June 2024 (UTC)

How this page works

  1. Ask your question in any language. Questions in English or German will receive the fastest responses.
  2. Our team will try to respond within seven days.
  3. Seven days after our response we will mark the thread as resolved. This queues the thread for archiving.
    If our response does not answer your question, you are welcome to remove the "section resolved" tag and write an additional comment.
  4. Seven days after the thread is marked as resolved, it will be archived. Once a thread is archived, it should not be un-archived. Instead, create a new thread and link to the old one.


SpBot archives all sections tagged with {{Section resolved|1=~~~~}} after 7 days.
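
The timing rules above can be sketched in a few lines. This is only an illustration of the seven-day intervals described on this page, not SpBot's or the team's actual tooling; the function name `thread_timeline` and the example date are assumptions.

```python
from datetime import date, timedelta

# Both waiting periods on this page are seven days long.
WAIT = timedelta(days=7)


def thread_timeline(response_date: date) -> dict[str, date]:
    """Return the earliest dates a thread is marked resolved and archived.

    Assumes nobody removes the "section resolved" tag in the meantime.
    """
    resolved = response_date + WAIT   # step 3: marked as resolved
    archived = resolved + WAIT        # step 4: archived
    return {"resolved": resolved, "archived": archived}


timeline = thread_timeline(date(2024, 4, 3))
print(timeline["resolved"], timeline["archived"])
```

In other words, a thread answered on a given day is archived no sooner than fourteen days later.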


Inaccessible link

IABot “fixed” a link that it reported as inaccessible here: https://nl.wikipedia.org/w/index.php?title=Chora_%28Patmos%29&diff=66511300&oldid=66454976

However, the link works fine with http on my end. Now I do agree that https is safer (although in this case it was hardly an improvement), but that's no reason to treat a link as “inaccessible”. Mondo (talk) 18:15, 13 December 2023 (UTC)

Hello Mondo. The bot did not necessarily declare the link inaccessible, though the edit summary would indicate that because the bot's edit summaries are very imprecise. The bot upgrades HTTP links to HTTPS where possible, separately from its process of fixing dead links. Harej (talk) 18:20, 13 December 2023 (UTC)
Hello Harej, in that case, it's against the guidelines of the Dutch Wikipedia. We have the guideline “bij twijfel niet inhalen”, which is similar to the one on EN:WP called If it ain't broke, don't fix it, except that ours is much more detailed. The link was not broken and https hardly made a difference with this specific link, therefore it was against the guideline. I have reverted IABot and added the article to the deny list, but I hope this can be fixed, because this will happen again on other pages. Mondo (talk) 18:23, 13 December 2023 (UTC)
.
3750 2409:4081:2E1B:10CF:C8EC:865C:203E:844A 06:20, 16 April 2024 (UTC)

It's not resolved. I explained what the issue was back in December and nothing has changed. Mondo (talk) 19:27, 3 April 2024 (UTC)

Mondo, as explained above, it is our practice to replace HTTP with HTTPS on all wikis, and we are not changing that. Continuing to remove the "section resolved" template will not change this. If changing HTTP to HTTPS is in fact against policy, please cite the policy. Harej (talk) 20:09, 3 April 2024 (UTC)
If you wanted me to cite the policy, it would've been nice to know that when I posted my last comment instead of not responding to me for months. But here you go:

https://nl.wikipedia.org/wiki/Wikipedia:Bij_twijfel_niet_inhalen
“De ene goede variant door de andere goede variant vervangen is geen verbetering of verslechtering, maar een neutrale bewerking. Dergelijke bewerkingen zijn ongewenst”

Which translates to: “Replacing one good variant with another good variant is neither an improvement nor a deterioration, but a neutral edit. Such edits are undesirable.”

Replacing http with https is exactly that: http works fine, i.e. it's a good variant, which makes it against policy. Now I could see it being somewhat useful if it's a URL where security is of the utmost importance, but in this case it's a link to a spreadsheet file. There's nothing that https will do to protect the user in this case. (Or if the http link was dead and replaced with https.) Mondo (talk) 20:18, 3 April 2024 (UTC)

Your tool docs are featured as an example in the new Tool Docs Guide!

Hello InternetArchiveBot maintainers, contributors, and fans! I wanted to let you know that I highlighted the InternetArchiveBot documentation as a shining example in the new Tool Docs guide that I just published. Thank you for creating lovely tool documentation that can serve as an example to help others create and improve tool docs :-) This guide was created as part of the Doc Your Tool project for the upcoming 2024 Hackathon. If you're interested, please join that project to work on or talk about tool documentation during the hackathon! TBurmeister (WMF) (talk) 16:52, 16 April 2024 (UTC)

That's exciting to hear, thank you! Harej (talk) 21:59, 24 April 2024 (UTC)

The bot keeps adding an archive link where it isn't required

Hello, the bot always tries to add this link, but it isn't needed. It has happened more than three times and I had to revert the change every time. https://web.archive.org/web/20211012034604/https://incubator.wikimedia.org/w/index.php?hidebots=1&translations=filter&hidecategorization=1&hideWikibase=1&limit=50&days=3&title=Special%3ARecentChanges&testwiki=wp%2Fryu&urlversion=2

The unwanted modifications occur on this page: https://incubator.wikimedia.org/wiki/Wp/ryu/%E3%83%A1%E3%82%A4%E3%83%B3%E3%83%9A%E3%83%BC%E3%82%B8

And this is an example of the unwanted modification. https://incubator.wikimedia.org/w/index.php?title=Wp/ryu/%E3%83%A1%E3%82%A4%E3%83%B3%E3%83%9A%E3%83%BC%E3%82%B8&diff=prev&oldid=6254326 Patronus95 (talk) 04:51, 2 May 2024 (UTC)

I just looked and this seems to have fixed itself. Is there anything more you need me to look at? —CYBERPOWER (Chat) 14:54, 15 May 2024 (UTC)

Archive.ph → Archive.today

Tracked in Phabricator:
Task T361746

https://nl.wikipedia.org/w/index.php?title=Patreon&diff=next&oldid=66920330

and

https://nl.wikipedia.org/w/index.php?title=Prog_(tijdschrift)&diff=next&oldid=66920158

But archive.ph is the same service and the link with ph works fine. This is again a clear violation of the Dutch version of the “if it ain't broke, don't fix it” guideline, just like the most recent time we spoke. Mondo (talk) 20:11, 1 February 2024 (UTC)

Mondo, a bug report has been filed. Harej (talk) 20:29, 3 April 2024 (UTC)
Thank you. 🙂 Mondo (talk) 20:38, 3 April 2024 (UTC)
I replied in the Phab task giving the technical reason: it's done for functional reasons, not cosmetic ones. archive.today is a special domain that is functionally more reliable than the other ones, and it's also the domain the owners of archive.today requested we use on Wikipedia as a safeguard against potential future outages. -- GreenC (talk) 14:42, 4 April 2024 (UTC)
They can request whatever they want, but at least on the Dutch Wikipedia, changes at the request of owners are seen as an unwanted change and even without their request it's seen as an unwanted change, so something still needs to be done about it. Mondo (talk) 14:57, 4 April 2024 (UTC)
Besides, it looks like the bot doesn't even care for archive.today that much anyway, as it just changed an archive.today URL to archive.is: https://nl.wikipedia.org/w/index.php?diff=prev&oldid=67337586 (the second highlighted reference). I used IABot for this. Mondo (talk) 19:56, 7 April 2024 (UTC)

I am disabling the bot indefinitely on Dutch Wikipedia until this is addressed. Harej (talk) 18:41, 10 May 2024 (UTC)

Thank you for taking action, I really appreciate that. 🙂 Mondo (talk) 19:14, 10 May 2024 (UTC)
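
For readers following this thread, the kind of domain rewriting being discussed could look roughly like the sketch below. It is not IABot's code: the function name `normalize_archive_host` is hypothetical, and the alias set contains only the domains mentioned in this thread (archive.ph and archive.is), with archive.today as the target named by GreenC.

```python
from urllib.parse import urlsplit, urlunsplit

# Assumption: only the aliases named in this thread are listed here.
ALIASES = {"archive.ph", "archive.is"}
CANONICAL = "archive.today"


def normalize_archive_host(url: str) -> str:
    """Rewrite an alias host to the canonical one; leave other URLs alone."""
    parts = urlsplit(url)
    if parts.hostname in ALIASES:
        # Replacing netloc drops any port or userinfo, which is
        # acceptable for this illustration.
        parts = parts._replace(netloc=CANONICAL)
    return urlunsplit(parts)


print(normalize_archive_host("https://archive.ph/abc12"))
```

A wiki that treats such rewrites as unwanted neutral edits would need a per-wiki switch to skip this step; whether the bot offers one is not stated in this thread.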

Can't archive

Hi IAB admins, so I'm having an issue. With one of my articles, en:Aston Martin Rapide, I've been trying to archive the sources, but it comes up with this. No links were analyzed for some reason. Any reason as to why? 750h+ (talk) 05:08, 25 May 2024 (UTC)

I can't see what you are referring to. The image is not showing for me.—CYBERPOWER (Chat) 21:30, 29 May 2024 (UTC)

The bot wrongly claims links are dead

Hi! The bot keeps claiming that links in the following article are dead, but they are ok, as far as I see. https://el.wikipedia.org/wiki/%CE%95%CE%B8%CE%BD%CE%B9%CE%BA%CE%AE_%CE%95%CE%BB%CE%BB%CE%AC%CE%B4%CE%B1%CF%82_(%CE%A6%CE%B5%CE%BD%CF%84_%CE%9A%CE%B1%CF%80) Can you do something, please? Thank you. --Harry Deconstructing (talk) 12:11, 29 May 2024 (UTC)

Can you please provide more concrete examples?—CYBERPOWER (Chat) 21:32, 29 May 2024 (UTC)
If you check the references, most of the links are deemed dead. However, they work. For example:
Reference No 25: Greece - Mexico 1 - 2 (1983)[νεκρός σύνδεσμος] (the tag means “dead link”)
The link is https://www.billiejeankingcup.com/en/draws-and-results/W-FC-1983-WG-M-GRE-MEX-01?matchId=itf_2610164d79ebc202150c3ed3669cb0b6 Harry Deconstructing (talk) 00:32, 30 May 2024 (UTC)

Category:CS1 maint: url-status at EN Wikipedia

Hello, I was wondering if InternetArchiveBot could go through CS1_maint:_url-status on EN Wikipedia. I checked some of them from the list. Articles like Alan Barinholtz and Football at the 2024 Summer Olympics have a working URL and don't need |url-status=live. So far, I haven't seen a |url-status=dead parameter that's missing an archived URL and archive date. There are over 2,000 articles to check. Thanks! MrLinkinPark333 (talk) 23:33, 30 May 2024 (UTC)

False positive dead link

Hi, https://geoportal.rsd.cz (see https://cs.wikipedia.org/w/index.php?title=D%C3%A1lnice_D1&diff=prev&oldid=23963893) is not dead; maybe it is just geo-restricted. --Harold (talk) 15:53, 31 May 2024 (UTC)

Ambiguous message

Hi,

There's this message:

Once the search results load, which can take time, select the domains it found from the list that you want to modify and push "Submit" on the bottom of the page.

What does "that you want to modify" refer to? The domains or the list?

If it's the domains, could it perhaps be written like this:

Once the search results load, which can take time, select the domains it found and that you want to modify from the list and push "Submit" on the bottom of the page.

I don't know the tool well, and I'm just guessing and it's possible that I'm wrong :) Amir E. Aharoni (talk) 19:56, 1 June 2024 (UTC)

A message with "from all wikis"

There's this message:

All citation templates are listed here from all wikis. Format it as if you are transcluding the template, and put new templates on a new line. Failure to follow correct formatting may break the bot.

What does "from all wikis" mean here?

Could it perhaps be rephrased as "All citation templates from all wikis are listed here."? Amir E. Aharoni (talk) 19:44, 2 June 2024 (UTC)

non-critical DB

Hello again :)

There's this message:

A non-critical DB has returned an error. This may have minor impacts on reliability.<br>Error {{errno}}: {{errormessage}}

It may be correct, but I wanted to make sure: Is it really a non-critical DB? Like, is there a critical DB and a non-critical DB?

I'm asking because the same sentence also mentions error, and it's much more common in software user interfaces that the errors are critical or non-critical, and not the DBs.

If the message is correct as is, everything's fine :) Amir E. Aharoni (talk) 22:19, 3 June 2024 (UTC)

Weird "%20" added to archive URL

Hello, I'm relatively short on time at the moment due to being on holiday among other things, but in this edit on the English Wikipedia, the bot took the URL http://www.washingtontimes.com/news/2010/feb/27/us-clinches-medals-total-canada-most-golds/ and added an archive link https://web.archive.org/web/20181225175051/https://www.washingtontimes.com/news/2010/feb/27/us-clinches-medals-total-canada-most-golds/%20/ (which doesn't work). The correct archive link should be https://web.archive.org/web/20190203234549/https://www.washingtontimes.com/news/2010/feb/27/us-clinches-medals-total-canada-most-golds/ but when I try to modify the URL data in InternetArchiveBot, it says that the URL doesn't match the original link. Graham87 (talk) 19:49, 6 June 2024 (UTC)
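
As an illustration of the problem described above, the sketch below strips a trailing encoded space ("%20", optionally followed by a slash) from the original-URL part of a Wayback Machine link. It is a guess at a possible cleanup step, not the bot's actual logic; the function name `strip_trailing_space` and the example.com URL are hypothetical, and the sketch assumes the usual `web.archive.org/web/<14-digit timestamp>/<original URL>` layout.

```python
import re

# Prefix with a 14-digit timestamp, followed by the original URL.
WAYBACK = re.compile(r"^(https?://web\.archive\.org/web/\d{14}/)(.+)$")


def strip_trailing_space(archive_url: str) -> str:
    """Remove a trailing "%20" (and a slash after it) from the archived URL."""
    match = WAYBACK.match(archive_url)
    if match is None:
        return archive_url  # not a Wayback link in the assumed layout
    prefix, original = match.groups()
    cleaned = re.sub(r"(?:%20)+/?$", "", original)
    return prefix + cleaned


bad = "https://web.archive.org/web/20181225175051/https://example.com/a/%20/"
print(strip_trailing_space(bad))
```

This only repairs the string; it does not check that a snapshot exists for the cleaned URL at that timestamp, so the result would still need to be verified against the archive.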

A few more message corrections

Hi!

Something slightly different this time.

I've completed the translation of the bot into Hebrew on translatewiki. Along the way, I sent a few more message corrections. There are now six pull requests at GitHub. It would be nice to review them.

Thanks! :) Amir E. Aharoni (talk) 20:40, 7 June 2024 (UTC)

Blank page

Hi and thank you for your bot. I am running it on a giant article that used to take maybe 5 minutes. Now after a couple minutes the screen goes blank and doesn't recover. I guess due to lack of patience I ended up with three different IABot edits. You need a better way to tell the user when the bot is done. -SusanLesch (talk) 23:33, 7 June 2024 (UTC)