Talk:Spam blacklist

From Meta, a Wikimedia project coordination wiki
Requests and proposals Spam blacklist Archives (current)→
The associated page is used by the MediaWiki Spam Blacklist extension, and lists regular expressions which cannot be used in URLs in any page in Wikimedia Foundation projects (as well as many external wikis). Any meta administrator can edit the spam blacklist, either manually or with SBHandler. For more information on what the spam blacklist is for, and the processes used here, please see Spam blacklist/About.
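As a rough illustration of how a list of regular expressions can be applied to URLs, here is a minimal Python sketch. It is not the extension's actual code: the URL prefix that the lines are wrapped in, and the names `BLACKLIST_LINES`, `build_matcher` and `is_blocked`, are assumptions made for this example. The entry `\bbible\-history\.com\b` is quoted from the discussion further down this page; `\bexample\.com\b` is a hypothetical entry.

```python
import re

# Example entries: the first is hypothetical, the second is quoted from
# the discussion on this page.
BLACKLIST_LINES = [
    r"\bexample\.com\b",
    r"\bbible\-history\.com\b",
]


def build_matcher(lines):
    """Join all entries into one case-insensitive regex.

    Assumption: entries are only tested against the host part of a URL,
    so they are wrapped in a scheme-and-host prefix.
    """
    joined = "|".join(lines)
    return re.compile(r"https?://+[a-z0-9_\-.]*(" + joined + ")", re.IGNORECASE)


def is_blocked(url, lines=BLACKLIST_LINES):
    """Return True if the URL matches any blacklist entry."""
    return build_matcher(lines).search(url) is not None


if __name__ == "__main__":
    for url in ("http://www.example.com/page", "http://notexample.com/"):
        print(url, is_blocked(url))
```

The `\b` word boundaries are what keep an entry for example.com from also catching a host such as notexample.com, while still matching subdomains like www.example.com.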
Proposed additions
Please provide evidence of spamming on several wikis and prior blacklisting on at least one wiki. Spam that only affects a single project should go to that project's local blacklist. Exceptions include malicious domains and URL redirector/shortener services. Please follow this format. Please check back after submitting your report; there could be questions regarding your request.
Proposed removals
Please check our list of requests which repeatedly get declined. Typically, we do not remove domains from the spam blacklist in response to site-owners' requests. Instead, we de-blacklist sites when trusted, high-volume editors request the use of blacklisted links because of their value in support of our projects. Please consider whether requesting whitelisting on a specific wiki for a specific use is more appropriate - that is very often the case.
Other discussion
Troubleshooting and problems - If there is an error in the blacklist (e.g. a regex error) which is causing problems, please raise the issue here.
Discussion - Meta-discussion concerning the operation of the blacklist and related pages, and communication among the spam blacklist team.
#wikimedia-external-links - Real-time IRC chat for co-ordination of activities related to maintenance of the blacklist.
Whitelists
There is no global whitelist. If you are seeking whitelisting of a URL at a particular wiki, please raise it on the respective Mediawiki talk:Spam-whitelist page at that wiki, and consider using the template {{edit protected}} or its local equivalent to draw attention to your request.

Please sign your posts with ~~~~ after your comment. This leaves a signature and timestamp so conversations are easier to follow.


Completed requests are marked as {{added}}/{{removed}} or {{declined}}, and are generally archived quickly. Additions and removals are logged · current log 2020/05.

Projects

snippet for logging
{{sbl-log|20120882#{{subst:anchorencode:SectionNameHere}}}}

Proposed additions

This section is for proposing that a website be blacklisted; add new entries at the bottom of the section, using the basic URL so that there is no link (example.com, not http://www.example.com). Provide links demonstrating widespread spamming by multiple users on multiple wikis. Completed requests will be marked as {{added}} or {{declined}} and archived.

Proposal to add Lenta.ru to the global spam blacklist



Lenta.ru was added to the English Wikipedia spam blacklist earlier this year. Over the years, Lenta.ru has published fake news, conspiracy theories and propaganda, just like Alex Jones' InfoWars (The Guardian). I think that Lenta.ru should be added to the global spam blacklist in order to prevent COVID-19-related conspiracy theories. 58.230.133.106 22:43, 20 May 2020 (UTC)

There would appear to be 38,082 uses across the wikis if global search tells me the truth. I do not think that this can be determined with a simple conversation here. It would require numbers of wikis to independently start blacklisting it and reach the criteria expressed above; or a sizeable groundswell of opinion that the domain is problematic.  — billinghurst sDrewth 22:48, 20 May 2020 (UTC)

Proposed additions (Bot reported)

This section is for domains which have been added to multiple wikis as observed by a bot.

These are automated reports; please check the records and the link thoroughly, as the bot may report good links! For some more info, see Spam blacklist/Help#COIBot_reports. Reports will automatically be archived by the bot when they get stale (fewer than 5 links reported, which have not been edited in the last 7 days, and where the last editor is COIBot).
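The staleness rule stated above can be sketched as a simple predicate. This is only an illustration of the three conditions as worded here, not COIBot's own code; the `Report` record and its field names are hypothetical.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class Report:
    """Hypothetical summary of one bot report."""
    links_reported: int
    last_edited: datetime
    last_editor: str


def is_stale(report, now):
    """True when all three archiving conditions from the text hold."""
    return (
        report.links_reported < 5                            # under 5 links reported
        and now - report.last_edited >= timedelta(days=7)    # no edits in the last 7 days
        and report.last_editor == "COIBot"                   # last editor is the bot
    )


if __name__ == "__main__":
    now = datetime(2020, 5, 29)
    print(is_stale(Report(3, datetime(2020, 5, 1), "COIBot"), now))
```

A report that a human has commented on fails the last condition, so it is left in place until someone closes it.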

Sysops
  • If the report contains links to fewer than 5 wikis, then only add it when it is really spam
  • Otherwise just revert the link-additions, and close the report; closed reports will be reopened when spamming continues
  • To close a report, change the LinkStatus template to closed ({{LinkStatus|closed}})
  • Please place any notes in the discussion section below the HTML comment

COIBot

The LinkWatchers report domains meeting the following criteria:

  • When a user mainly adds this link, and the link has not been used too much, and this user adds the link to more than 2 wikis
  • When a user mainly adds links on one server, and links on the server have not been used too much, and this user adds the links to more than 2 wikis
  • If ALL links are added by IPs, and the link is added to more than 1 wiki
  • If a small range of IPs have a preference for this link (but it may also have been added by other users), and the link is added to more than 1 wiki.
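Of the criteria above, only the third is stated precisely enough to write down directly; the others depend on thresholds ("mainly", "too much", "small range") that this page does not quantify. The sketch below therefore covers the third criterion alone. The `LinkAddition` record and the function names are hypothetical, and this is not the LinkWatchers' actual implementation.

```python
import ipaddress
from dataclasses import dataclass


@dataclass(frozen=True)
class LinkAddition:
    """Hypothetical record of one addition of the link."""
    user: str   # user name or IP address of the editor
    wiki: str   # wiki where the link was added


def is_ip(user):
    """True if the editor name parses as an IPv4 or IPv6 address."""
    try:
        ipaddress.ip_address(user)
        return True
    except ValueError:
        return False


def all_ip_multiwiki(additions):
    """Third criterion: ALL additions are by IPs, on more than 1 wiki."""
    if not additions:
        return False
    wikis = {a.wiki for a in additions}
    return all(is_ip(a.user) for a in additions) and len(wikis) > 1


if __name__ == "__main__":
    sample = [
        LinkAddition("178.66.99.51", "ru.wikipedia"),
        LinkAddition("95.54.103.156", "en.wikipedia"),
    ]
    print(all_ip_multiwiki(sample))
```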
COIBot's currently open XWiki reports
List | Last update | By | Site IP | R | Last user | Last link addition | Counts (User, Link, User - Link, User - Link - Wikis, Link - Wikis)
amp.onedio.com | 2020-05-29 01:44:20 | COIBot | 104.16.229.51 | | 151.135.163.243; 178.66.99.51; 178.66.99.91; 217.66.158.186; 95.54.103.156 | 2070-01-01 05:00:00 | 8 2
axismf.com | 2020-05-29 00:15:50 | COIBot | 202.78.251.186 | | Amovingpixel; Bigstory1; Rupika08; Swarupsengupta2007; 122.160.111.251 | 2070-01-01 05:00:00 | 10 2
codingplayground.blogspot.it | 2020-05-29 00:57:25 | COIBot | 172.217.15.97 | R | Jumpow; Tökömmá; 217.133.116.99; 79.46.199.15; 82.105.245.49; 82.188.224.250 | 2070-01-01 05:00:00 | 8 6
cupcakes 2.fandom.com | 2020-05-29 00:01:12 | COIBot | X | R | Deanhansen2 | 2070-01-01 05:00:00 | 12 0 0 0 0
deza-par.com | 2020-05-28 21:12:21 | COIBot | 95.130.171.85 | | Mazazaye | 2070-01-01 05:00:00 | 6 5 0 0 2
drckeener.googlepages.com | 2020-05-28 11:55:24 | COIBot | 172.217.15.115 | R | Jhonnata Cabral; TMDrew | 2070-01-01 05:00:00 | 2 2
fos-kastoria.blogspot.bg | 2020-05-28 18:29:51 | COIBot | 172.217.15.97 | R | Мико; Мико | 2070-01-01 05:00:00 | 13 2
gaysaltlake.com | 2020-05-29 01:39:07 | COIBot | 74.208.236.212 | R | Maher (Beit al Hikma); Nathan 02 | 2070-01-01 05:00:00 | 64 7
gorefest.nl | 2020-05-29 00:46:49 | COIBot | 0.0.0.0 | R | NicoGoro90 | 2070-01-01 05:00:00 | 185 20 0 0 7
happinessday.org | 2020-05-29 01:33:12 | COIBot | 151.101.130.159 | | 2601:0:B080:1D4:D69A:20FF:FE5E:7B06; 2601:C2:200:B30:B5FB:60C5:6A0D:9F43; 2A02:908:1A0:91A0:1DEE:2D0E:1766:4B26; Evil berry; Hungchaka; Juniperusco; LilyKitty; Me, Myself, and I are Here; Nuxnap; Reme Semopla; Sati010; Սահակ; 178.49.148.100; 69.203.149.29; 76.97.64.56 | 2070-01-01 05:00:00 | 50 12
haubenhuehner-seltene-huehnerrassen.blogspot.de | 2020-05-27 21:43:47 | COIBot | 172.217.15.97 | R | Andreas Franziskus | 2070-01-01 05:00:00 | 35 4 0 0 2
israblog.org | 2020-05-28 05:36:45 | COIBot | 192.117.165.189 | | Calc 19 | 2070-01-01 05:00:00 | 8 8 0 0 3
janakilenin.blogspot.ru | 2020-05-28 23:54:37 | COIBot | 172.217.15.97 | R | Պետրոսյան Անահիտ | 2070-01-01 05:00:00 | 963 3 0 0 2
kajankajan.blogspot.rs | 2020-05-28 21:43:21 | COIBot | 172.217.15.97 | R | Качуровська | 2070-01-01 05:00:00 | 2503 2 0 0 2
memorialspaceflights.com | 2020-05-28 21:29:27 | COIBot | 52.39.234.239 | R | FNAFPUPPETMASTER; SombreHéros | 2070-01-01 05:00:00 | 12 5
remzltd.com | 2020-05-28 23:49:51 | COIBot | 104.18.49.165 | | 217.63.121.150; 84.53.244.123; 89.191.231.87 | 2070-01-01 05:00:00 | 6 2
ricosylibres.com | 2020-05-28 10:06:50 | COIBot | 91.134.184.247 | | 37.222.212.143; 80.224.122.158; 84.78.240.110; 84.78.240.175; 84.78.241.8 | 2070-01-01 05:00:00 | 7 2
thetribeug.blogspot.ug | 2020-05-29 00:18:22 | COIBot | 172.217.15.97 | R | Pierre André Leclercq | 2070-01-01 05:00:00 | 0 5 0 0 2
toponimograncanaria.blogspot.fr | 2020-05-29 00:32:49 | COIBot | 172.217.15.97 | R | Silvia Scanu | 2070-01-01 05:00:00 | 17 4 0 0 2
ustaz-nik-mohd-zawawi.blogspot.my | 2020-05-28 11:20:57 | COIBot | 172.217.15.97 | R | PutraOsakaTokyo | 2070-01-01 05:00:00 | 99 4 0 0 2
varianzasvenezuela.blogspot.no | 2020-05-29 00:12:47 | COIBot | 172.217.15.97 | R | Orgullomoore | 2070-01-01 05:00:00 | 1266 8 0 0 2
xn----7sba0bce7bg3c.xn--p1ai | 2020-05-29 02:00:48 | COIBot | 212.92.101.5 | | 5.166.98.231; 95.105.41.190 | 2070-01-01 05:00:00 | 16 2

Proposed removals

This section is for proposing that a website be unlisted; please add new entries at the bottom of the section.

Remember to provide the specific domain blacklisted, links to the articles they are used in or useful to, and arguments in favour of unlisting. Completed requests will be marked as {{removed}} or {{declined}} and archived.

See also recurring requests for repeatedly proposed (and refused) removals.

Notes:

  • The addition or removal of a domain from the blacklist is not a vote; please do not bold the first words in statements.
  • This page is for the removal of domains from the global blacklist, not for removal of domains from the blacklists of individual wikis. For those requests please take your discussion to the pertinent wiki, where such requests would be made at Mediawiki talk:Spam-blacklist at that wiki. Search spamlists — remember to enter any relevant language code

newmail.ru



Two third-level domains were proposed, viagra.newmail.ru and phentermine.newmail.ru; it is not clear why other web pages should suffer, and I could not find any arguments for it. Macuser (talk) 16:13, 13 May 2020 (UTC)

@Macuser: They were clearly blocked a long while ago. You can seek local whitelisting of the domains at the wiki of interest. Can you tell us why you think that the domains are useful to Wikimedia sites? There is nothing evident at the sites themselves to show that they are needed and reliable sites that would be used by the Wikimedia communities, and there is no general path access to look at their value. Removal from the blacklist needs an argument for removal.  — billinghurst sDrewth 22:36, 13 May 2020 (UTC)
newmail.ru is dead, and there is no evidence that it will be linked for spam. A local history site for Tiversk was hosted there, with a quite irreplaceable collection of Soviet-era newspaper headlines. Macuser (talk) 22:56, 13 May 2020 (UTC)
@Macuser: if there is pertinent information it is likely best to whitelist the specific document (or document tree or subdomain). --Dirk Beetstra T C (en: U, T) 13:54, 14 May 2020 (UTC)

Discussion

This section is for discussion of Spam blacklist issues among other users.

Rename?

There have been periodic discussions over renaming this feature, in part because not all blacklisted links are in fact spam (e.g. URL shorteners). In the light of the recent announcement by the UK's National Cyber Security Centre that it will no longer use the terms blacklist and whitelist, I think it might be worth considering the one-time disruption that would be caused by a change to something like "external link deny list". JzG (talk) 13:51, 4 May 2020 (UTC)

Think that the whole conversation belongs on mediawikiwiki, maybe at mw:Extension talk:SpamBlacklist or in a phabricator: ticket. Probably phabricator if you want to get the attention of mediawiki developers.  — billinghurst sDrewth 14:02, 4 May 2020 (UTC)
That is where it would go from here, but I would first like to establish whether there is consensus that this is a good idea. JzG (talk) 14:03, 4 May 2020 (UTC)
I am not certain that this is the place for that consensus. Personally I am focused on the functionality, and that any future functionality can be maintained with the minimal amount of work. If the name is considered problematic and insulting, then it seems worthwhile having that discussion wherever it is held.  — billinghurst sDrewth 14:21, 4 May 2020 (UTC)
There is phab:T16719#4079220, which is a (declined) task about renaming the list and links to two more tasks about renaming the list. Jo-Jo Eumerus (talk, contributions) 15:06, 4 May 2020 (UTC)
Sure. So... where? JzG (talk) 07:58, 7 May 2020 (UTC)

Support I think it would be good to get the misnomer (and stigmatization) out of the way MediaWiki-wide. I know it was originally written just to combat spam, but it has become more than that, and the argument comes up regularly: 'but it was not spammed, remove it from this list' (not that it matters). I presume this needs to go through a phab ticket, and there may be technical problems in doing this. Alternatively, the extension could be rewritten so that wikis would be able to choose which page (or pages, which would be great) contains the regexes, and give it their own name. --Dirk Beetstra T C (en: U, T) 08:56, 7 May 2020 (UTC)

  • I'm fine with this but I also wish we had multiple lists, or the ability to comment more easily, so that the list could have more nuance. For example, "\bgoatse\.info\b" is not allowed as a link for a very different reason than what most people think of as "spam". I don't know the history on "\bbible\-history\.com\b" but given that it looks like a fairly innocuous (but probably not WP:RS) site I wonder if someone was being a jerk and going around actually 'spamming' by adding links over and over.--Jimbo Wales (talk) 09:26, 5 May 2020 (UTC) - from [1]
  • I've got no issues with the terms "blacklist" or "whitelist"; they are industry terms and well understood. No concerns with a rename from "spam" to something more specific like "Link blacklist". — xaosflux Talk 16:14, 12 May 2020 (UTC)
  • Oppose: should be kept as "spam blacklist". 107.77.189.17 20:20, 15 May 2020 (UTC)