Talk:Spam blacklist

From Meta, a Wikimedia project coordination wiki
Jump to navigation Jump to search
Requests and proposals Spam blacklist Archives (current)→
The associated page is used by the MediaWiki Spam Blacklist extension, and lists regular expressions which cannot be used in URLs in any page in Wikimedia Foundation projects (as well as many external wikis). Any meta administrator can edit the spam blacklist; either manually or with SBHandler. For more information on what the spam blacklist is for, and the processes used here, please see Spam blacklist/About.
Proposed additions
Please provide evidence of spamming on several wikis and prior blacklisting on at least one. Spam that only affects a single project should go to that project's local blacklist. Exceptions include malicious domains and URL redirector/shortener services. Please follow this format. Please check back after submitting your report, there could be questions regarding your request.
Proposed removals
Please check our list of requests which repeatedly get declined. Typically, we do not remove domains from the spam blacklist in response to site-owners' requests. Instead, we de-blacklist sites when trusted, high-volume editors request the use of blacklisted links because of their value in support of our projects. Please consider whether requesting whitelisting on a specific wiki for a specific use is more appropriate - that is very often the case.
Other discussion
Troubleshooting and problems - If there is an error in the blacklist (i.e. a regex error) which is causing problems, please raise the issue here.
Discussion - Meta-discussion concerning the operation of the blacklist and related pages, and communication among the spam blacklist team.
#wikimedia-external-linksconnect - Real-time IRC chat for co-ordination of activities related to maintenance of the blacklist.
Whitelists
There is no global whitelist, so if you are seeking a whitelisting of a url at a wiki then please address such matters via use of the respective Mediawiki talk:Spam-whitelist page at that wiki, and you should consider the use of the template {{edit protected}} or its local equivalent to get attention to your edit.

Please sign your posts with ~~~~ after your comment. This leaves a signature and timestamp so conversations are easier to follow.


Completed requests are marked as {{added}}/{{removed}} or {{declined}}, and are generally archived quickly. Additions and removals are logged · current log 2018/10.

Translate this page
Projects

snippet for logging
{{sbl-log|18494987#{{subst:anchorencode:SectionNameHere}}}}

Proposed additions[edit]

Symbol comment vote.svg This section is for proposing that a website be blacklisted; add new entries at the bottom of the section, using the basic URL so that there is no link (example.com, not http://www.example.com). Provide links demonstrating widespread spamming by multiple users on multiple wikis. Completed requests will be marked as {{added}} or {{declined}} and archived.

spambot regexes (Oct 1)[edit]

  • Link/text requested to be blacklisted: \bglamour\.de/content/search/
  • Link/text requested to be blacklisted: \bmartindale\.com/Results.aspx\?
  • Link/text requested to be blacklisted: \bchange\.org/search\?q
  • Link/text requested to be blacklisted: \brealitysandwich\.com/\?s
  • Link/text requested to be blacklisted: \bmy3uka\.com/search/
  • Link/text requested to be blacklisted: \bdailymail\.co\.uk/home/search\.html\?
  • Link/text requested to be blacklisted: \btelegraph\.co\.uk/search

checked and tested  — billinghurst sDrewth 10:08, 1 October 2018 (UTC)

Added Added to Spam blacklist. -- — billinghurst sDrewth 10:08, 1 October 2018 (UTC)

hdorg.ru[edit]



Spam-bot target. -- Tegel (Talk) 19:16, 8 October 2018 (UTC)
@Tegel: Added Added to Spam blacklist. --Tegel (Talk) 19:16, 8 October 2018 (UTC)

sci-hub[edit]





































And many others. Per discussion on en.wikipedia (User:DMacks, User:JzG), this is a largely copyright violating site, where all links are replaceable with non-copyright violating material (the originals). On en.wikipedia now

  • Link/text requested to be blacklisted: \bsci-hub\.

has been blacklisted. On en.wikipedia, JzG suggested this to be meta-blacklisted (to which I agree per en:WP:COPYLINK), hence discussion here. --Dirk Beetstra T C (en: U, T) 13:09, 15 October 2018 (UTC)

Also @Galobtter and Anachronist:. --Dirk Beetstra T C (en: U, T) 13:53, 15 October 2018 (UTC)
Agreed. Sci-Hub jumps from one TLD to another (bz, cc, ac, and now tw), so blacklisting sci-hub.* (any TLD) would be appropriate globally, not just on en-wiki. Anachronist (talk) 16:38, 15 October 2018 (UTC)
Commons has a few dozen links, and they involve various different TLDs as copyright lawsuits force them to keep jumping. Global blacklist sounds good. DMacks (talk) 16:54, 15 October 2018 (UTC)
@Biochemlife and Anahit.19: adding broader users.  — billinghurst sDrewth 01:06, 16 October 2018 (UTC)




Also these IPs may need to be blacklisted. --Dirk Beetstra T C (en: U, T) 08:37, 17 October 2018 (UTC)

omn.kr[edit]



URL shortener. — regards, Revi 07:55, 16 October 2018 (UTC)

Added Added to Spam blacklist. — regards, Revi 07:56, 16 October 2018 (UTC)

spiritualsadhana.com[edit]



Astrologers ad site, that was spammed persistently since 2015 on en-Wikipedia, but also sometimes on te and ta Wikipedias. Various users are blocked, others only warned. PaleoNeonate (talk) 21:50, 18 October 2018 (UTC)

@PaleoNeonate: Added Added to Spam blacklist. -- — billinghurst sDrewth 11:23, 19 October 2018 (UTC)

dtf.ru[edit]



Possible irreliable source, wiki website or blog. Anyone can add his own post and edit it. Globally (in all wikis) invalidate is requested. HanzoHanzo (talk) 11:27, 19 October 2018 (UTC)

Comment Comment There is allowed use at some wikis, and it is not banned at any wikis.  — billinghurst sDrewth 21:51, 19 October 2018 (UTC)

Proposed additions (Bot reported)[edit]

Symbol comment vote.svg This section is for domains which have been added to multiple wikis as observed by a bot.

These are automated reports, please check the records and the link thoroughly, it may report good links! For some more info, see Spam blacklist/Help#COIBot_reports. Reports will automatically be archived by the bot when they get stale (less than 5 links reported, which have not been edited in the last 7 days, and where the last editor is COIBot).

Sysops
  • If the report contains links to less than 5 wikis, then only add it when it is really spam
  • Otherwise just revert the link-additions, and close the report; closed reports will be reopened when spamming continues
  • To close a report, change the LinkStatus template to closed ({{LinkStatus|closed}})
  • Please place any notes in the discussion section below the HTML comment

COIBot[edit]

The LinkWatchers report domains meeting the following criteria:

  • When a user mainly adds this link, and the link has not been used too much, and this user adds the link to more than 2 wikis
  • When a user mainly adds links on one server, and links on the server have not been used too much, and this user adds the links to more than 2 wikis
  • If ALL links are added by IPs, and the link is added to more than 1 wiki
  • If a small range of IPs have a preference for this link (but it may also have been added by other users), and the link is added to more than 1 wiki.
COIBot's currently open XWiki reports
List Last update By Site IP R Last user Last link addition User Link User - Link User - Link - Wikis Link - Wikis
agroservice.hu 2018-10-22 21:13:33 COIBot 212.92.23.125 DimaNik1983
Dingar
PhanAnh123
2018-10-22 08:53:40 14 5
amiscathedralechalonsenchampagne.blogspot.de 2018-10-23 00:06:39 COIBot 172.217.8.1 R Dldwg 2018-10-22 21:54:15 1557 3 0 0 2
arterina.ir 2018-10-22 13:01:25 COIBot 88.99.160.208 ArteRina 2018-10-22 12:43:06 27 27 0 0 3
benatintxausti.com 2018-10-23 05:13:52 COIBot 87.98.231.4 R Hejsa 2018-10-22 19:07:53 2506 14 0 0 11
de.uncyclopedia.org 2018-10-23 05:17:43 COIBot 151.101.192.194 R Bullenwächter
Camervan
GenieGenieGenie
Holy Goo
Jaquento
Shi Annan
花蝶風月雪月花警部
2018-10-22 09:01:05 19 6
desencyclopedie.com 2018-10-23 03:08:53 COIBot 151.101.192.194 R DobyvatelCZ
G3ron1mo
Jana Lánová
Kiriku
Roriromrack
わたらせみずほ
花蝶風月雪月花警部
210.182.241.23
211.214.215.50
88.193.162.30
2018-10-22 09:01:05 8 3
esporterio.blogspot.pt 2018-10-23 04:53:34 COIBot 172.217.8.1 R Balamen
85.243.168.182
2018-10-23 02:15:21 46 3
flynano.blogspot.ca 2018-10-22 21:18:01 COIBot 172.217.8.1 R Giammarco Ferrari
J Doug McLean
2018-10-22 19:20:43 3 3
googlepress.blogspot.no 2018-10-23 01:02:52 COIBot 172.217.8.1 R Google9912
Iniyavan iniya
SBCornelius
Wewe123456789
2018-10-22 17:20:19 95 2
leathertools.com.ua 2018-10-23 01:01:21 COIBot 31.131.18.227 176.115.97.36
193.0.221.90
77.91.188.80
2018-10-23 01:00:27 9 2
morefmonline.com 2018-10-23 01:00:30 COIBot 165.227.21.77 189.204.164.43
189.223.158.131
189.223.186.2
200.56.109.224
47.144.100.196
2018-10-22 23:31:50 10 2
profiles.wordpress.org 2018-10-22 21:05:22 COIBot 198.143.164.252 R AlaminHoney
Chris dev
Continentaleurope
DeusTron
Eliashedberg
Eskapism
Faerydancer02
Fleshgrinder
Forcing7
Guruitengineer
Hatashe
Itapplication
Johnragozzine
Marsjaninzmarsa
NabiLMoqbel
Nobin420
Olena Y
Ppolonia
Praveen Kumar Nagercoil
SureshHARDIYA
Thehopemonger
Tradesouthwest
Ulfben
Vivekkush1983
Wikiprobz
Wisatajogja
WisdmLabs
Xarxa21
176.51.180.143
2018-07-24 17:29:29 54 14
q0t.kl.com.ua 2018-10-23 01:10:41 COIBot 95.211.16.67 195.114.136.232
195.114.136.253
195.114.136.65
2018-10-22 12:35:02 7 3
rondan.net 2018-10-23 05:18:40 COIBot 183.90.246.70 Atanahama 2018-10-23 05:17:47 17 23 0 0 2
ruicostacyclist.blogspot.fr 2018-10-22 22:33:20 COIBot 172.217.8.1 R Hejsa 2018-10-22 19:43:50 2506 8 0 0 7

Proposed removals[edit]

Symbol comment vote.svg This section is for proposing that a website be unlisted; please add new entries at the bottom of the section.

Remember to provide the specific domain blacklisted, links to the articles they are used in or useful to, and arguments in favour of unlisting. Completed requests will be marked as {{removed}} or {{declined}} and archived.

See also recurring requests for repeatedly propo sed (and refused) removals.

Notes:

  • The addition or removal of a domain from the blacklist is not a vote; please do not bold the first words in statements.
  • This page is for the removal of domains from the global blacklist, not for removal of domains from the blacklists of individual wikis. For those requests please take your discussion to the pertinent wiki, where such requests would be made at Mediawiki talk:Spam-blacklist at that wiki. Search spamlists — remember to enter any relevant language code

genetherapynet.com[edit]



I tried adding this and found it was blacklisted here. It seems that someone with a COI to that site and others was spamming it on articles cross wiki. As far as I can tell the editor has not been here for quite a long time (see here, this and Talk:Spam blacklist/Archives/2011-02). Was wondering if it could be removed. Not sure about the other websites that were blacklisted at the same time as I have not explored them. Aircorn (talk) 06:28, 19 July 2018 (UTC)

@Aircorn: it was being spammed in 2011. A removal needs more than "I want to add it", it usually needs firm reasoning about why it is usable at the sites. You can always ask about whitelisting at w:en:mediawiki talk:spam-whitelist  — billinghurst sDrewth 09:32, 9 August 2018 (UTC)
It is quite hard to find sites that explain genetic engineering and are user friendly. It would make my editing life a little bit easier if I could use it as a reference, but it is not a site I would regularly use as there are much better (although less accessable) resources out there. I figured that since it was (as far as I could tell) only blacklisted due to spamming a long time ago then it would be relatively easy to unblacklist once that was no longer an issue. I will look at whitelisting if I really need it. Thanks for the response. Aircorn (talk) 09:44, 9 August 2018 (UTC)

thecompany.pl[edit]



I am asking you to consider removing this site from the blacklist. —The preceding unsigned comment was added by Kocury (talk)

@Kocury: You will need to make a stronger case than "please remove this site". Why should it be removed? What is the value of the site to the wikis, especially the Wikipedias? How are you proposing to use the link? etc. It has been blacklisted, and was done for a reason at the time.  — billinghurst sDrewth 22:57, 26 August 2018 (UTC)
This is the homepage of the free project that deals with the conversion of games from the Commodore Amiga computer to a Windows. I would like to add this address to the polish page about "Amiga". https://pl.wikipedia.org/wiki/Amiga - I wrote to Polish wikipedia moderators, but they wrote that I should write a request here to remove site from global blacklist filter. —The preceding unsigned comment was added by Kocury (talk)
That community has the ability to whitelist if they believe that it is a viable website. No one is commenting that they think that the site if valuable, and it is one that is spammed, and I am not seeing a case presented that there is clear value to remove the link, and that we will avoid the issues that have previously occurred.  — billinghurst sDrewth 07:14, 3 September 2018 (UTC)
I see that my request is too big to give a "second chance" for this site. I did not know it was such a problem - I'm going back to the Polish administrators/moderators, *SIGH* :) Kocury 14:38, 4 September 2018 (UTC)
It is not about it being too big. We have it being spammed, and managed as a spammer. You have requested its removal, and there wass no evidence of community support for two weeks, nor evidence that it is not going to be spammed again. You want it at one site, so apply there. If you think that there is broader support, then where is it, can you point me to a local discussion elsewhere? I should remove it for use on all sites solely on your say so?  — billinghurst sDrewth 11:22, 5 September 2018 (UTC)

I apologize for the late response. I prefer to ask you to remove the page from the global filter, because making exceptions as you can see is more onerous. I believe that the site should not be responsible for spam, only the person who spammed it. It's probably not the point that the website owner would be responsible for spammers? I would like to remove the page from the global filter,, but what evidence should I present, I have no idea - I'm giving up :) The site has been functioning for 10 years, it allows legal playing games from the Commodore Amiga computer without knowledge of the emulator, I thought that it is just worth adding it at least to the Polish side, so that the article was more complete :)

Here's the discussion from History:

  1. https://pl.wikipedia.org/w/index.php?title=Wikipedia:Pro%C5%9Bby_do_administrator%C3%B3w&oldid=54096818
  2. https://pl.wikipedia.org/w/index.php?title=Dyskusja_MediaWiki:Spam-whitelist&oldid=54005870

(the discussion was finally removed without a reason)

There are more, but I have trouble searching. If I have not convinced you, maybe in a few years someone more important than me will ask you to add a page :)  — Kocury 20:05, 17 September 2018 (UTC)

@Kocury: this has nothing to do with your importance, but with the mitigation of spam. This is not an appropriate use of external links on many wikis (not on en.wikipedia only), I even doubt it is a proper use of external links on pl.wikipedia (see this link removal of this link). Wikipedia (in most languages, if not all languages) is not a platform for this type of links. That is why it got reverted, and that is why it got blacklisted globally. It is likely why it got ignored on pl.
Regarding the user being responsible, yes, they are. And if that user persists in adding these links (as happened here), then the only way to point that user to said responsibility is to blacklist. We are not here to play whack-a-mole, we are here to build an encyclopedia, and the blacklist is there to protect damage to the Wikipedia.
Hence, Declined Declined. That link is not a suitable external link on most wikis, if people want to play Amiga games through a website, then they can use Google to find such pages. --Dirk Beetstra T C (en: U, T) 06:16, 20 September 2018 (UTC)

youtu.be[edit]



The all inclusive *.be filter is blocking YouTube short links (youtu.be) from being used. This also includes wikis not in the Wikipedia network. I am asking that youtu.be itself be allowed while still filtering the rest of *.be. Thank you. Alexia E. Smith (talk) 19:05, 11 October 2018 (UTC)

Additional information: This affects over 2,000 wikis and would require manually editing every MediaWiki:Spam-whitelist on those wikis along with continual new wikis to maintain this exception. Trying to explain to non-technical users on how to transform the shortened URL into the full URL every instance this issue comes up is time consuming especially when it is the preferred URL presented to end users by the YouTube service when requesting a sharing URL. I see that in the past this request has been denied multiple times with those proposing it bringing up the same issue; attempting to get non-technical users to transform the URL. This spam blacklist affects not just Wikimedia projects, but thousands of external projects as well. The argument of blacklisting certain videos on YouTube can be solved by expanding the regex to also include (youtu\.be|youtube\com) in the regex. Alexia E. Smith (talk) 15:52, 12 October 2018 (UTC)
@Alexia E. Smith: It is not a .be rule, this is a specific youtu.be-rule. That rule is added for a reason.
Although I agree that the regexes could be expanded, that same story as your initial story applies - that needs to be clear to the maintainers for all wikis (at least within the MediaWiki realm). There is still a significant amount of material on youtube that simply should not be added for copyright reasons (and that often tends to be the material that is linked, much of the material where there is no copyright concern simply never will get linked as it is not applicable to Wikipedia), or is otherwise discouraged to be linked here.
I don't think that anything outside of the 800+ MediaWiki wikis is going to be a direct reason for us to remove rules. One could consider to expand in the installation documentation or in the example-whitelist of the installation to specifically whitelist domains when used on external installations. --Dirk Beetstra T C (en: U, T) 12:45, 13 October 2018 (UTC)
@Alexia E. Smith: sorry, broke the ping. --Dirk Beetstra T C (en: U, T) 13:01, 13 October 2018 (UTC)
@Beetstra: (Ping doesn't seem to ping me anyway.) If Extension:SpamBlacklist had a simple way to set a setting to globally whitelist a domain this would not be an issue. For me to whitelist this domain I will need to automate a solution to whitelist it on 2,126 wikis and also apply that to any new wikis that are created on a daily basis. The other solution would be to fork Extension:SpamBlacklist to have a setting that would allow a PHP setting based whitelist. However, I am trying to avoid forking the MediaWiki core more than necessary. I figured it was better to appeal here than to go with the less desirable options first. Alexia E. Smith (talk) 15:38, 13 October 2018 (UTC)
@Alexia E. Smith: as I said, the list here is mainly to protect mediawiki projects. I don’t think we should delist here for outside projects. Did you consider running a cross wiki bot on your wikis? —Dirk Beetstra T C (en: U, T) 16:10, 13 October 2018 (UTC)

Troubleshooting and problems[edit]

Symbol comment vote.svg This section is for comments related to problems with the blacklist (such as incorrect syntax or entries not being blocked), or problems saving a page because of a blacklisted link. This is not the section to request that an entry be unlisted (see Proposed removals above).

Discussion[edit]

Symbol comment vote.svg This section is for discussion of Spam blacklist issues among other users.