Talk:Spam blacklist

From Meta, a Wikimedia project coordination wiki
This is an archived version of this page, as edited by Billinghurst (talk | contribs) at 06:30, 2 December 2012 (text edit). It may differ significantly from the current version.

Shortcut:
WM:SPAM
WM:SBL
The associated page is used by the MediaWiki Spam Blacklist extension, and lists regular expressions which cannot be used in URLs in any page in Wikimedia Foundation projects (as well as many external wikis). Any meta administrator can edit the spam blacklist, either manually or with SBHandler. For more information on what the spam blacklist is for, and the processes used here, please see Spam blacklist/About.
Proposed additions
Please provide evidence of spamming on several wikis. Spam that only affects a single project should go to that project's local blacklist. Exceptions include malicious domains and URL redirector/shortener services. Please follow this format. Please check back after submitting your report; there could be questions regarding your request.
Proposed removals
Please check our list of requests which repeatedly get declined. Typically, we do not remove domains from the spam blacklist in response to site-owners' requests. Instead, we de-blacklist sites when trusted, high-volume editors request the use of blacklisted links because of their value in support of our projects. Please consider whether requesting whitelisting on a specific wiki for a specific use is more appropriate - that is very often the case.
Other discussion
Troubleshooting and problems - If there is an error in the blacklist (e.g. a regex error) which is causing problems, please raise the issue here.
Discussion - Meta-discussion concerning the operation of the blacklist and related pages, and communication among the spam blacklist team.
#wikimedia-external-links (connect) - Real-time IRC chat for co-ordination of activities related to maintenance of the blacklist.

Please sign your posts with ~~~~ after your comment. This leaves a signature and timestamp so conversations are easier to follow.


Completed requests are marked as {{added}}/{{removed}} or {{declined}}, and are generally archived (search) quickly. Additions and removals are logged · current log 2024/07.

snippet for logging
{{sbl-log|4721296#{{subst:anchorencode:SectionNameHere}}}}


Proposed additions

This section is for proposing that a website be blacklisted; add new entries at the bottom of the section, using the basic URL so that there is no link (example.com, not http://www.example.com). Provide links demonstrating widespread spamming by multiple users on multiple wikis. Completed requests will be marked as {{added}} or {{declined}} and archived.
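
As a rough illustration of the "basic URL" convention above (example.com, not http://www.example.com), the following sketch strips the scheme and a leading "www." from a pasted link. The helper name `bare_domain` is hypothetical and not part of any tool used on this page.

```python
from urllib.parse import urlsplit


def bare_domain(url: str) -> str:
    """Return the host of a link without scheme or leading 'www.' (hypothetical helper)."""
    # urlsplit only recognises the host when a scheme or "//" is present,
    # so prepend "//" to inputs such as "example.com".
    parts = urlsplit(url if "//" in url else "//" + url)
    host = parts.hostname or ""
    return host[4:] if host.startswith("www.") else host


print(bare_domain("http://www.example.com"))
print(bare_domain("example.com"))
```

Both calls print the unlinked form, example.com, which is the form requested for reports in this section.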

inforapid.org



This is just a meta page with no content of its own; it is essentially a Wikipedia mirror, so it is not compatible with WP:EL (anywhere). Links to the domain are used as external links and even as references across several projects. COIBot has 55 records, but there are many more:

Top 10 editors who have added inforapid.org: [name not readable due to unicode-problems] (4), Wikiherder (3), Cinmad (3), 78.6.226.210 (3), Susann Schweden (2), KurtR (2), Bernd Rieke (2), Veronidae (2), CorenSearchBot (1), Aschroet (1).
Top 10 wikis where inforapid.org has been added: w:en (20), w:de (14), w:it (6), w:bg (4), w:es (3), wikt:de (2), w:ru (1), w:pt (1), w:fr (1), w:az (1).

I deleted a couple of links in w:en, w:fr, w:it, but there are still many links left. The problem is: if I blacklist the page now at meta, some problems may occur, e.g., some archive bots can't cope with threads containing blacklisted links. I could use CamelBot to remove the links from the article namespace (ANS) and replace the links with plain URLs outside it, but my bot has the bot flag in w:de only.
What do we normally do in such cases? Are there global bots that could help? -- seth (talk) 18:58, 2 June 2012 (UTC)

I deleted the ANS links by hand now. What should we do with the rest of the links? -- seth (talk) 15:44, 3 June 2012 (UTC)
I think we should monitor this one. I see no direct evidence of spamming, but the link is not very useful. EdBever (talk) 07:19, 9 June 2012 (UTC)
It's just a wp-mirror, so it does not add any information to articles. Apart from that, it has even been used many times as a reference. So it's not intended spamming by one person, but unintended spamming by many people. -- seth (talk) 09:39, 9 June 2012 (UTC)
I don't think there is such a thing as unintended spamming. A number of users found the site useful and inserted it into various articles. We shouldn't judge the site's content on this page since it is meant for fighting spam. EdBever (talk) 14:20, 14 June 2012 (UTC)
I understand your point. However, the users who link to that page just ignore or don't know our rules. Links to wp mirrors satisfy neither w:de:WP:EL nor w:en:WP:EL. I don't think that other wp-projects want such links.
In w:de it is common to blacklist wp mirrors. Is that different in other wiki-projects? -- seth (talk) 15:57, 16 June 2012 (UTC)

zelenaplus.com

Indonesian and English Wikipedias:

--A. B. (talk) 22:56, 24 September 2012 (UTC)

Links removed, user has been warned. Let's wait until this reoccurs. EdBever (talk) 06:45, 28 September 2012 (UTC)

DGtraffic (Indonesia) spam on Wikipedia

DGTraffic is a large Indonesian SEO firm. [1] [2]

Reference:

Accounts






[4]



[5]

id wikipedia only





id wikipedia only



Domains spammed

These were spammed across Indonesian and English Wikipedias. The spam added to en.wikipedia was done solely for "link love" since the links led to Indonesian language sites. Some spam known to have been added to Commons, Simple, Lombard and Ten. Unfortunately, global contributions search is down, so I don't know if there's more out there.

Related domains to blacklist

SEO blogs







Spammed domains not listed for blacklisting today


    • News publication possibly useful as a reference on id.wikipedia


    • Big Indonesian financial company
Related domain not listed for blacklisting today


    • SEO client; domain not known to have been spammed yet

--A. B. (talk) 01:33, 25 September 2012 (UTC)

More domains and accounts

--A. B. (talk) 02:51, 11 October 2012 (UTC)

Myspacetv.com



Noted this as caught by LiWa3/COIBot. This is a plain redirect site to myspace's video part (myspace.com/tv). Generally, we do blacklist redirect sites on sight, though for dedicated servers an exception may sometimes be made. However, we do have quite a few myspace-pages blacklisted (10 myspace.com/<id> - rules), and there is a possible issue there (links to videos should always be double checked, and if a dedicated video server like YouTube has copyright violations on it, how about a social networking site ..). Should we just monitor this, or should we consider blacklisting and cleaning/converting? --Dirk Beetstra T C (en: U, T) 11:08, 12 November 2012 (UTC)

Example of what this leads to w:en:Avril_Lavigne's_Make_5_Wishes#Video_Episodes - for me they are all blocked due to a local firewall, I hear that others can't see them either since they have to sign up. Therefore, suggesting also:

  • myspace.com/tv

That just fails our inclusion standards. Will need to have the 200 links cross-wiki cleaned up, though. --Dirk Beetstra T C (en: U, T) 11:18, 12 November 2012 (UTC)

Note that the Avril Lavigne's Make 5 Wishes section has been nuked. --Beetstra public (talk) 06:58, 14 November 2012 (UTC)

yamudikimogudumovie.com



Domain appears to be offline, but a Google search of the domain returns nothing but what looks like spam. --Jasper Deng (talk) 06:44, 29 November 2012 (UTC)

Declined: seems to have only been at mediawiki. We try not to add one-offs to the global list, and look to add domains that were added xwiki or are a large persistent problem that requires global blacklisting. I have asked COIBot to watch for it xwiki. — billinghurst sDrewth 09:50, 29 November 2012 (UTC)

Mozaik Publishing related





Accounts

--Hu12 (talk) 17:39, 1 December 2012 (UTC)

Proposed additions (Bot reported)

This section is for domains which have been added to multiple wikis as observed by a bot.

These are automated reports; please check the records and the link thoroughly, as a report may include good links! For some more info, see Spam blacklist/Help#COIBot_reports. Reports will automatically be archived by the bot when they get stale (fewer than 5 links reported, which have not been edited in the last 7 days, and where the last editor is COIBot).
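
The staleness rule in the paragraph above can be read as a simple three-part test. The sketch below is only an interpretation of that sentence, not COIBot's actual code; the `Report` fields and the name `is_stale` are hypothetical, and treating "not edited in the last 7 days" as "at least 7 days old" is an assumption.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class Report:
    # Hypothetical fields; the real report format is not described here.
    links_reported: int
    last_edited: datetime
    last_editor: str


def is_stale(report: Report, now: datetime) -> bool:
    """All three conditions from the paragraph above must hold."""
    return (
        report.links_reported < 5
        and now - report.last_edited >= timedelta(days=7)
        and report.last_editor == "COIBot"
    )


now = datetime(2012, 12, 2)
print(is_stale(Report(4, datetime(2012, 11, 20), "COIBot"), now))
print(is_stale(Report(4, datetime(2012, 12, 1), "COIBot"), now))
```

Under this reading, a report that a sysop has commented on (so that COIBot is no longer the last editor) is not archived automatically.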

Sysops
  • If the report contains links to fewer than 5 wikis, then only add it when it is really spam
  • Otherwise just revert the link-additions, and close the report; closed reports will be reopened when spamming continues
  • To close a report, change the LinkStatus template to closed ({{LinkStatus|closed}})
  • Please place any notes in the discussion section below the HTML comment

COIBot

The LinkWatchers report domains meeting the following criteria:

  • When a user mainly adds this link, and the link has not been used too much, and this user adds the link to more than 2 wikis
  • When a user mainly adds links on one server, and links on the server have not been used too much, and this user adds the links to more than 2 wikis
  • If ALL links are added by IPs, and the link is added to more than 1 wiki
  • If a small range of IPs have a preference for this link (but it may also have been added by other users), and the link is added to more than 1 wiki.
COIBot's currently open XWiki reports
List Last update By Site IP R Last user Last link addition User Link User - Link User - Link - Wikis Link - Wikis
vrsystems.ru 2023-06-27 15:51:16 COIBot 195.24.68.17 192.36.57.94
193.46.56.178
194.71.126.227
93.99.104.93
2070-01-01 05:00:00 4 4

Proposed removals

This section is for proposing that a website be unlisted; please add new entries at the bottom of the section.

Remember to provide the specific domain blacklisted, links to the articles they are used in or useful to, and arguments in favour of unlisting. Completed requests will be marked as {{removed}} or {{declined}} and archived.

See also /recurring requests for repeatedly proposed (and refused) removals.

The addition or removal of a domain from the blacklist is not a vote; please do not bold the first words in statements.

pro-speleo.ru



Please EXCLUDE pro-speleo.ru from the \bpro-*\.ru\b spam series, since it has no relation to the pro-gorod series; instead, it is a major source for speleology-related articles in Russian. --Untifler (talk) 04:03, 2 November 2012 (UTC)

youtu.be



This is officially used by YouTube, and when a link is shared, it is automatically given in this format:

http://youtu.be/3pZUCKt0RKc

First of all, there is generally absolutely NO reason to use link redirectors - the full link can simply be used. Youtu.be is a specific redirect site (not a 'custom' one pointing to other sites); however, it can be used to circumvent the blacklisting of YouTube videos (on the individual language projects and on meta, several specific YouTube links are blacklisted because they were spammed or otherwise abused). That, combined with the normal problems that YouTube links have (although there is a lot of good material, there are still plain copyright violations, it is not available to all, etc.), is enough reason to decline this. You can use http://www.youtube.com/watch?v=3pZUCKt0RKc (IIRC, there is a checkbox in the share option that gives you the full link; otherwise, converting the 'youtu.be/' to 'youtube.com/watch?v=' does the trick). Thanks. --Dirk Beetstra T C (en: U, T) 11:06, 10 November 2012 (UTC)
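
For anyone converting many such links, the substitution described above ('youtu.be/' to 'youtube.com/watch?v=') can be sketched as follows. The function name `expand_youtu_be` is hypothetical, and the sketch assumes the short link carries nothing but the video ID in its path.

```python
from urllib.parse import urlsplit


def expand_youtu_be(url: str) -> str:
    """Rewrite a youtu.be short link as a full youtube.com watch URL."""
    parts = urlsplit(url)
    if (parts.hostname or "") != "youtu.be":
        return url  # not a short link; leave it untouched
    video_id = parts.path.lstrip("/")
    if not video_id:
        return url  # no video ID to convert
    return f"http://www.youtube.com/watch?v={video_id}"


print(expand_youtu_be("http://youtu.be/3pZUCKt0RKc"))
```

For the example in this request, this yields the same full link quoted in the reply above.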

Talk.to



I don't understand why this website has been blocked. I tried to find the reasons for its blacklisting but was unable to find any. It is only a chat and communication software; there isn't any real need to block it, is there?
I had originally put this request up here
http://en.wikipedia.org/wiki/MediaWiki_talk:Spam-blacklist#Proposed_removals
but was asked to defer to this list. I understand that this domain was earlier used as a spamming platform, but since the blacklisting it has changed ownership and use.
Now it's the website for an upcoming communication platform, and that is its sole purpose. Please respond.

Earlier blacklisting - it was a redirect service, which are generally blacklisted on sight (even when not spammed or abused yet). There simply is no valid reason to use a redirect service anyway.
Anyways, it apparently has changed ownership. Question is, is it now notable enough? You say 'upcoming' (see en:WP:CRYSTAL?). Though, if it is not a redirect service anymore, then there is also just no reason to keep it blacklisted. Thoughts? --Beetstra public (talk) 10:56, 24 November 2012 (UTC)
Well, by the word "upcoming", I meant to say it's growing. I wasn't implying that it hasn't been launched and apologize for any misunderstanding. As you rightly pointed out, Wikipedia is not a crystal ball or a product announcement platform, but it does include information about newly-released products and that's all this webpage shows. Talk.to is quite popular and has been featured on blogs within Spain and Italy which cover Android apps. Also, it enjoys a user base of hundreds of thousands of users across several platforms such as Android, iPhone, Windows Phone, PC, Mac and Google Chrome.
Also, as you said, since it's not a redirect service anymore, there is no reason to blacklist it anymore, since now it's just a website for a communication software, like whatsapp.com or viber.com, both of which are featured on Wikipedia. So even if its popularity might be debatable, there is still no real justification to keep it on the blacklist.
Thanks for your prompt response and hoping for a positive reply!
-- Ankush Saxena 120.56.170.79 18:24, 27 November 2012 (UTC)
I agree that it is probably no longer worthy of blacklisting if it isn't a redirect, though I would add that I don't see that it has any content to which we would be linking from the wikis, and I would be surprised to see much linking to the site if it were removed from the blacklist. — billinghurst sDrewth 00:36, 28 November 2012 (UTC)
That might well be true, but anyway, no one can predict that either way. Which brings us back to the question: is it going to continue being in the blacklist, or will it be removed, since it's not a redirect service any longer? Thanks for your response!
-- Ankush Saxena 115.111.191.42 09:56, 28 November 2012 (UTC)

cartconvert.allowed.org



I suggest whitelisting cartconvert.allowed.org. It is a RESTful cartography service for transformation of geolocation bearing points to other geolocation representations. I tried adding the service to the German page http://de.wikipedia.org/wiki/Bundesmeldenetz, but because *.allowed.org is blacklisted, it was not possible. I am also the author of cartconvert.allowed.org. — The preceding unsigned comment was added by 86.33.210.43 (talk)

I am not sure why Wikipedia should link to this; could you elaborate? Note that global whitelisting is impossible; for that you'd need to place requests on the wiki where this link may be of interest. Thanks. --Beetstra public (talk) 05:38, 25 November 2012 (UTC)
The German article http://de.wikipedia.org/wiki/Bundesmeldenetz describes the former geodetic datum of Austria. Lots of legacy data in this format is still lingering around. I am also involved in open government data, and while a description of the conversion for this geodetic datum is available, no authoritative public service is available to my knowledge. I provided this service as an open source project and it's available online as cartconvert.allowed.org. I would like to add an external link on the mentioned page so that people interested in putting points given in the geodetic datum of Austria (Bundesmeldenetz) on e.g. openstreetmap could have a start. — The preceding unsigned comment was added by 193.171.58.240 (talk)

www.bodybuilding-magazin.de



Hi, the reference in article de:Fouad Abiad cannot be formatted properly, due to the blacklist for "bodybuilding" on meta. --Valvetube (talk) 11:28, 27 November 2012 (UTC)

updated regex, you should be right to go. — billinghurst sDrewth 12:39, 27 November 2012 (UTC)
Thanks. --Valvetube (talk) 13:43, 27 November 2012 (UTC)

Remove, well in a fashion — billinghurst sDrewth 11:19, 30 November 2012 (UTC)

eduvision.edu.pk



Hi, please remove www.eduvision.edu.pk from the blocklist, as it is a very useful website for the students of Pakistan, offering complete information on the universities of Pakistan and the programs they offer.

It is not globally blacklisted. You will find that it is blacklisted locally at these sites [w:en], [w:ne], [w:sq] — billinghurst sDrewth 11:29, 30 November 2012 (UTC)
Declined: nothing to do (see above) — billinghurst sDrewth 11:30, 30 November 2012 (UTC)

avoiceformen.com





This request is for the ban on avoiceformen.com to be removed, at least on a page-by-page basis. I reviewed the original complaint about how various Australian IPs had been adding this men's rights activist blog/website to articles. I looked at the examples and agree they were abusive, but the site is too important to be banned because of the actions of potentially one misguided user roaming internet cafes.

How the link can be useful on Wikipedia: avoiceformen.com is the largest men's rights organization among the hundreds of websites and blogs known collectively as "the manosphere". The term "manosphere" has over 74,000 results in Google and is mentioned in various government reports but has yet to appear in Wikipedia. Avoiceformen.com offers an explanation of terms and concepts that are indispensable for describing this growing social movement. I have drafted quite a number of Wikipedia additions to describe those terms and concepts used throughout the men's rights movement. Not having the ability to reference Avoiceformen.com without having each link approved will be a big inconvenience for me, but not having the ability to reference the site at all will make the effort practically impossible, since a great many of the men's rights activists in the English-speaking world have written for, or are in some way associated with, the website. As a third party who does not work for avoiceformen.com and has no formal relationship with the operator of the site or any employee of the site, I feel that whether or not one agrees with its precepts, there is significant benefit in allowing this movement to be documented neutrally and impartially in Wikipedia.

Reasoning why the blacklisting is not necessary anymore: Blocking such an important men's rights website in its entirety for the misuse of potentially one single Australian user makes as little sense as blocking youtube.com for the abuse of one video channel. Furthermore, the website is based out of Texas, not Australia. The owners of the site may have opinions that are disagreeable to some, but they appear to be quite fastidious about properly citing what they believe to be reliable sources rather than inserting fake references as the spammer has done. The spam was almost certainly not from the site itself.

This sounds like a good example where specific whitelisting on the specific projects for the specific links can be performed. Since people found it necessary to abuse the whole domain (as opposed to what happens to youtube.com), I think it is good that the whole domain is blacklisted. --Beetstra public (talk) 05:53, 2 December 2012 (UTC)
Either way, it is not something that has been done globally; it has all been blocked by local wikis (en, ne, sq) at their own instigation
  • avoiceformen.org is not caught by regexes on black, white or revertlists.
  • avoiceformen.com is caught by blacklists: [w:en] \bavoiceformen\.com\b, [w:ne] \bavoiceformen\.com\b, [w:sq] \bavoiceformen\.com\b
so this will need to be taken up at each wiki via their respective Mediawiki talk:Spam-blacklist pages — billinghurst sDrewth 06:22, 2 December 2012 (UTC)
Declined: not listed globally — billinghurst sDrewth 06:22, 2 December 2012 (UTC)
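
To see why the local rule quoted above catches avoiceformen.com but not avoiceformen.org, the fragment can be tried with Python's `re` module. This is only an approximation: the Spam Blacklist extension combines such fragments into larger patterns and uses PHP's regex engine, so the helper `is_caught` below is a hypothetical stand-in, not the extension's matching logic.

```python
import re

# The fragment exactly as listed on the local blacklists quoted above.
RULE = re.compile(r"\bavoiceformen\.com\b")


def is_caught(url: str) -> bool:
    """Return True if the fragment matches anywhere in the URL."""
    return RULE.search(url) is not None


print(is_caught("http://www.avoiceformen.com/some-article/"))
print(is_caught("http://www.avoiceformen.org/"))
```

The escaped dot and the trailing `\b` tie the rule to the literal ".com" ending, which is why the .org domain is not caught by it.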

Troubleshooting and problems

This section is for comments related to problems with the blacklist (such as incorrect syntax or entries not being blocked), or problems saving a page because of a blacklisted link. This is not the section to request that an entry be unlisted (see Proposed removals above).

I would like to add the following "published" template for documentation purposes. --LoKiLeCh (talk) 22:44, 1 November 2012 (UTC) {{published|cite=web|url=http://www.mechanical-engineering.suite101.com/article/ten-mechanical-failure-modes-a149010|title=Ten Mechanical Failure Modes |legal=yes|publisher=www.mechanical-engineering.suite101.com}}

Discussion

This section is for discussion of Spam blacklist issues among other users.

New cross-wiki linksearch wiki sets

Don't click on these, they are just examples and you will get a timeout.

  • This cannot go any higher, the maximum execution time on Google App Engine is only 60 seconds.
  • {en,de,fr}.{wikipedia,wikibooks,wikiquote,wiktionary} + Commons, Meta and mediawiki.org (15 wikis). I will cover the new travel guide project when it is ready.

Just a reminder, I also have a spam archive search (en and meta only). MER-C (talk) 13:37, 1 October 2012 (UTC)

Hello MER-C, thanks for this - I'd like to note that currently spamarchivesearch.jsp only gives links to en.wikipedia. For example http://wikipediatools.appspot.com/spamarchivesearch.jsp?query=whale.to (recurring request listed here). Regards, -- MarcoAurelio (talk) 13:52, 1 October 2012 (UTC)
I blame MediaWiki's sucky search engine for that. MER-C (talk) 00:50, 2 October 2012 (UTC)
I changed the search string. It should now work reasonably. MER-C (talk) 12:12, 8 October 2012 (UTC)
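
The 15-wiki set described above ({en,de,fr}.{wikipedia,wikibooks,wikiquote,wiktionary} plus Commons, Meta and mediawiki.org) can be enumerated as below. The full host names are assumptions added for illustration; the post only gives the short forms, and this is not the tool's own code.

```python
from itertools import product

languages = ["en", "de", "fr"]
projects = ["wikipedia", "wikibooks", "wikiquote", "wiktionary"]
# Host names for the three extra wikis are assumed, not taken from the post.
extras = ["commons.wikimedia.org", "meta.wikimedia.org", "www.mediawiki.org"]

# Expand the {language}.{project} pattern, then append the extras.
wikis = [f"{lang}.{proj}.org" for lang, proj in product(languages, projects)] + extras

print(len(wikis))  # 3 languages x 4 projects + 3 extras = 15
```

With a 60-second execution limit, each additional wiki in the set eats into the time available for the others, which is consistent with the note that the set cannot grow further.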

EzineArticles.com



What's the matter with this website? --Horcrux92 (talk) 08:32, 12 October 2012 (UTC)

It was blocked due to this request. — billinghurst sDrewth 22:28, 12 October 2012 (UTC)
The following discussion is closed.

billinghurst sDrewth 06:30, 15 November 2012 (UTC)

Friendly reminder

Please don't comment out domains on the blacklist to remove them; just remove them, and please log the removals. Also please log the regexp changes you make. I say this because currently I see some domains commented out and I don't know if that's just temporary or definitive. If those are definitive, they should be removed. Thanks. -- MarcoAurelio (talk) 13:37, 17 October 2012 (UTC)