Jump to content

Talk:Spam blacklist: Difference between revisions

From Meta, a Wikimedia project coordination wiki
Latest comment: 2 years ago by JavaHurricane in topic Proposed additions
Content deleted Content added
→‎londondailypost.co.uk (and friends): remove the section I added
Tags: Manual revert Reverted
Undo revision 22190433 by Salimfadhley (talk): Something's up here... some of these links should be discussed. Currently researching, will comment on what I think is happening soon(ish).
Tag: Undo
Line 209: Line 209:
Cross-wiki linkspam by at least these two users; and others may also exist. ''[[User:JavaHurricane| <span style = "color:green">Java</span>]][[User talk:JavaHurricane|<span style = "color:red">Hurricane</span>]]'' 11:02, 14 October 2021 (UTC)
Cross-wiki linkspam by at least these two users; and others may also exist. ''[[User:JavaHurricane| <span style = "color:green">Java</span>]][[User talk:JavaHurricane|<span style = "color:red">Hurricane</span>]]'' 11:02, 14 October 2021 (UTC)
:+1 user. ''[[User:JavaHurricane| <span style = "color:green">Java</span>]][[User talk:JavaHurricane|<span style = "color:red">Hurricane</span>]]'' 17:25, 14 October 2021 (UTC)
:+1 user. ''[[User:JavaHurricane| <span style = "color:green">Java</span>]][[User talk:JavaHurricane|<span style = "color:red">Hurricane</span>]]'' 17:25, 14 October 2021 (UTC)

=== londondailypost.co.uk (and friends) ===
* {{linksummary|londondailypost.co.uk}}
* {{linksummary|thevistek.com}}
* {{linksummary|sypstudios.com}}
* {{linksummary|sypstudios.com}}
* {{linksummary|thehearup.com}}
* {{linksummary|zobuz.com}}


A set of entirely bogus news-sites which exist for no purpose other than to promote influeners. I was working on #wikipedia-en-help and a paid editor asked us to review these sites. Example here: [[:w:en:Draft:Sami_Mukahhal]] [https://londondailypost.co.uk/sami-mukahhal-the-inspiration-to-many-lebanese-around-the-world/] [https://thevistek.com/sami-mukahhal-the-instagram-filter-that-condones-lebanese-pride/] [https://sypstudios.com/an-instagram-filter-restoring-lebanese-pride-during-covid-19/] [https://thehearup.com/sami-mukahhal-expressing-yourself-via-the-lebanese-and-proud-instagram-filter/16295/] [https://zobuz.com/sami-mukahhal-be-yourself-with-the-lebanese-and-proud-instagram-filter/16134/]


== Proposed additions (Bot reported) ==
== Proposed additions (Bot reported) ==

Revision as of 23:21, 14 October 2021

Shortcut:
WM:SPAM
WM:SBL
The associated page is used by the MediaWiki Spam Blacklist extension, and lists regular expressions which cannot be used in URLs in any page in Wikimedia Foundation projects (as well as many external wikis). Any Meta administrator can edit the spam blacklist; either manually or with SBHandler. For more information on what the spam blacklist is for, and the processes used here, please see Spam blacklist/About.

Proposed additions
Please provide evidence of spamming on several wikis. Spam that only affects a single project should go to that project's local blacklist. Exceptions include malicious domains and URL redirector/shortener services. Please follow this format. Please check back after submitting your report, there could be questions regarding your request.
Proposed removals
Please check our list of requests which repeatedly get declined. Typically, we do not remove domains from the spam blacklist in response to site-owners' requests. Instead, we de-blacklist sites when trusted, high-volume editors request the use of blacklisted links because of their value in support of our projects. Please consider whether requesting whitelisting on a specific wiki for a specific use is more appropriate - that is very often the case.
Other discussion
Troubleshooting and problems - If there is an error in the blacklist (i.e. a regex error) which is causing problems, please raise the issue here.
Discussion - Meta-discussion concerning the operation of the blacklist and related pages, and communication among the spam blacklist team.
#wikimedia-external-linksconnect - Real-time IRC chat for co-ordination of activities related to maintenance of the blacklist.
Whitelists
There is no global whitelist, so if you are seeking a whitelisting of a url at a wiki then please address such matters via use of the respective Mediawiki talk:Spam-whitelist page at that wiki, and you should consider the use of the template {{edit protected}} or its local equivalent to get attention to your edit.

Please sign your posts with ~~~~ after your comment. This leaves a signature and timestamp so conversations are easier to follow.


Completed requests are marked as {{added}}/{{removed}} or {{declined}}, and are generally archived quickly. Additions and removals are logged · current log 2024/07.

SpBot archives all sections tagged with {{Section resolved|1=~~~~}} after 7 days and sections whose most recent comment is older than 15 days.

Proposed additions

This section is for proposing that a website be blacklisted; add new entries at the bottom of the section, using the basic URL so that there is no link (example.com, not http://www.example.com). Provide links demonstrating widespread spamming by multiple users on multiple wikis. Completed requests will be marked as {{added}} or {{declined}} and archived.

worldaffairsjournal.org



{{onhold|date=2021-9-26}} Usurped domain that is now a gambling site. Lots of sites linked to it so it will need to be cleansed prior to blacklisting.  — billinghurst sDrewth 08:03, 19 September 2021 (UTC)Reply

@Billinghurst: Added Added to Spam blacklist. -- — billinghurst sDrewth 13:12, 3 October 2021 (UTC)Reply

Doxbin



Per en:User:MillerLeut/sandbox/Doxbin a successor 'pastebin' for Doxbin ('Doxbin, is the one of the many successors to the now-defunct Doxbin'), and these are now alternative URLs to it (the original was already blocked):























I see not much reason why we would allow the use of these if we already blacklisted the original. -- Dirk Beetstra T C (en: U, T) 13:50, 26 September 2021 (UTC)Reply

This (en-admin-only) diff may also be of interest. --Dirk Beetstra T C (en: U, T) 13:52, 26 September 2021 (UTC)Reply

@Beetstra: Added Added to Spam blacklist. will do as two regex based on doxbin and dox -- — billinghurst sDrewth 13:17, 3 October 2021 (UTC)Reply

loto188.pw









Cross-wiki spam. SCP-2000 09:45, 1 October 2021 (UTC)Reply

@SCP-2000: Added Added to Spam blacklist. -- — billinghurst sDrewth 13:14, 3 October 2021 (UTC)Reply

typecite.com







IPs keep trying to replace links to citationmachine.net with this. Active again yesterday; was previously reported in August. Aranya (talk) 02:51, 3 October 2021 (UTC)Reply

@Aranya: Added Added to Spam blacklist. --Sgd. —Hasley 03:29, 3 October 2021 (UTC)Reply
@Aranya: can you please tell me what these websites are? Both for typecite.com and citationmachine.net? I saw a replacement of citationmachine.net for typecite.com which I reverted .. and then I decided to remove citationmachine.net as well as being utterly useless where it was used. Dirk Beetstra T C (en: U, T) 08:07, 3 October 2021 (UTC)Reply
@Beetstra: Both seem to be tools for creating citations in APA, MLA formats, etc. Citation Machine is much older, now owned by the company Chegg, and TypeCite seems to have just launched in April, hence its recent spamming. Definitely agree with that removal - either URL doesn't have a place in actual citations. Best, Aranya (talk) 12:06, 3 October 2021 (UTC)Reply
@Aranya: that was a bit my feeling - am I correct that outside of the official website on en:Citation Machine there should be NO instances of that link in mainspace in en.wikipedia? Dirk Beetstra T C (en: U, T) 12:41, 3 October 2021 (UTC)Reply
@Beetstra: I think that's reasonable! I'm not seeing any cases where it would necessary in mainspace. Best, Aranya (talk) 15:44, 3 October 2021 (UTC)Reply

Casino spam

































































































































See Talk:Wikiproject:Antispam#Linkspam online casinos et. al.. Two additional domains still had usages:





MER-C 10:47, 3 October 2021 (UTC)Reply

@MER-C: Added Added to Spam blacklist. --Dirk Beetstra T C (en: U, T) 12:36, 3 October 2021 (UTC)Reply

ashby-wells.blogbright.net



Slots spam by spambots, last seen at this page. JavaHurricane 04:27, 4 October 2021 (UTC)Reply

@JavaHurricane: Added Added to Spam blacklist. --Dirk Beetstra T C (en: U, T) 05:31, 4 October 2021 (UTC)Reply

eroticnut.com



Porn spam, last seen at w:en:User:PhilippDrakeford. JavaHurricane 10:33, 5 October 2021 (UTC)Reply

@JavaHurricane: Added Added to Spam blacklist. --Dirk Beetstra T C (en: U, T) 11:03, 10 October 2021 (UTC)Reply

valeologija.ru



Russian medical spam, last seen at w:en:User talk:AlyciaHillen190. JavaHurricane 10:33, 5 October 2021 (UTC)Reply

@JavaHurricane: Added Added to Spam blacklist. --Dirk Beetstra T C (en: U, T) 11:04, 10 October 2021 (UTC)Reply

automexico.com



xwiki spam, added by multiple accounts. Sgd. —Hasley 16:38, 6 October 2021 (UTC)Reply

Added Added to Spam blacklist. --Sgd. —Hasley 16:39, 6 October 2021 (UTC)Reply

pendekarslot123.com



Casino spam from spambots; see this page for an example I recently found. JavaHurricane 15:10, 7 October 2021 (UTC)Reply

@JavaHurricane: Already Done per User:COIBot/XWiki/pendekarslot123.com#Discussion. --Dirk Beetstra T C (en: U, T) 11:05, 10 October 2021 (UTC)Reply

167.71.211.180



Betting spam from bots, an example is at this page. JavaHurricane 15:19, 7 October 2021 (UTC)Reply

@JavaHurricane: already Done per User:COIBot/XWiki/167.71.211.180#Discussion. --Dirk Beetstra T C (en: U, T) 11:06, 10 October 2021 (UTC)Reply

fintechaz.com











Cross-wiki spam. SCP-2000 11:19, 10 October 2021 (UTC)Reply

@SCP-2000: Currently also an open request on zh.wikipedia. Added Added to Spam blacklist. --Dirk Beetstra T C (en: U, T) 12:11, 10 October 2021 (UTC)Reply

mygenerix.com



Alternate medicine spam. See User:Buymygenerix for an example. JavaHurricane 11:07, 13 October 2021 (UTC)Reply

bitlinkstech.com





























Spammed on enwiki, zhwiki. —Bruce1eetalk 06:35, 14 October 2021 (UTC)Reply

@Bruce1ee: Added Added to Spam blacklist. --Dirk Beetstra T C (en: U, T) 06:55, 14 October 2021 (UTC)Reply

mobisoftinfotech.com









Cross-wiki linkspam by at least these two users; and others may also exist. JavaHurricane 11:02, 14 October 2021 (UTC)Reply

+1 user. JavaHurricane 17:25, 14 October 2021 (UTC)Reply

londondailypost.co.uk (and friends)














A set of entirely bogus news-sites which exist for no purpose other than to promote influeners. I was working on #wikipedia-en-help and a paid editor asked us to review these sites. Example here: w:en:Draft:Sami_Mukahhal [1] [2] [3] [4] [5]

Proposed additions (Bot reported)

This section is for domains which have been added to multiple wikis as observed by a bot.

These are automated reports, please check the records and the link thoroughly, it may report good links! For some more info, see Spam blacklist/Help#COIBot_reports. Reports will automatically be archived by the bot when they get stale (less than 5 links reported, which have not been edited in the last 7 days, and where the last editor is COIBot).

Sysops
  • If the report contains links to less than 5 wikis, then only add it when it is really spam
  • Otherwise just revert the link-additions, and close the report; closed reports will be reopened when spamming continues
  • To close a report, change the LinkStatus template to closed ({{LinkStatus|closed}})
  • Please place any notes in the discussion section below the HTML comment

The LinkWatchers report domains meeting the following criteria:

  • When a user mainly adds this link, and the link has not been used too much, and this user adds the link to more than 2 wikis
  • When a user mainly adds links on one server, and links on the server have not been used too much, and this user adds the links to more than 2 wikis
  • If ALL links are added by IPs, and the link is added to more than 1 wiki
  • If a small range of IPs have a preference for this link (but it may also have been added by other users), and the link is added to more than 1 wiki.
COIBot's currently open XWiki reports
List Last update By Site IP R Last user Last link addition User Link User - Link User - Link - Wikis Link - Wikis
vrsystems.ru 2023-06-27 15:51:16 COIBot 195.24.68.17 192.36.57.94
193.46.56.178
194.71.126.227
93.99.104.93
2070-01-01 05:00:00 4 4

Proposed removals

This section is for proposing that a website be unlisted; please add new entries at the bottom of the section. Use a suitable 3rd level heading and display the domain name as per this example {{LinkSummary|targetdomain.com}}.

Remember to provide the specific domain blacklisted, links to the articles they are used in or useful to, and arguments in favour of unlisting. Completed requests will be marked as {{removed}} or {{declined}} and archived.

See also recurring requests for repeatedly proposed (and refused) removals.

Notes:

  • The addition or removal of a domain from the blacklist is not a vote; please do not bold the first words in statements.
  • This page is for the removal of domains from the global blacklist, not for removal of domains from the blacklists of individual wikis. For those requests please take your discussion to the pertinent wiki, where such requests would be made at Mediawiki talk:Spam-blacklist at that wiki. Search spamlists — remember to enter any relevant language code


Sci-Hub





Hi there, it's me on Sci-Hub again (archives: 2020-01 (1), 2020-01 (2), 2020-03, 2020-11). The sci-hub.do domain has just been "banned" (whatever it means), so it has been moved to sci-hub.ru. Therefore we need to de-blacklist at least these two (both) at least temporarily. --colt_browning (talk) 06:23, 28 August 2021 (UTC)Reply

@Colt browning: I have changed the regex to allow for the modification of urls to sci-hub.ru, there is no requirement to modify to allow for .do usage. I will look at this again in a few days to return to the status quo. After that you will need to apply for local whitelisting.  — billinghurst sDrewth 23:38, 29 August 2021 (UTC)Reply
 temporary done; no permanent removal I cleaned up the links, and have returned the domain to the blacklist.  — billinghurst sDrewth 08:28, 30 August 2021 (UTC)Reply
@Billinghurst: Thank you but there's actually one more thing that needs to be done. Someone has to modify the Wikidata entry Sci-Hub (Q21980377) to set end time (P582) for sci-hub.do to August 2021 and add the new URL sci-hub.ru with Preferred rank and start time (P580) set to August 2021. --colt_browning (talk) 09:17, 30 August 2021 (UTC)Reply
@Colt browning: Please seek whitelisting at WD for a single entry as that is instantaneous, no lag as we will have with the global blacklist. Thanks. Apologies, it wasn't showing in global search.  — billinghurst sDrewth 11:02, 30 August 2021 (UTC)Reply

Discussion

This section is for discussion of Spam blacklist issues among other users.

pollutionissues.com



I used www.pollutionissues.com/Ho-Li/Labor-Farm.html as a source for information about Cesar Chavez's activities in 1947. I received the following information when I tried to publish the page: "Error: Your edit was not saved because it contains a new external link to a site registered on Wikipedia's blacklist or Wikimedia's global blacklist... The following link has triggered a protection filter: pollutionissues.com"

But I could not find the site listed at either Wikipedia's blacklist or Wikimedia's global blacklist. I do not understand, please help me figure it out. Nor does it make sense to request whitelisting the page if the site itself isn't blacklisted (as it doesn't seem to be). Larrykoen (talk) 16:29, 30 September 2021 (UTC)Reply

Larrykoen, it's on the enwiki spam blacklist. It was added two years ago as a result of this discussion. GeneralNotability (talk) 16:34, 30 September 2021 (UTC)Reply
I see that pollutionissues.com is in both lists that you experts mention. But it seems to me that the message that I received should reference a list, some list, where the offending Web site is actually listed, so I don't waste this valuable real estate with my questions.
Regardless, the various explanations for blacklisting pollutionissues.com at "this discussion" do not seem to apply to the page I cited, http://www.pollutionissues.com/Ho-Li/Labor-Farm.html. I have written to the putative author, José B. Cuellar, at [josecuel at sfsu.edu his academic email address], to let him know that Wikipedia blocks his article from being cited, and to request an alternate URL that I can cite for his work.
Finally, what is being "Declined"? Larrykoen (talk) 19:41, 30 September 2021 (UTC)Reply
Larrykoen, "declined" means "this isn't a meta-wiki blacklisting, so nothing to do here." It's an enwiki blacklist entry, so you will have to ask at w:en:MediaWiki_talk:Spam-blacklist. GeneralNotability (talk) 14:59, 1 October 2021 (UTC)Reply

=> Defer to w:en:Mediawiki talk:spam-blacklist  — billinghurst and noting that it is also blocked at another two wikis that mimic enWP's blacklist. sDrewth 14:04, 4 October 2021 (UTC)Reply

<COIBot> 1: [w:en (bl)] \bpollutionissues\.com\b  (pollutionissues.com )
<COIBot> 2: [w:ast (bl)] \bpollutionissues\.com\b  (pollutionissues.com )
<COIBot> 3: [w:ms (bl)] \bpollutionissues\.com\b  (pollutionissues.com )
<COIBot> The term 'pollutionissues.com' found in 3 rules.

I will work this out a bit more: User:Larrykoen, the site is not listed here on this wiki (meta.wikimedia.org) and hence delisting here is 'declined' - people here cannot do anything here since it is listed on en.wikipedia.org, a local wiki. It is on en:MediaWiki:Spam-blacklist as '\bpollutionissues\.com\b' (it is converted to a en:Regex that matches pollutionissues.com). That also means that it is not specifically blacklisting a document by the putative author, it is blacklisting everything on that website. No external link that matches that rule ('\b' matches a word-boundary, '\.' is to match a literal '.') can be added, including your reference, all other documents on that website, and the root domain (pollitionissues.com).

It was added (by me) through this discussion: en:MediaWiki_talk:Spam-blacklist/archives/April_2019#Advameg_sites_(city-data.com,_filmreference.com,_etc.) (2019) and en:MediaWiki_talk:Spam-whitelist/Archives/2007/12#encyclopedia.stateuniversity.com (2007!) (see also Talk:Spam_blacklist/Archives/2009-08#stateuniversity.com, https://meta.wikimedia.org/w/index.php?title=Talk:Spam_blacklist&oldid=485324#www.stateuniversity.com, and several other discussions on en.wikipedia which have been linked through these discussions).

Now back to www.pollutionissues.com/Ho-Li/Labor-Farm.html .. that is a chapter from a book/encyclopedia: "Pollution A-Z, volume 2 (L-Z)" (or " by Editor Richard M Stapleton with a copyright in 2004; ISBN 0-02-865700-4 (set : hardcover : alk. paper) — ISBN 0-02-865701-2 (v. 1); This title is also available as an e-book. ISBN 0-02-865905-8 (set); see https://www.google.com/search?tbm=bks&q=isbn:0028657004.

I hope that the availability of the same document in a book/encyclopedia does not coincide with the ".. often brought to attention in spam reports, reliable sources discussions, and related to copyright violations ...".

I concur with  Declined, and would likely also decline this on en.wikipedia for delisting/whitelisting. --Dirk Beetstra T C (en: U, T) 11:01, 5 October 2021 (UTC)Reply