Jump to content

Talk:Spam blacklist: Difference between revisions

From Meta, a Wikimedia project coordination wiki
Latest comment: 4 years ago by Praxidicae in topic Proposed additions
Content deleted Content added
Line 22: Line 22:
{{Link summary|schooltips.com.ng}}
{{Link summary|schooltips.com.ng}}
{{Link summary|sportinfo.com.ng}}
{{Link summary|sportinfo.com.ng}}
{{Link summary|360ng.com.ng}}
:As per my request for the account locks [https://meta.wikimedia.org/wiki/?diff=prev&oldid=19675961 here] this is being used to spam some garbage site and is potentially a source for hoaxes. [[User:Praxidicae|Praxidicae]] ([[User talk:Praxidicae|talk]]) 15:56, 2 January 2020 (UTC)
:As per my request for the account locks [https://meta.wikimedia.org/wiki/?diff=prev&oldid=19675961 here] this is being used to spam some garbage site and is potentially a source for hoaxes. [[User:Praxidicae|Praxidicae]] ([[User talk:Praxidicae|talk]]) 15:56, 2 January 2020 (UTC)


:{{rto|Praxidicae}} {{Added}} to [[Spam blacklist]]. --[[User:Martin Urbanec|Martin Urbanec]] ([[User talk:Martin Urbanec|talk]]) 16:02, 2 January 2020 (UTC)
:{{rto|Praxidicae}} {{Added}} to [[Spam blacklist]]. --[[User:Martin Urbanec|Martin Urbanec]] ([[User talk:Martin Urbanec|talk]]) 16:02, 2 January 2020 (UTC)
::{{ping|Martin Urbanec}} I added another from this spam set that's being used for the same purpose. Running a report now. Please add this as well. Also pinging {{ping|Ohnoitsjamie}} as this might be of interest to you...[[User:Praxidicae|Praxidicae]] ([[User talk:Praxidicae|talk]]) 16:27, 2 January 2020 (UTC)


==Indian financial scheme spam==
==Indian financial scheme spam==

Revision as of 16:27, 2 January 2020

Shortcut:
WM:SPAM
WM:SBL
The associated page is used by the MediaWiki Spam Blacklist extension, and lists regular expressions which cannot be used in URLs in any page in Wikimedia Foundation projects (as well as many external wikis). Any Meta administrator can edit the spam blacklist; either manually or with SBHandler. For more information on what the spam blacklist is for, and the processes used here, please see Spam blacklist/About.

Proposed additions
Please provide evidence of spamming on several wikis. Spam that only affects a single project should go to that project's local blacklist. Exceptions include malicious domains and URL redirector/shortener services. Please follow this format. Please check back after submitting your report, there could be questions regarding your request.
Proposed removals
Please check our list of requests which repeatedly get declined. Typically, we do not remove domains from the spam blacklist in response to site-owners' requests. Instead, we de-blacklist sites when trusted, high-volume editors request the use of blacklisted links because of their value in support of our projects. Please consider whether requesting whitelisting on a specific wiki for a specific use is more appropriate - that is very often the case.
Other discussion
Troubleshooting and problems - If there is an error in the blacklist (i.e. a regex error) which is causing problems, please raise the issue here.
Discussion - Meta-discussion concerning the operation of the blacklist and related pages, and communication among the spam blacklist team.
#wikimedia-external-linksconnect - Real-time IRC chat for co-ordination of activities related to maintenance of the blacklist.
Whitelists
There is no global whitelist, so if you are seeking a whitelisting of a url at a wiki then please address such matters via use of the respective Mediawiki talk:Spam-whitelist page at that wiki, and you should consider the use of the template {{edit protected}} or its local equivalent to get attention to your edit.

Please sign your posts with ~~~~ after your comment. This leaves a signature and timestamp so conversations are easier to follow.


Completed requests are marked as {{added}}/{{removed}} or {{declined}}, and are generally archived quickly. Additions and removals are logged · current log 2024/07.

SpBot archives all sections tagged with {{Section resolved|1=~~~~}} after 7 days and sections whose most recent comment is older than 15 days.

Proposed additions

This section is for proposing that a website be blacklisted; add new entries at the bottom of the section, using the basic URL so that there is no link (example.com, not http://www.example.com). Provide links demonstrating widespread spamming by multiple users on multiple wikis. Completed requests will be marked as {{added}} or {{declined}} and archived.

shorturl.at



URL shortener used to add spam. — JJMC89(T·C) 20:59, 30 December 2019 (UTC)Reply

@JJMC89: Where are you seeing it used? If you have a look at the XWiki report, you will see that I blacklisted the domain in 2018. I will get COIBot to kick a new report to see if it can also assist.  — billinghurst sDrewth 22:08, 30 December 2019 (UTC)Reply
Already done My bad, I misread the diffs. It was being added without the protocol. — JJMC89(T·C) 22:33, 30 December 2019 (UTC)Reply

www.wikitechy.com





Cross-wiki spam. Tgeorgescu (talk) 15:15, 31 December 2019 (UTC)Reply

Request for 3 domains









As per my request for the account locks here this is being used to spam some garbage site and is potentially a source for hoaxes. Praxidicae (talk) 15:56, 2 January 2020 (UTC)Reply
@Praxidicae: Added Added to Spam blacklist. --Martin Urbanec (talk) 16:02, 2 January 2020 (UTC)Reply
@Martin Urbanec: I added another from this spam set that's being used for the same purpose. Running a report now. Please add this as well. Also pinging @Ohnoitsjamie: as this might be of interest to you...Praxidicae (talk) 16:27, 2 January 2020 (UTC)Reply

Indian financial scheme spam





Request moved from English wikipedia SBL per request due to cross-wiki spam. See COIBot reports for pradhanmantri.info and pmagreement.in for both English and Hindi wikipedias. Ravensfire (talk) 03:33, 2 January 2020 (UTC)Reply

Proposed additions (Bot reported)

This section is for domains which have been added to multiple wikis as observed by a bot.

These are automated reports, please check the records and the link thoroughly, it may report good links! For some more info, see Spam blacklist/Help#COIBot_reports. Reports will automatically be archived by the bot when they get stale (less than 5 links reported, which have not been edited in the last 7 days, and where the last editor is COIBot).

Sysops
  • If the report contains links to less than 5 wikis, then only add it when it is really spam
  • Otherwise just revert the link-additions, and close the report; closed reports will be reopened when spamming continues
  • To close a report, change the LinkStatus template to closed ({{LinkStatus|closed}})
  • Please place any notes in the discussion section below the HTML comment

COIBot

The LinkWatchers report domains meeting the following criteria:

  • When a user mainly adds this link, and the link has not been used too much, and this user adds the link to more than 2 wikis
  • When a user mainly adds links on one server, and links on the server have not been used too much, and this user adds the links to more than 2 wikis
  • If ALL links are added by IPs, and the link is added to more than 1 wiki
  • If a small range of IPs have a preference for this link (but it may also have been added by other users), and the link is added to more than 1 wiki.
COIBot's currently open XWiki reports
List Last update By Site IP R Last user Last link addition User Link User - Link User - Link - Wikis Link - Wikis
vrsystems.ru 2023-06-27 15:51:16 COIBot 195.24.68.17 192.36.57.94
193.46.56.178
194.71.126.227
93.99.104.93
2070-01-01 05:00:00 4 4

Proposed removals

This section is for proposing that a website be unlisted; please add new entries at the bottom of the section.

Remember to provide the specific domain blacklisted, links to the articles they are used in or useful to, and arguments in favour of unlisting. Completed requests will be marked as {{removed}} or {{declined}} and archived.

See also recurring requests for repeatedly proposed (and refused) removals.

Notes:

  • The addition or removal of a domain from the blacklist is not a vote; please do not bold the first words in statements.
  • This page is for the removal of domains from the global blacklist, not for removal of domains from the blacklists of individual wikis. For those requests please take your discussion to the pertinent wiki, where such requests would be made at Mediawiki talk:Spam-blacklist at that wiki. Search spamlists — remember to enter any relevant language code

bet365.com



Website of a notable company with 15 sitelinks in Wikidata.[1] It was added to the blacklist in 2007; the log links to a diff to a page that has no spam blacklist log entries since the log started six years ago. Peter James (talk) 09:46, 31 December 2019 (UTC)Reply

A pure betting website with zero encyclopedic information. The site has used referral link schemes in the past, see this spam diff (and probably still does). It has also made news for false advertising and other questionable business ethics (see en-Wiki article). ==> No possible value + high risk of misuse = I am strongly opposed to removing such a site from the blacklist. GermanJoe (talk) 14:56, 31 December 2019 (UTC)Reply
So you think official websites shouldn't be linked in articles or in Wikidata? The "official website" template was once nominated for deletion ([2], and there was strong consensus to keep it. And if you think companies you dislike shouldn't have their sites linked, that's probably incompatible with NPOV. As for "high risk of misuse", all you can find is another edit by the same person at the same time as the diff I linked, twelve years ago (and on another article that hasn't had much spam - the log for that article only shows two attempts to add a Twitter link in 2017). Peter James (talk) 15:13, 31 December 2019 (UTC)Reply
There are several options such as local whitelisting of an about page or simply adding the site information as raw unlinked text to show the official website in main articles on project level. GermanJoe (talk) 21:25, 31 December 2019 (UTC)Reply

Comment Comment generally the process that the wikis look to for high risk sites is for a local whitelisting of respective /about pages, so that those urls can be added as required, though limits the possibility of abuse. The subject matter is covered in the below discussion about vid.me.  — billinghurst sDrewth 22:27, 31 December 2019 (UTC)Reply

There's no evidence that it is a high risk site, only that one person, probably not involved with the company, added spam links twelve years ago. For one or two sites local whitelisting may be reasonable, but there are 15, and also Wikidata where it isn't possible to add unlinked text as an official website. Peter James (talk) 23:18, 31 December 2019 (UTC)Reply
I am not certain that you can say it is or it isn't a high risk site without someone scanning the whole of spamblacklist logs, rather than just for a specific article. Our recommended process for removing sites from the blacklist is to suggest that whitelisting at a wiki first and see how it progresses. Ask at w:mediawiki talk:spam-whitelist and see how you go. [Noting that this is a consensus-based discussion forum, so you can point to this discussion at any whitelist conversation to see if that community has an opinion of the blacklisting anyway.]  — billinghurst sDrewth 11:33, 1 January 2020 (UTC)Reply

Troubleshooting and problems

This section is for comments related to problems with the blacklist (such as incorrect syntax or entries not being blocked), or problems saving a page because of a blacklisted link. This is not the section to request that an entry be unlisted (see Proposed removals above).

d:Q78682705



Because of the \bvid\.me\b entry, this site can't be the P856 (official website) value of this Wikidata item. How to resolve this? --Liuxinyu970226 (talk) 01:59, 17 December 2019 (UTC)Reply

Also, if in the future there are also many items that are about websites listed in this page, then it's expected that normal users can't add P856 values normally, so can we request such additions by posting this page or not? Or should Wikidata be exempted from global spam blacklist application? --Liuxinyu970226 (talk) 02:06, 17 December 2019 (UTC)Reply
Bit dot ly, TinyURL are likely. --Liuxinyu970226 (talk) 02:10, 17 December 2019 (UTC)Reply
WD is always welcome to utilise their local whitelist to exempt any domain that it chooses, though my understanding is that it isn't that simple, as the further usage at the WPs is not possible due to the blacklisting. There has been that discussion here—which will be in the archives—where Beetstra has propounded on this and I will let Beetstra better express his points rather than do my poor man reproduction.

If you believe that the policy of blacklisting url shorteners is incorrect, then worthwhile raising that matter through a well-structured RFC, as the policy pre-exists WD, and identifying all the aspects of the matter from its collection to its use, and how you expect to deal with spam or abusable urls.  — billinghurst sDrewth 03:39, 17 December 2019 (UTC)Reply

@Billinghurst and Liuxinyu970226: Be VERY careful with this. IF you whitelist, say, \btinyurl\.com\b then that allows for not only the domain link as official homepage on the WikiData item for Tinyurl, but also for a lot of tinyurls everywhere throughout wikidata (there is no reason why the globally blacklisted 'myspammycompany.com' for the wikidata item for MySpammyCompany then cannot have a tinyurl redirect to myspammycompany.com). That gives a plethora of problems down the line: 'tinyurl.com' will be transcluded through external links templates, so a page on en.wikipedia suddenly has a link to tinyurl.com. Since it is not whitelisted on the local wiki, it will hence result in a spam filter block on that local wiki on the next edit. It also results in a spam block for anyone who wants to add that transcluded domain through one of the templates upon adding (in other words, they cannot add that through WD transclusion). If tinyurl.com is blanket whitelisted and starts appearing also on other items on wikidata, it may also result in spam blocks on edits on other pages. Please, do NOT do this. --Dirk Beetstra T C (en: U, T) 06:08, 17 December 2019 (UTC)Reply
@Billinghurst: .. this is not only for url shorteners .. it goes for anything that is blacklisted. Redtube.com will have the same problem, whitelist that and you can just wait for tech-savvy high school vandals to add that as their school's official homepage on wikidata and have it transcluded on hundreds of Wikis at once. --Dirk Beetstra T C (en: U, T) 06:10, 17 December 2019 (UTC)Reply
Sure, I know that. It isn't my job to explain folly to them. I was presuming that they were going to whitelist, add the url, the remove, as they would normally only asking for official webpage.

The bigger story is about the impact and consequences of having added links at WD, and then trying to utilise them at other WMF wikis when they are still blacklisted globally, or locally, and the consequences in editing.  — billinghurst sDrewth 10:54, 17 December 2019 (UTC)Reply

Well, I had to explain it to them quite some time ago once, when they whitelisted something on WD and someone on en.w came complaining they couldn't edit. What en.w locally does is to whitelist the /about page - that is generally a neutral landing page and not the top page (which is often the reason something got blacklisted - pornhub.com is blacklisted because students tend to replace their school website with it), and it is more difficult to 'abuse' (whitelisting tinyurl.com's homepage also allows tinyurl's redirects). But in any case, whether pornhub.com/about or pornhub.com is locally whitelisted on WD, it will impact editing on all wikis that try to transclude the globally blacklisted page, as pages cannot be edited.
What COULD be considered is that our blacklist rule is exempting a neutral landing page (a /about) on each site that we blacklist. --Dirk Beetstra T C (en: U, T) 10:57, 18 December 2019 (UTC)Reply
In Wikipedia they could still be added without linking to the URL. Probably better to use edit filters to block edits such as that, which are vandalism rather than spam. Peter James (talk) 09:59, 31 December 2019 (UTC)Reply
There are already blacklists for specific types of URLs without blocking the entire Google and Amazon websites - could something similar be done here for URLs such as tinyurl.com, possibly using a regex for one or more characters after the domain name? Peter James (talk) 09:59, 31 December 2019 (UTC)Reply
Also the spam blacklist doesn't prevent addition of blacklisted links, it only restricts editing of pages that contain them, so for example I couldn't undo this edit. Peter James (talk) 10:36, 31 December 2019 (UTC)Reply
I beg to differ, I think that you have that back to front. An undo from no links to links is the addition of links. It is looking for "added_links".  — billinghurst sDrewth 11:43, 1 January 2020 (UTC)Reply
@Billinghurst: I just ask if how the P856 values can be added for items, where topics are websites that listed in this blacklist, if the answers are "no" or "not easy", then I will ask Wikidata community to consider technically excluding application of global spam blacklist, and only use local blacklist to anti-abuse. --Liuxinyu970226 (talk) 07:14, 19 December 2019 (UTC)Reply
@Liuxinyu970226: That will have disastrous effects on all wikis that use that data. --Dirk Beetstra T C (en: U, T) 07:40, 19 December 2019 (UTC)Reply
Liuxinyu970226, there are currently existing external links on wikidata that are now blacklisted here on-wiki (these links were spammed to WD before they were blacklisted). There are now on all hundreds of wikis a page where you cannot add the official website by transclusion from WD (like e.g. en:template:Official website does when called without parameters). --Dirk Beetstra T C (en: U, T) 07:51, 19 December 2019 (UTC)Reply
@Liuxinyu970226: It was a pretty naive question, and you were given a fulsome answer to try and cover the range of reasons that you may have been asking. I would think that this is a bigger question than just WD where the urls are used outside of WD. I would think that it may be something that all of the WMF community may have an interest in rather than just the technocrats/puritans at WD. As Beetstra said these were blacklisted as they were abused, not because they had the potential to be abused. If you take it to WD, I look forward to your holistic discussion, not something narrowly focused upon that the spam blacklist stops them being added.  — billinghurst sDrewth 09:15, 19 December 2019 (UTC)Reply
@Billinghurst: see d:Wikidata:Administrators'_noticeboard#Local_spam_filter. From a WD perspective this all makes sense (though they would also get the real crap), but WD is however used by the majority (if not all) other wikis. --Dirk Beetstra T C (en: U, T) 09:39, 19 December 2019 (UTC)Reply

To me, for these items the best option is still to exclude here on meta a neutral landing page. That solves a lot of problems throughout: it enables the WD item to have a representative link in their item that does not result in any problems on other Wikimedia projects (or when local projects want to use that link). That does still protect WD against edits like this and [ this] (can someone tell me why a municipality in Germany needs a link to pornhub.com?). All other options are of a technological level that needs significant changes in the structure of the software that Wikipedia is running on. --Dirk Beetstra T C (en: U, T) 09:59, 19 December 2019 (UTC)Reply

I think we should only permit that for URLs specifically on request, lest the spammers refashion their /about page for promotion. Vermont (talk) 11:11, 19 December 2019 (UTC)Reply
  • Comment Comment global whitelist. It seems to me that there is now the need for global whitelist page. We know that there are dangerous domain names, though for famous sites. Asking for every wikipedia to locally whitelist is now unreasonable, especially in light of WD, and its methodologies.  — billinghurst sDrewth 22:30, 31 December 2019 (UTC)Reply
    • What about negative lookahead? See encyclopediadramatica\.(?:com(?!/Main_Page)|net|org|se) entry, encyclopediadramatica.com/Main_Page should work correctly, unlike the rest of the variants. \bgoo\.gl\b(?!/maps\b).* is similar variation. --Martin Urbanec (talk) 22:35, 31 December 2019 (UTC)Reply

Discussion

This section is for discussion of Spam blacklist issues among other users.

Hi, can I please have some more information regarding the reason that this domain has been blacklisted? https://meta.wikimedia.org/wiki/User_talk:COIBot/XWiki/omnislots.com --Jeditom (talk) 13:36, 2 January 2020 (UTC)Reply

I hope billinghurst I have done it at the correct section.