
Talk:Spam blacklist

From Meta, a Wikimedia project coordination wiki
This is an archived version of this page, as edited by Curps~metawiki (talk | contribs) at 00:21, 13 February 2005 (→‎Please add (spambot, part 4)). It may differ significantly from the current version.

If you are having problems with the spam list and aren't a spammer please include a link to the article you are having trouble saving and say which URL (without the leading http part) you are told is blocked. Any meta administrator can edit the spam blacklist.

Please remove

Add new items to the end of the list. For each include:

  • The URL the error message mentions without the http:// prefix.
  • Links to the article or articles you were editing. Use an external link if you're not sure what the right interwiki link is.
  • See /completed removals for removals which have been processed (either removed or reasons given for choosing not to remove). If not removed, you can still use a plain text form of the link.
  • www.77002.com - Why it is not spam: although commercial, this is a legitimate club and music production site. I'm posting in order to have some info about the Houston night life: http://en.wikipedia.org/w/index.php?title=User:Forkbat/EMDG. I'll have to move it somewhere else if I can't include 77002.com. Thanks.
  • unknown filter - blocks links to forums.merkey.net at en:Google_bomb. We credit the site as the winner of the Nigritude Ultramarine contest. Being extremely not well versed in anything computerish, I've done a kludge (I think that's the right term) by identifying the site without linking. Any alternative solution is fine with me. -- JamesMLane of en Wikipedia. 165.247.32.100 05:36, 14 Dec 2004 (UTC)
    • When I went to save the article, I found the same story with blog.outer-court.com/googlebomb/. The link (Googlebomb Watch) seems worth keeping. 165.247.32.100 05:46, 14 Dec 2004 (UTC)
  • -china\.com - blocks link to www.maps-of-china.com at en:Guilin. The link shows the map of Guilin Paranoid 14:31, 18 Dec 2004 (UTC)
  • y365.com - blocks links to rugby365.com at it:Rugby a 15. I tried to make a change to an article on the Italian wikipedia and when I tried to submit there was an error message something about spam and the website rugby365.com. I didn't add this link and it was already in the article, yet some sort of spam filter is stopping me making unrelated changes. The error message wasn't very descriptive. After doing a bit of digging I finally found that a new spam filter has been put in. Although rugby365.com does not appear on the blacklist, y365.com does and I guess that that is matching. Could someone look into this. Also maybe improve the Italian spam message. -- 219.88.206.155 03:22, 16 Dec 2004 (UTC)~
  • 1stop- - blocks links to www.1stop-penguins.com at en:Penguins. This is what I would call overly-zealous blocking. Can't the filter be improved to allow links that are already part of the article text, and just disallow those that are added in the diff? Please let me know, or just reinstate the link in the article, when this is fixed (I commented it out and put a space in it) PhilHibbs 12:15, 17 Dec 2004 (UTC)
    • Update: I've reinstated the link, without the http:// but it still needs fixing due to inconsistent rendering. PhilHibbs 12:21, 17 Dec 2004 (UTC)
  • SOMETHING is blocking laws.findlaw.com; user w:User:Soren9580 requests access to this so they can edit w:Abington School District v. Schempp. Help! Incidentally, this is the first time I've encountered the spam block... Oh, incidentally, you are blocking wired.com. I just got blocked! Also note that other unrelated sites are being blocked, can't tell you which ones. Sorry. - Ta bu shi da yu 13:33, 20 Dec 2004 (UTC)
  • Can an admin please remove "freewebpage\.org" as from a week or so ago it seems to be stopping me from having a link to my homepage, as so many others do, on my talk/user page. Or is the problem caused by the problem Fvw mentioned? My home-page is: wikiuser.freewebpage.org/. Thanks. 217.204.65.210 17:03, 21 Dec 2004 (UTC)
  • tsinghua.edu.cn was blocked as a link from zh:北京 and online.sh.cn from zh:上海. I don't know why; please remove them from the spam blacklist.--Fanghong 14:01, 1 Jan 2005 (UTC)
  • Mustafaa and Fanghong - as well as my above request I also posted this on the page of the person maintaining this "spam" list:-

"Help please. Hello. Can you tell why "freewebpage\.org" is on the block list. It would be better to block specific freewebpage\.org user names as I can't put a link to my homepage on my user page as so many do. freewebpage\.org itself is just a free web page supplier like Yahoo. I left a note on the discussion page but no one replied. thank you. 217.204.65.210 21:42, 29 Dec 2004 (UTC)"

I think you'll find that it's par for the course for The Wikipedia here, and that we'll just be ignored and listed as spammers for no reason. 217.204.65.210 19:53, 5 Jan 2005 (UTC)

  • latex- - it's blocking everything which is connected with the LaTeX system (e.g. LaTeX's logo) --Chepry 03:08, 9 Jan 2005 (UTC)
  • adultweb - blocks links to www.adultweblaw.com/laws/childporn.htm at en:Child pornography (this is not a porn site, this is a site with legal information).Paranoid 07:49, 10 Jan 2005 (UTC)
  • unknown filter - blocks link to magazine.14850.com/9403/censorship.html at en:Child pornography. A reference to a magazine article. Paranoid 07:49, 10 Jan 2005 (UTC)
  • infoweb.co.nz - A while back someone I know started to spam wikis with my domain name in an attempt to annoy me personally. Would it be possible to have infoweb removed from the black list -- User:Thing2b
  • www.66163.com - Fujian Today website --Shizhao 06:33, 27 Jan 2005 (UTC)
  • princess-mononoke.com - I updated the Princess Mononoke page [1] with the archived official site, which is on the spam list. Please remove, and if you want to you can change my redirector to the original URL. Thanks. 7 Feb 2005
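Several of the requests above (y365.com catching rugby365.com, 1stop- catching www.1stop-penguins.com, latex- catching LaTeX-related links, -china\.com catching www.maps-of-china.com) look like the same problem: a short fragment matched anywhere in a URL. A minimal sketch of that effect, assuming the fragments are joined into one unanchored regular expression; how the real filter combines its entries is not stated on this page, and the `narrow` rule is only a hypothetical alternative:

```python
import re

# Fragments quoted in the requests above, treated as unanchored regex pieces.
# Joining them with "|" is an assumption made for illustration.
fragments = [r"y365\.com", r"1stop-", r"latex-", r"-china\.com"]
broad = re.compile("|".join(fragments), re.IGNORECASE)

def blocked_by(pattern, url):
    """Return the text that triggered the match, or None if the URL passes."""
    m = pattern.search(url)
    return m.group(0) if m else None

# Hypothetical narrower rule: match y365.com only as a whole host name
# (or a subdomain of it), not as the tail of a longer name.
narrow = re.compile(r"(?:^|[/.])y365\.com(?:[/:]|$)", re.IGNORECASE)

for url in ["rugby365.com", "www.1stop-penguins.com",
            "www.latex-project.org", "www.maps-of-china.com"]:
    print(url, "->", blocked_by(broad, url))
```

With the anchored form, rugby365.com would no longer be caught while y365.com itself still would be.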

Please add

Add new items to the end of the list. See /completed removals for additions which have been processed (either added or reasons given for choosing not to add). For each include:

  • Links to one or more page diffs which show the spam being added. Preferably in many different wikis. Admins can check for cross-wiki spam ads using a special page.

Do not include:

  • The URLs being promoted - they won't be added without a link to them. We need to document why we added something.
  • Anything where you can't point to a diff showing the spam being added - we also have to ensure that false requests aren't made for addition or deletion.

Show restraint in requesting additions for:

  • Spammers who spam once.
  • Spammers who can be effectively blocked with IP blocks - use those instead.
  • www.mobileunlocking.4t.com. I would like to add it to the list. Is there a protocol how to do that? Maybe an email address where these urls can be sent to? Another thing, if these urls are put on this page. Isn't the spam effective after all then? Maybe they should be mangled, something like what I did to the above one. Anyway, I think this is a great idea. Fixing the problem right at the root. Polyglot 14:22, 17 Dec 2004 (UTC)

Here is another one: caiyin.to8.com 彩印 Polyglot 09:17, 19 Dec 2004 (UTC) In fact it's the to8.com part that is the common denominator. This user has come back 4 times already to spamvertise. Polyglot 09:20, 19 Dec 2004 (UTC)

And some more (all added in one go): I tried to clean them up a bit, but the list keeps going on and on. I did however strip the h t t p part.

I've removed them all because none contained sufficient information to determine that they were the source of repeat spamming which couldn't be blocked with an IP block. This list is very high overhead and only for hard cases which can't be dealt with using routine blocking methods. If IP blocks don't work and the spam is regular or in lots of wikis, it's a good candidate for listing here. If you haven't tried blocking and found it to be ineffective, it's not worth the great amount of time involved in verifying and adding a new entry, then dealing with any accidental blockings related to the listing. Jamesday 11:51, 20 Dec 2004 (UTC)

en.wiktionary is under attack again by the "CCNGROUP-TJ". Spam came from 60.25.127.208 and advertised "www.88558888.com" this time. Diff here. — Hippietrail 12:24, 20 Dec 2004 (UTC)

All of the 5+ character numeric .com sites are now blocked as of 20 December. So are the others I know this particular spammer is using. Jamesday 21:23, 10 Jan 2005 (UTC)
Added, one on 20 December, others today. Jamesday 21:23, 10 Jan 2005 (UTC)
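For readers wondering what a rule for "5+ character numeric .com sites" might look like, here is a rough sketch. The actual blacklist entry is not quoted on this page, so the pattern below is an assumption, not the rule that was added:

```python
import re

# Assumed shape of the rule: a host label made of five or more digits,
# directly followed by ".com". Not the actual blacklist entry.
NUMERIC_COM = re.compile(r"(?:^|[/.])\d{5,}\.com\b")

def is_numeric_com(url):
    """True if the URL contains an all-digit label of 5+ digits before .com."""
    return NUMERIC_COM.search(url) is not None

for url in ["www.88558888.com", "www.77002.com",
            "magazine.14850.com/9403/censorship.html", "rugby365.com"]:
    print(url, is_numeric_com(url))
```

A rule of this shape would also catch hosts such as www.77002.com and magazine.14850.com, both of which appear in the removal requests, so it trades coverage against false positives.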
  • Please check this edit [3]. Tomos 21:13, 23 Dec 2004 (UTC)
Added and added the promoted site. Like the one which said "Fraud people together" in a banner at the top of the page.:) Jamesday 21:23, 10 Jan 2005 (UTC)
  • Please check this revision. Since this is the first revision of a page, there is no diff. [4] Tomos 04:28, 24 Dec 2004 (UTC)
Added. Jamesday 21:23, 10 Jan 2005 (UTC)
  • home.graffiti.net. [5], [6], [7] these are just 3 diffs from hundreds. This link was put in a wide range of articles by a very awful spammer. Apparently the spam-bot found the articles by scanning the RC. This happened the last few days now in the german wikipedia, always for about half an hour and always with another IP for each edit, up to 3 edits per minute :-( So please add this URI to the blacklist. Thanx -Bdk 11:33, 26 Dec 2004 (UTC)
Added on 26 December. Jamesday 21:23, 10 Jan 2005 (UTC)
  • www.asinah.org. See e.g. [8] for a mass-adding of "weather in" weblinks. There are several other URL from that server, all giving nearly no additional value to the article. Ahoerstemeier 21:25, 29 Dec 2004 (UTC)
Not adding at the moment - that's too much like something a legitimate user would do because it seemed useful to them. Jamesday 21:23, 10 Jan 2005 (UTC)
  • *.europe-countries.com
  • www.europe-atlas.com

He keeps spamming wikipedia geography related articles on many languages. See http://it.wikipedia.org/w/index.php?title=Estonia&diff=345862&oldid=345846 and, more important: http://www.google.it/search?hl=it&as_qdr=all&q=+%22europe.countries.com%22+site%3Awikipedia.org --M/ 22:16, 30 Dec 2004 (UTC)

The ones I looked at appeared useful additions (better maps than in the article, useful information pages) and a check of all recent changes edits in all projects for the last week found nothing by the 256 IP address block that IP address was in, so I haven't added this. Looks more like someone using a bot to add basic useful information. Jamesday 21:23, 10 Jan 2005 (UTC)
  • www.golfcards.com and www.golftour.de commercial links that are frequently inserted in german de:Golf (Sport). Hadhuey 17:49, 6 Jan 2005 (UTC)
  • www.golfcards.com (as above) and www.presidentcard.com, commercial links that are frequently inserted in Danish da:Golf. Byrial 19:12, 11 Jan 2005 (UTC)
  • razored.net, has been repeatedly spammed to the indian ocean earthquake and tsunami articles on en:. Fvw 20:21, 15 Jan 2005 (UTC)
  • syn.cs.pdx.edu/wiki got linkspammed the other day: [9]. Source IP was 212.164.71.254. If you google for that ip, you see a lot of wiki logs that have been similarly spammed.

--24.22.27.155 04:06, 20 Jan 2005 (UTC)

  • Car-related articles on en.wikipedia are getting spammed with links to www.autospectator.com. [10] is an example diff. This site is simply an advertising vehicle that attempts to get high enough Google rankings to get hits, but has only trivial content. Linking it here is an attempt to get Google ranking, and it should IMO be blocked. Morven 17:56, 21 Jan 2005 (UTC)
  • home.tiscali.be/wallpaperheaven/ - [11], just one diff from many. This link was put in a wide range of articles by an awful spammer. Apparently the spam-bot found the articles by scanning the RC and the main page of the german wikipedia. This happened always with another IP for each edit, up to 3 edits per minute :-( So please add this URI to the blacklist. Thanx a lot --:Bdk: 00:19, 28 Jan 2005 (UTC)
Because spamming was going on, I added home.tiscali.be (I wanted to add with subdomain, but I couldn't find the way.) Because it is a big site, I would like someone to fix it in a more proper way.) --Aphaia | WQ2翻訳中 | talk 02:45, 30 Jan 2005 (UTC)
  • www.travel-images.com, annoying commercial site that sells images.

Pls see diff http://it.wikipedia.org/w/index.php?title=Marocco&diff=388364&oldid=388363 and spam diffusion at: http://www.google.it/search?as_q=travel-images&num=10&as_sitesearch=wikipedia.org --M/ 11:18, 28 Jan 2005 (UTC)

Added. --Aphaia | WQ2翻訳中 | talk 20:23, 7 Feb 2005 (UTC)
  • cgi.ebay.com, some users try to raise their sale by adding the link to their product on ebay. Possibly also include other language ebay sites. (for an example, see these contributions for February 6, until the IP got blocked) Chris 73 02:15, 6 Feb 2005 (UTC)
Added. --Aphaia | WQ2翻訳中 | talk 20:23, 7 Feb 2005 (UTC)
  • bertelsmann2club.be.funpic.de calls itself an FAQ but has no special information; the webmaster just earns money when you sign up with his referral.
  1. http://de.wikipedia.org/w/index.php?title=Bertelsmann_AG&diff=3112349&oldid=2985486
  2. http://de.wikipedia.org/w/index.php?title=Bertelsmann_AG&diff=0&oldid=4429338

--82.141.59.242 14:23, 9 Feb 2005 (UTC)


en:PHP is hit by a persistent spambot that also attacked other wikis. See en:Wikipedia:Administrators' noticeboard#Continuing spambot attacks. A number of links are added frequently, and maybe it would help to put them on the blocklist. Some of these are:

  • incoherent.fragment.6x.to


-- 133.240.96.4 08:16, 10 Feb 2005 (UTC)

[17] ... and so on --141.76.1.121 00:13, 13 Feb 2005 (UTC)

    • This looks like an isolated attack that should be dealt with by protection and blocking. silsor 00:19, Feb 13, 2005 (UTC)

Diff

As suggested by Jamesday on IRC, here is a selection of diff URLs of typical spams from the biggest spammer on the English Wiktionary since October. The spams come from various IP ranges, the content contains either Chinese or the same badly translated into English. The URLs in the spam also change and seem to be something like tinyurls:

"211.90.129.176" 02:22, 5 Nov 2004 Hippietrail blocked  with an expiry time of 48 hours (quit spamvertising already) 
http://en.wiktionary.org/w/wiki.phtml?title=Wiktionary:Esperanto_index&diff=120125&oldid=120123
http://en.wiktionary.org/w/wiki.phtml?title=Wiktionary:Esperanto_index&diff=112487&oldid=112087

"218.17.80.112" 10:08, 15 Nov 2004 Paul G blocked  with an expiry time of 14 days (Persistent spamming) 
http://en.wiktionary.org/w/wiki.phtml?title=Qur%27an&diff=117271&oldid=117268
http://en.wiktionary.org/w/wiki.phtml?title=Help:Contents&diff=117242&oldid=117235

"218.18.12.28" 10:36, 16 Nov 2004 Hippietrail blocked  with an expiry time of 2 days (the spamvertiser returns) 
http://en.wiktionary.org/w/wiki.phtml?title=Help:Contents&diff=117645&oldid=117642
http://en.wiktionary.org/w/wiki.phtml?title=Wiktionary:Community_Portal&diff=117650&oldid=117637
: done to here ~~~~
"218.18.87.200" 11:39, 4 Nov 2004 Hippietrail blocked  with an expiry time of 24 hours (spamvertising again) 
http://en.wiktionary.org/w/wiki.phtml?title=Slovene&diff=114324&oldid=114321
http://en.wiktionary.org/w/wiki.phtml?title=Russian&diff=114063&oldid=113973

"219.133.112.138" 07:22, 5 Nov 2004 Hippietrail blocked  with an expiry time of 48 hours (repeat spamvertiser) 
http://en.wiktionary.org/w/wiki.phtml?title=Wiktionary:Dutch_index&diff=114587&oldid=114569

"221.196.11.2" 07:18, 20 Nov 2004 Eclecticology blocked  with an expiry time of 168 hours (vandalism) 
http://en.wiktionary.org/w/wiki.phtml?title=Exponere&diff=118987&oldid=118968
http://en.wiktionary.org/w/wiki.phtml?title=Devil&diff=118982&oldid=118966

"221.196.82.226" 07:02, 13 Nov 2004 Hippietrail blocked  with an expiry time of 96 hours (spamdork strikes again) 
http://en.wiktionary.org/w/wiki.phtml?title=Megalomania&diff=116818&oldid=116812
http://en.wiktionary.org/w/wiki.phtml?title=Wiktionary:Recentchanges&diff=116817&oldid=116811

"221.197.16.90" "221.197.18.150"13:04, 10 Dec 2004 Polyglot blocked  with an expiry time of 504 hours (vandalizing Wiktionary 3 days in a row) 
http://en.wiktionary.org/w/wiki.phtml?title=Blue&diff=116635&oldid=116565
http://en.wiktionary.org/w/wiki.phtml?title=Onomatopoetic&diff=115406&oldid=114914

"221.197.18.150"13:04, 10 Dec 2004 Polyglot blocked  with an expiry time of 504 hours (vandalizing Wiktionary 3 days in a row) 
http://en.wiktionary.org/w/wiki.phtml?title=Soror&diff=125963&oldid=125957
http://en.wiktionary.org/w/wiki.phtml?title=Talk:Transwiki:Chortle&diff=125381&oldid=125312

"221.197.19.225" 06:49, 26 Nov 2004 Hippietrail blocked  with an expiry time of 4 days (spamvertiser) 
http://en.wiktionary.org/w/wiki.phtml?title=Mi&diff=124180&oldid=124103
http://en.wiktionary.org/w/wiki.phtml?title=Basketbrawl&diff=120998&oldid=120985

"60.25.120.218" 13:48, 11 Oct 2004 Hippietrail blocked  with an expiry time of 24 hours (continuing spamvertising) 
http://en.wiktionary.org/w/wiki.phtml?title=Lead&diff=108477&oldid=108407
http://en.wiktionary.org/w/wiki.phtml?title=Phoneme&diff=108562&oldid=108346

"60.25.122.177" 08:25, 17 Nov 2004 Hippietrail blocked  with an expiry time of 3 days (spamvertiser) 
http://en.wiktionary.org/w/wiki.phtml?title=Nu&diff=117897&oldid=117808
http://en.wiktionary.org/w/wiki.phtml?title=User:Gangleri/tests&diff=117898&oldid=117805

"61.145.136.171" 06:39, 11 Oct 2004 Polyglot blocked  with an expiry time of 168 hours (repeated vandalism in several pages over the last 24 hours) 
http://en.wiktionary.org/w/wiki.phtml?title=User:D%C5%82ugosz&diff=108377&oldid=108351
http://en.wiktionary.org/w/wiki.phtml?title=Help:Contents&diff=108241&oldid=10817

Hippietrail 13:50, 11 Dec 2004 (UTC)

Spam protection filter bugs

(moved from User talk:Angela)

I got the above message for the first time today, more than once. Many of the sites (because of which I got the above message) are not listed on m:Spam blacklist. Some users link to google on votes for deletion page and so, I cannot edit even those pages. Utcursch 13:37, 20 Dec 2004 (UTC)

I have the same problem, on en:Viktor Yushchenko page. When I try to save the page, I get a message that biz.yahoo.co.uk and reuters.com are on m:Spam blacklist, although none of them is listed there (and should not be listed, as those are frequently linked news sites). 195.13.132.24 13:45, 20 Dec 2004 (UTC)
I had the same problem. I think it was probably related to the server going read-only this morning. 64.222.254.30 06:04, 21 Dec 2004 (UTC)

It was an over-broad spam filter rule. Unrelated to any server issues - I simply got a rule significantly wrong. Jamesday 21:33, 10 Jan 2005 (UTC)
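The reports in this section describe edits being rejected because of links that were already on the page. One way around that, suggested elsewhere on this page, is to check only the links an edit adds. A minimal sketch of that idea; the function names and the blacklist entry are hypothetical, and nothing here describes how the filter actually works:

```python
import re

URL_RE = re.compile(r"https?://[^\s\]<>\"]+")
# Hypothetical blacklist entry, for illustration only.
BLACKLIST = re.compile(r"spam-example\.invalid", re.IGNORECASE)

def added_links(old_text, new_text):
    """Links present in the new revision but not in the old one."""
    old = set(URL_RE.findall(old_text))
    return [u for u in URL_RE.findall(new_text) if u not in old]

def edit_is_blocked(old_text, new_text):
    """Reject the edit only if a newly added link hits the blacklist."""
    return any(BLACKLIST.search(u) for u in added_links(old_text, new_text))

old = "See http://spam-example.invalid/x for details."
print(edit_is_blocked(old, old + " Fixed a typo."))                       # unrelated edit
print(edit_is_blocked(old, old + " Also http://spam-example.invalid/y"))  # adds a listed link
```

Under this approach an unrelated edit to a page that already carries a listed link would go through, while adding a new listed link would still be refused.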

goatse

This is extremely urgent: www.goat.cx

I cannot imagine any link to this site should be possible to be made. Make sure you're not eating, if you do check it out. We had a very explicit picture on our rfd page on en.wiktionary.org Polyglot 22:46, 23 Dec 2004 (UTC)

Here is the diff:

Wiktionary:Requests_for_deletion&curid=4395&diff=130956&oldid=130942

This guy returned, created a new user account (I banned his other account, all he did was recurring vandalism) and came to pollute Wiktionary again with the same picture from this site. Can some action be taken please!!! This site does not have pictures that you want to see when you visit a wikimedia site. It's extremely disgusting and very sexually explicit in nature. Polyglot 22:23, 25 Dec 2004 (UTC)

yea, you'd think that would be an URL they would just throw in there from common sense. Maybe some Wiktionary ops could be made meta ops? We seem to get the most spam for some reason.--Eean 08:31, 30 Dec 2004 (UTC)
I took vacation for a week, went to practice some Esperanto in Germany. Odd that this site still has not been added to the list. It shows that this way of requesting is totally ineffective. I asked for sysop status in order to be able to add it myself, but only Angela spoke out to say she didn't want to vote because I didn't have enough edits (on wikimedia.org). Oh well, when I see it show up again, I'll take it off again. It might take us some time before we notice it and some innocent soul wanting to simply look up a word will get disgusted by it and probably never return. It shouldn't really be my problem and now, more than a week later it really isn't anymore. It was good to be detached from the internet for a while. Should be doing it more often (and probably will). Polyglot 00:43, 5 Jan 2005 (UTC)
Not added: this isn't spam. Content you don't like goes elsewhere. Not added also because there are legitimate links to this. Yes, I'm familiar with the content there. If someone adds it, remove it. If someone makes a habit of it, block them if that's appropriate under the policies of the wiki you're at. Deciding to block it here when it's an appropriate link in many of the projects isn't appropriate. This isn't "I don't like it, stop it from being added to any project". It's "this is recurring bulk spamming which can't be blocked with IP blocks, block it this way instead". Jamesday 21:54, 10 Jan 2005 (UTC)
http://en.wiktionary.org/w/index.php?title=Wiktionary:Requests_for_deletion&curid=4395&diff=135720&oldid=135718
There we go again. This is the third time this happens, always the same image, always to the same page. I'll start by banning the IP addresses this person comes from. He's not too timid to create a temporary account though. We will keep deleting it as it arrives. This time I wasn't quick enough, so somebody else had to be disgusted by it. No big deal, until it is a child that happens upon it...
I understand the reason why you won't block it and I won't ask for it again.
I also have to add that I really appreciate the work you and the rest of the crew do, James. Thanks Polyglot 09:21, 11 Jan 2005 (UTC)
Why are inline off-site images allowed on Wiktionary? —Ben Brockert < 16:38, 11 Jan 2005 (UTC)


latex-

The Wikipedia is on LaTeX and the official website of the LaTeX project is www.latex-project.org ... the spam filter is set to latex- ... Please remove it Squash 23:17, 31 Dec 2004 (UTC) (Also member of same name on Wikipedia)

Sites to add

Please add http://flash88.51.net, http://www.gghggh.com, http://www.paper-translation.com, http://www.law-translation.com, http://www.book-translation.com, http://www.sowang.com/translation.htm, http://online.paper-translation.com/, http://www.acmetranslation.com, http://www.commerce-translation.com. Annoying moron(s) spamming every wiki online. Spammed on Meta, Commons and pl. -83.129.21.131 09:13, 12 Jan 2005 (UTC)

Please add a wildcard, or somesuch: bertelsmann2club.be.funpic.de is blocked, bertelsmann2club.funpic.de is not: http://de.wikipedia.org/w/index.php?title=Bertelsmann_AG&diff=4442449&oldid=4439218 --82.141.58.185 11:36, 10 Feb 2005 (UTC)
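If the blacklist entries are regular expressions, one pattern could cover both host names by allowing optional extra labels in the middle. A sketch under that assumption; whether the list accepts this exact syntax is not confirmed on this page:

```python
import re

# Assumed pattern: any number of extra labels (such as "be.") may sit
# between "bertelsmann2club." and "funpic.de".
PATTERN = re.compile(r"bertelsmann2club\.(?:[a-z0-9-]+\.)*funpic\.de",
                     re.IGNORECASE)

for host in ["bertelsmann2club.be.funpic.de",
             "bertelsmann2club.funpic.de",
             "www.funpic.de"]:
    print(host, bool(PATTERN.search(host)))
```

This keeps the rest of funpic.de unaffected, since the pattern still requires the bertelsmann2club label.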

Please check this list and if it's done, note it after the request, if it isn't then please update the list and note that it's done. Thanks. --grin 20:50, 14 Jan 2005 (UTC)

1

I think http://en.thinkexist.com should be blocked - the owner adds it again and again to promote his quotation website, regardless of being told many times that there is Wikiquote for that. Ahoerstemeier 17:34, 21 Nov 2004 (UTC)

Please add filter for cracowonline.com. See http://en.wikipedia.org/wiki/Talk:Krakow#Spam --Gene s 11:06, 3 Dec 2004 (UTC)

Please add the following URLs (all posted dozens of times from multiple IPs) to the spam regex:

http://www.badmintonfan.com
http://www.nbpv.com
http://www.yide-sh.com
http://www.best-deals-online-gambling.info
http://www.top-deals-online-pharmacy.info
http://www.best-deals-levitra.info
http://www.top-deals-pills.info
http://www.best-deals-tramadol.info
http://www.best-deals-weight-loss.info
http://www.best-deals-diet.info
http://www.top-deals-viagra.info
http://www.best-deals-hotels.info
http://www.best-deals-roulette.info
http://www.credit-reports-4u.info
http://www.credit-report-4u.info
http://www.mortgage-calculators-ebanking.info
http://www.mortgage-4-u.info
http://www.private-mortgage-insurance-ebanking.info
http://www.student-loans-ebanking.info
http://www.loans-4-u.info
http://www.health-insurancedeals-4u.info
http://www.auto-insurancedeals-4u.info
http://www.car-insurancedeals-4u.info
http://www.insurancedeals-4u.info
http://www.insurance-quotesdeals-4u.info
http://www.credit-card-applications-4u.info
http://www.hotelse-site.info
http://www.hotele-site.info
http://www.las-vegas-hotels-e-site.info
http://www.cheap-hotels-e-site.info
http://www.hotel-dealse-site.info
http://www.travel-e-site.info
http://www.top-e-site.info
http://www.air-travel-e-site.info
http://www.great-e-site.info
http://www.car-rental-e-site.info
http://www.car-rentals-e-site.info
http://www.rental-car-e-site.info
http://www.deal-e-site.info
http://www.dating-e-site.info
http://www.online-dating-e-site.info
http://www.dating-site-e-site.info
http://www.adult-dvd-top-shop.info
http://www.dvd-top-shop.info
http://www.digital-camera-esite.info
http://www.digital-cameras-esite.info
http://www.golf-e-course.info
http://www.golf-clubs-e-course.info

See the edit history of en:Comment for a good example of how long this has been going on. The same spammers have also targeted BerliOS relentlessly. Note that some of the URLs look like repeats, but they're actually slightly different. I know it's a long list (took me a while to compile), but filtering them would be very helpful. -- Hadal 07:20, 9 Dec 2004 (UTC)

And a few more:

http://www.hzyage.com
http://www.51wisdom.com
http://www.hhboy.com
http://www.livingchina.cn
http://www.dzsc.com
http://www.ic37.com
http://www.cnzsw.com
http://www.51wisdom.com
http://www.sj-qh.com
http://www.my512.com
http://www.fj-zhanhong.com

Thanks. -- Hadal 07:39, 9 Dec 2004 (UTC)

some www.numbers.com removed because they were already filtered, and blocked the update. --grin 20:50, 14 Jan 2005 (UTC)

chinese spam all around, recent and old

Random wikipedia spam from china, see [18]

http://www.hpv[0-9]*.com.cn (hpv80, hpv120 etc)
http://www.hpv[0-9]*.cn
and others.

Okay, see [19] for searching, probably some fscking chinese viagra clone, let them die all. --grin 10:49, 9 Dec 2004 (UTC)

chinese again

  • spam diff here, since there is no "look for spam" anymore, it's up to the devs to seek and destroy, and filter along the way. ('h' is cut to prevent them from becoming live links)

ttp://www.rackstorage.cn/ 货架] ttp://www.rackstorage.cn/map.htm 货架] ttp://www.rackstorage.cn/1 货架] ttp://www.dinmo.net/hj 货架] ttp://www.rackstorage.cn/2/ 货架] ttp://www.seov.net/hj 货架] ttp://www.google-seo.net/hj 货架] ttp://www.rackstorage.cn/3/ 货架] ttp://www.vbzx.net/hj 货架] ttp://www.rackstorage.cn/5/ 货架] ttp://www.5782601.net/hj 货架] ttp://www.xazl.net/hj 货架] ttp://www.guizang.net/hj 货架] ttp://www.wikidragon.net/hj 货架] ttp://www.houseso.cn/hj 货架] ttp://www.house263.com/hj 货架] ttp://www.xh008.com/hj 货架] ttp://artmtm.nease.net/hj 货架] ttp://pxxi100.51.net 货架]

--grin 11:00, 14 Jan 2005 (UTC)

translation spam

I cannot check for multiple-project spam (the option seems to be deactivated?), but I guess this one spread further than huwiki: see this diff. The URLs are listed there. --grin 18:02, 1 Jan 2005 (UTC)

What's the difference?

Following a request from de.wiki about ongoing spamming, I made an edit and added an entry. I expected it to take effect soon, but I got an additional report that the spamming continued. What was wrong with my edit? Or is it a feature? --Aphaia | WQ2翻訳中 | talk 01:43, 28 Jan 2005 (UTC)

Please add (spambot, part 1)

Hello. A spambot is attacking en:PHP, en:Cybercash, en:DBpp, en:DBM, en:CCVS, en:FrontBase, and also en:Consultative Group on Indonesia, en:Wikipedia talk:Friends of Wikipedia/Other wikis, en:Thatware and their talk pages, adding several dozen edits per day in some cases using multiple anon IPs.

Can you add all of these to the spam blacklist? I'd even recommend adding the top-level domains, eg: "6x.to", "uni.cc" and all of ".su" (old Soviet Union TLD, who legitimately uses it today?). -- Curps 19:30, 9 Feb 2005 (UTC)


bizarre-free-gallery.6x.to ass-fucking-gallery.6x.to bizarre-gallery.6x.to busty-babes-gallery.6x.to anal-gallery-hardcore.6x.to nude-beach.6x.to black-ebony-pussy.6x.to gallery-of-pretty-babes.6x.to asian-nude-gallery.6x.to spice-girl.6x.to blonde-fucking-gallery.6x.to sexy-brunette.6x.to logos-shower-galleries.6x.to beach-galleries.6x.to hot-cheerleader.6x.to topless-bikini-girl-gallery.6x.to

dose-valtrex.uni.cc lipitor-zocor-1.uni.cc pregnancy-zyrtec.uni.cc housewifes.uni.cc pfizer-celebrex.uni.cc discount-adipex.uni.cc indian-larry.uni.cc discount-adipex.uni.cc nude-africa.uni.cc granny-post.uni.cc manufacturer-renova.uni.cc acyclovir-valtrex.uni.cc apache-indians.uni.cc enema-fetish-gallery.uni.cc carmen-electra-nude.uni.cc housewifes-picture.uni.cc pic-of-jenna-jameson.uni.cc despret-housewifes.uni.cc prescription-valtrex.uni.cc zocor-recall.uni.cc elephant-list-links.uni.cc indian-woman.uni.cc xenical-diet-pill.uni.cc

Cigarettes.nov.su Shopping.grozny.su Toys.grozny.su Vicodin.karacol.su Pain-relief.karacol.su Blackjack.ivanovo.su card.togliatti.su mortgages.vologda.su Phone.tselinograd.su Blackjack.tselinograd.su Pharmacy.nalchik.su Gambling.termez.su maxtor.tuva.su maxtor.lenug.su Phone.belgorod.su Percocet.spb.su Phone.north-kazakhstan.su Hobby.armenia.su Gambling.bryansk.su Baccarat.aktyubinsk.su

Prescription.kalmykia.ru Gambling.adygeya.ru Hobby.adygeya.ru Gambling.grozny.ru Pain-relief.bashkiria.ru Tramadol.cbg.ru Percocet.vladikavkaz.ru mortgages.vladimir.ru

I am seriously thinking about the idea of just filtering out all uni.cc and 6x.to addresses, but are there any legitimate links that would be affected by this? Or are they only used by spammers? What about on the non-English wikis? Silsor 01:28, Feb 10, 2005 (UTC)
Google shows a couple of legitimate uni.cc addresses and a couple of legitimate 6x.to addresses. I don't know what the solution to this is... he is probably using throwaway domains, registering new ones on a continuous basis. I suspect he may be using hacked zombie machines all over the Internet, because I saw at least one AOL IP, so that definitely wasn't an open proxy. -- Curps 01:39, 10 Feb 2005 (UTC)
Is there any way to tell the filter to operate in reverse for certain TLDs, as in "block all 6x.to addresses except this list?" Sort of like hosts.allow and hosts.deny in Unix? -- Curps 02:06, 10 Feb 2005 (UTC)
There is now a bug open for this [20]. Silsor 02:59, Feb 10, 2005 (UTC)
The bot is following links. I added a vprotect notice to en:Wikipedia talk:Recent additions 20 which generated a bogus link to en:Wikipedia talk talk:Recent additions 20. A spam page was promptly created there... -- Curps 02:06, 10 Feb 2005 (UTC)
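The "block all 6x.to addresses except this list" idea raised in this thread could be sketched as a suffix block with an allow list checked first, much like hosts.allow taking precedence over hosts.deny. The allowed host names below are placeholders, since the legitimate sites are not named on this page, and the filter itself may not support anything like this:

```python
import re

# Placeholder allow list; the legitimate hosts are not named on this page.
ALLOWED_HOSTS = {"legit-example.6x.to", "another-example.uni.cc"}
BLOCKED_SUFFIXES = ("6x.to", "uni.cc")

def host_of(url):
    """Extract the host from a URL given with or without the scheme."""
    return re.sub(r"^[a-z]+://", "", url.lower()).split("/")[0].split(":")[0]

def is_blocked(url):
    """Allow listed hosts first, then block anything under a listed suffix."""
    host = host_of(url)
    if host in ALLOWED_HOSTS:
        return False
    return any(host == s or host.endswith("." + s) for s in BLOCKED_SUFFIXES)

for url in ["http://nude-beach.6x.to/", "legit-example.6x.to/page",
            "dose-valtrex.uni.cc", "www.example.org"]:
    print(url, is_blocked(url))
```

Comparing whole host labels rather than raw substrings avoids catching unrelated names that merely end in the same characters.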

Please add (spambot, part 2)

The attack has spread to en:PL/I and en:PHP-Nuke. It could easily spread to many other pages. At this point I'd say block all of 6x.to and uni.cc (despite the 2-3 legit external links that use them) and all of .su. This is an extremely serious situation. -- Curps 08:06, 10 Feb 2005 (UTC)

halloween-costume-idea.6x.to gilmore-girl.6x.to

discount-didrex.uni.cc lortab-overdose.uni.cc nude-female.uni.cc gay-bear-gallery.uni.cc

Canadian-pharmacy.cbg.ru

Blackjack.georgia.su Gambling.exnet.su Tramadol.aktyubinsk.su

Please add (spambot, part 3)

amateur-gallery.6x.to rap-olympics.6x.to

Baccarat.murmansk.su

Please add (spambot, part 4)

Feb 12:

Shopping.bryansk.su Baccarat.mordovia.su Pain-relief.adygeya.ru

History of Nations

Can a site be removed from the spam filter? I think a site is on it by mistake. Please check out the Hot Sites page at USATODAY.com at http://www.usatoday.com/tech/webguide/hotsites/2005/2005-02-07-hotsites.htm. Note the first site, which is History of Nations. The comment reads: "Wondering what the deal is with Ukraine? Did it recently dawn on you that Canada's frontier history might include some very interesting tales? This site is exactly the sort of thing the Internet has been delivering for years, and yet it's still a pleasure to see it when it's done well. Consider this a worthy reference resource, but don't neglect the pleasures of randomly browsing around."

How can such a site be considered spam? It is not like a spammer hacked USATODAY.com and added this comment.

I have a theory...

Did someone think this site was a copy of Wikipedia? If so, it may have been mistakenly banned. Note this comment on the index page of the site in question. "The text of this site did NOT originate at any online user contributed encyclopedia. Many of the articles are similar because many of these user contributed encyclopedias and History of Nations have seeded national histories from the same public domain text. Although many have been confused about this fact, History of Nations is not a mirror of the content of these encyclopedias and does not have to link to these encyclopedia articles and does not have to comply with any licensing requirements."

Perhaps this site should be removed? Miland 20:41, 9 Feb 2005 (UTC)

I don't know why historyofnations was added to the spam blacklist, but I'm inclined to leave this fence up until I know why it was put up. It's one of the very first entries in the list so likely the domain was promoted with spam. Silsor 01:36, Feb 10, 2005 (UTC)

This doesn't look like a spam site. It is even listed in Yahoo. (See http://dir.yahoo.com/arts/humanities/history/) Yahoo doesn't list spam sites...

There is some sort of mistake here. This site never needs to be listed in any Wiki project but having the stigma of being on this list is unfair for a quality site. It makes no sense. Did someone at Wikipedia mistakenly believe that historyofnations was an unauthorized user of wikipedia content? Did the site owner hire an unethical seo company which got them banned? Did a competing site get mad that historyofnations had a few listings so they spammed Wikipedia to death with this site to get it banned? (This is a technique promoted in some webmaster forums...) Look at this site. It is nothing like all the other ones on this spam list. This should be reconsidered. Is a listing on the blacklist a permanent sentence? Miland 02:26, 10 Feb 2005 (UTC)

It is definitely not a permanent sentence. I have been trying to track it down and historyofnations was listed on the original $wgSpamRegex before we even had this spam blacklist. Does anybody remember why it was added? Silsor 02:56, Feb 10, 2005 (UTC)
This is commented out now. silsor 21:01, Feb 10, 2005 (UTC)