Talk:Spam blacklist
For more information on what the spam blacklist is for, and the processes used here, please see Spam blacklist/About.
Please post comments to the appropriate section below: Proposed additions, Proposed removals, or Troubleshooting and problems, read the messageboxes at the top of each section for an explanation. Also, please check back some time after submitting, there could be questions regarding your request. Per-project whitelists are discussed at MediaWiki talk:Spam-whitelist. In addition to that, please sign your posts with ~~~~ after your comment. Other discussions related to this last, but that are not a problem with a particular link please see, Spam blacklist policy discussion.
Completed requests are archived (list, search), additions and removal are logged.
- Information
- List of all projects
- Overviews
- Reports
- Wikimedia Embassy
- Project portals
- Country portals
- Tools
- Spam blacklist
- Title blacklist
- Email blacklist
- Rename blacklist
- Closure of wikis
- Interwiki map
- Requests
- Permissions
- Bot flags
- New languages
- New projects
- Username changes
- Translations
- Speedy deletions
snippet for logging: {{/request|1083694#{{subst:anchorencode:section name here}}}}
If you cannot find your remark below, please do a search for the URL in question with this Archive Search tool.
Proposed additions
This section is for proposing that a website be blacklisted; add new entries at the bottom of the section, using the basic URL so that there is no link (example.com, not http://www.example.com). Provide links demonstrating widespread spamming by multiple users on multiple wikis. Completed requests will be marked as {{added}} or {{declined}} and archived. |
musicdumper.com
musicdumper.com
User:124.107.177.204(IP tools: Google | WHOIS | domaintools | RBL | tools)
(spamhaus | projecthoneypot | malwareurl)
For the sake of clarity really. I've listed this domain (& not logged it so far). An IP created a page on Commons with a trailing "/" so often the sign of bot stuff with links to this site on. The toolserver was down so I could not check either Luxo or Eagle & I felt listing was better than being spammed. Now both are back on I can see no links to the site (top 30 wikis) nor any other relevant contribs.
The page is here but Commons admins only I'm afraid.
Should it stay listed or not?
Cheers --Herby talk thyme 13:08, 8 July 2008 (UTC)
- I am waiting for the COIBot report, guess we know it may be coming, so it is save to take it off and wait for COIBot to report more of this. But I am also fine with leaving it here. --Dirk Beetstra T C (en: U, T) 13:16, 8 July 2008 (UTC)
- Just a note that this needs logging or removing while I am away. Cheers --Herby talk thyme 13:53, 9 July 2008 (UTC)
- Logged — Mike.lifeguard | @en.wb 22:54, 12 July 2008 (UTC)
- Just a note that this needs logging or removing while I am away. Cheers --Herby talk thyme 13:53, 9 July 2008 (UTC)
steephill.tv
steephill.tv
I think this one should be reviewed. There has been quite a bit of link placement (think there may have been another bot report too - I seems to recall dealing with some links before). I spotted some links on en wp the other day & I've not had time to follow it up. Don't have strong views though I believe the linkage may be rather too great. Cheers --Herby talk thyme 13:32, 9 July 2008 (UTC)
- Reverted a lot of it. The site does look somewhat useful, so I'm reluctant to blacklist it as well. I think Not done for now, and we'll see what happens. — Mike.lifeguard | @en.wb 23:09, 12 July 2008 (UTC)
img205.imageshack.us
img205.imageshack.us
Added this to the spamblacklist to stop ongoing cross-wiki-spammer:
User:88.253.101.99(IP tools: Google | WHOIS | domaintools | RBL | tools)
(spamhaus | projecthoneypot | malwareurl)
User added spam to talk of Upload (all his spam is already deleted). Example IRC-Log: Blacklist ms:User:88.253.101.99 used edit summary "Hacked By" in creating ms:Talk:Upload (+28950) URL: http://ms.wikipedia.org/wiki/Perbincangan:Upload "HACKED BY SERSERİ77" which contained a link to img205.imageshack.us
Maybe it has to be removed again. Best regards, --birdy geimfyglið (:> )=| ∇ 11:02, 11 July 2008 (UTC)
I also added
\bimageshack\.us\b \brapmatix\.com\/musics\/music527\.mp3\b
because he does not stop (where is that global block) creating nonsense pages crosswiki. Andre deleted them all so far, cleaning everything behind him. These links have to be removed, --birdy geimfyglið (:> )=| ∇ 12:47, 11 July 2008 (UTC)
derindusunce.org
Links to a Turkish website has been inserted cross wiki by various Orange France GPRS Network IP's the last days, links were also inserted cross wiki in December 2007/January 2008.
derindusunce.org
IP's
User:80.10.39.4(IP tools: Google | WHOIS | domaintools | RBL | tools)
(spamhaus | projecthoneypot | malwareurl)
User:90.95.30.190(IP tools: Google | WHOIS | domaintools | RBL | tools)
(spamhaus | projecthoneypot | malwareurl)
User:80.10.51.65(IP tools: Google | WHOIS | domaintools | RBL | tools)
(spamhaus | projecthoneypot | malwareurl)
User:90.94.11.80(IP tools: Google | WHOIS | domaintools | RBL | tools)
(spamhaus | projecthoneypot | malwareurl)
User:82.226.80.230(IP tools: Google | WHOIS | domaintools | RBL | tools)
(spamhaus | projecthoneypot | malwareurl)
User:80.10.15.52(IP tools: Google | WHOIS | domaintools | RBL | tools)
(spamhaus | projecthoneypot | malwareurl)
User:90.94.23.255(IP tools: Google | WHOIS | domaintools | RBL | tools)
(spamhaus | projecthoneypot | malwareurl)
--Jorunn 17:28, 11 July 2008 (UTC)
diplomaticsociety.org
28x linkspam in 10 projects in about an hour See COIbot report, previous spamming has occurred.
diplomaticsociety.org
User:66.189.223.234(IP tools: Google | WHOIS | domaintools | RBL | tools)
(spamhaus | projecthoneypot | malwareurl)
User:71.112.105.60(IP tools: Google | WHOIS | domaintools | RBL | tools)
(spamhaus | projecthoneypot | malwareurl)
User:71.112.98.172(IP tools: Google | WHOIS | domaintools | RBL | tools)
(spamhaus | projecthoneypot | malwareurl)
User:209.202.226.100(IP tools: Google | WHOIS | domaintools | RBL | tools)
(spamhaus | projecthoneypot | malwareurl)
--EdBever 20:47, 11 July 2008 (UTC)
- Thanks - Added — Mike.lifeguard | @en.wb 23:17, 12 July 2008 (UTC)
tourist guide sites
Cross wiki spam. These are related by the IP's spamming them.
URLs
copenhagencity.org
- Google Adsense id 5081061325759884
jamsankoski.com
- Google Adsense id 5081061325759884
bucegi.info
- Google Adsense id 5081061325759884
myrtle-beach-guide.org
- Google Adsense id 5081061325759884
aboutporto.com
- Google Adsense id 5387217387406024
discoverlausanne.com
- Google Adsense id 5387217387406024
aboutlausanne.com
aboutsheffield.com
- Google Adsense id 7183744314501122
nurnberg.biz
- Google Adsense id 7183744314501122
aboutliverpool.com
- Google Adsense id 7183744314501122
IPs
User:86.34.48.47(IP tools: Google | WHOIS | domaintools | RBL | tools)
(spamhaus | projecthoneypot | malwareurl)
User:86.34.48.225(IP tools: Google | WHOIS | domaintools | RBL | tools)
(spamhaus | projecthoneypot | malwareurl)
User:86.34.48.81(IP tools: Google | WHOIS | domaintools | RBL | tools)
(spamhaus | projecthoneypot | malwareurl)
User:87.97.98.232(IP tools: Google | WHOIS | domaintools | RBL | tools)
(spamhaus | projecthoneypot | malwareurl)
User:79.113.46.161(IP tools: Google | WHOIS | domaintools | RBL | tools)
(spamhaus | projecthoneypot | malwareurl)
User:79.113.41.82(IP tools: Google | WHOIS | domaintools | RBL | tools)
(spamhaus | projecthoneypot | malwareurl)
User:79.113.43.120(IP tools: Google | WHOIS | domaintools | RBL | tools)
(spamhaus | projecthoneypot | malwareurl)
User:79.119.18.192(IP tools: Google | WHOIS | domaintools | RBL | tools)
(spamhaus | projecthoneypot | malwareurl)
- did also blank the reports User:COIBot/XWiki/aboutlausanne.com and User:COIBot/LinkReports/aboutlausanne.com
User:91.82.32.140(IP tools: Google | WHOIS | domaintools | RBL | tools)
(spamhaus | projecthoneypot | malwareurl)
+ en:user:Tibiche --Jorunn 23:07, 11 July 2008 (UTC)
comicradioshow.com
This url was removed from the blacklist in this edit. I belive this was by mistake.
Discussion of the link placement
Now en:user:Maqz has started inserting the link on en.wikipedia again.
--Jorunn 00:56, 12 July 2008 (UTC)
kurz.es
kurz.es
Please add this, it's a URL shortener which has been used to bypass the local blacklist at dewiki. --Entlinkt 19:35, 12 July 2008 (UTC)
Proposed additions (Bot reported)
This section is for websites which have been added to multiple wikis as observed by a bot.
Items there will automatically be archived by the bot when they get stale. Sysops, please change the LinkStatus template to closed ( These are automated reports, please check the records and the link thoroughly, it may be good links! For some more info, see Spam blacklist/help#SpamReportBot_reports If the report contains links to less than 5 wikis, then only add it when it is really spam. Otherwise just revert the link-additions, and close the report, closed reports will be reopened when spamming continues. Please place suggestions on the automated reports in the discussion section. |
COIBot
Running, will report a certain domain shortly after a link is used more than 2 times by one user on more than 2 wikipedia (technically: when more than 66% of this link has been added by this user, and more than 66% of this link were added XWiki). Same system as SpamReportBot (discussions after the remark "<!-- Please put comments after this remark -->" at the bottom; please close reports when reverted/blacklisted/waiting for more or ignore when good link)
List | Last update | By | Site IP | R | Last user | Last link addition | User | Link | User - Link | User - Link - Wikis | Link - Wikis |
---|---|---|---|---|---|---|---|---|---|---|---|
vrsystems.ru | 2023-06-27 15:51:16 | COIBot | 195.24.68.17 | 192.36.57.94 193.46.56.178 194.71.126.227 93.99.104.93 |
2070-01-01 05:00:00 | 4 | 4 |
Proposed removals
This section is for proposing that a website be unlisted; please add new entries at the bottom of the section. Remember to provide the specific URL blacklisted, links to the articles they are used in or useful to, and arguments in favour of unlisting. Completed requests will be marked as done or denied and archived. See also /recurring requests for repeatedly proposed (and refused) removals. The addition or removal of a link is not a vote, please do not bold the first words in statements. |
holocaustresearchproject.org
holocaustresearchproject.org
I wanted to add the external link http://www.holocaustresearchproject .org/ghettos/riga.html (Holocaust Education & Archive Research Team: The Killings at Riga) to a page. What is the reason that this domain is at the blacklist? SchirmerPower 11:04, 29 June 2008 (UTC)
- The link was added per request. Is the site more accurate now? I for one can't judge that. Not sure about removing, so I'm leaving this to someone else. --Erwin(85) 11:39, 29 June 2008 (UTC)
- I am baffled why this was ever blacklisted. It is well-written and seems to be accurate. It should be removed at once.--Cato 11:49, 29 June 2008 (UTC)
The logic behind the original listing is as linked by Erwin and seems to have been perfectly valid. The lost appropriate course of action would be to get hold on some en wp (or other languages) editors with knowledge of the subject to assess whether it can now be considered a reliable source. Until such opinions are given de-listing may be premature. Thanks --Herby talk thyme 12:06, 29 June 2008 (UTC)
- So it's been declared a non-reliable site on the basis of criticism on a blogspot? And why should I be presumed not to have knowledge of the subject? I repeat that the Riga page seems to be accurate.--Cato 12:20, 29 June 2008 (UTC)
- I have no idea what the history is only that it was decided it was unreliable. My understanding was that you were not active on Wikipedias and so I suggested looking for some from en (or other) wikis for their views - is that so unreasonable? In practice almost nothing gets removed from here "at once" as you will see if you run through the archives. Mistakes apart some time for input from others is the normal practice here. Other views are very welcome indeed. --Herby talk thyme 13:09, 29 June 2008 (UTC)
- Sorry if I misunderstood. I thought you were saying that because I mainly edit EN:WQ I wasn't in a position to comment in an area where in fact I have some expertise, and that the views of people active on EN:WP or some other WP would be needed.--Cato 14:07, 29 June 2008 (UTC)
- Is it -maybe- possible to exclude the "Riga" page from the blacklist until the rest of the domain is cleared? SchirmerPower 08:43, 30 June 2008 (UTC)
- Can I suggest in the short term you look for local whitelisting if this is inconveniencing you. I'm guessing "de" is the appropriate wiki for you &, if so, you would ask here. Equally then if you want an aspect of the site still unavailable you can seek blacklisting of that specific url locally too? I hope that helps for now. Regards --Herby talk thyme 10:10, 30 June 2008 (UTC)
- Thank you, I will ask there for putting the link on the local whitelist. SchirmerPower 11:15, 30 June 2008 (UTC)
I have looked round the site, and it seems very good. Why it was blacklisted in the first place seems odd, and I cannot see that continuing the blacklist can benefit the project. (CoI: I lost several relatives in the Holocaust.)--Yehudi 08:28, 3 July 2008 (UTC)
If I recall correctly, there was a legal battle between two sites with very similar names, which had what to do with the prevention of either from being used. The actual discussion is somewhere in the OTRS archives. -- Avi 06:19, 10 July 2008 (UTC)
- That was deathcamps.org / death-camps.org. JzG 12:25, 12 July 2008 (UTC)
www.cfrcluj.net
Why was this link deleted? This website provides valuable information to people interested in the football club. —The preceding unsigned comment was added by 76.69.49.133 (talk • contribs) 15:53, 3 Jul 2008 (UTC)
- Not blacklisted - where was it removed from (which page & which wiki). Thanks --Herby talk thyme 15:55, 3 July 2008 (UTC)
cfrcluj.net
- There is an xwiki report from COIBot. I think Mike.lifeguard reverted the additions. Site may be appropriate on en, not sure if it has to be everywhere. --Dirk Beetstra T C (en: U, T) 15:58, 3 July 2008 (UTC)
- Given that it was added all at once by a single user, I did revert the additions. That does not mean it is totally inappropriate for inclusion, just that the manner in which it was being included was not for the benefit of the projects. If you wish to include it on somewhere, you may do so as long as it's addition is in line with whatever policies and guidelines there are on the subject, and you are adding it to improve the wiki (and no other reason). As this is not blacklisted, there is nothing further for us to do here. — Mike.lifeguard | @en.wb 21:53, 3 July 2008 (UTC)
I understand your point. However, I disagree. The club has many French, German-speaking people looking to find out info about the club. As there is no CFR website in French or German, people tend to look for one that is in the international language (English). By having a link to the only CFR English website in the articles, anyone who is interested in the club can find out more detailed informantion in a language that they can actually understand. By the way, what exactly are the policies and guidelines on the subject?
P.S. Herby: the article is on CFR Cluj
dcc.ttu.ee/Bands/get.asp?ident=2555
dcc.ttu.ee/Bands/get.asp?ident=2555
bacause it only gives Bad Request (Invalid Hostname). --WikedKentaur 05:53, 5 July 2008 (UTC)
- This was blacklisted by User:Nick1915 after this request. If it returns an error there is no reason to link to it, and given past problems, I'm reluctant to de-list it in case it is fixed and spammed. Perhaps that's too paranoid. — Mike.lifeguard | @en.wb 13:07, 5 July 2008 (UTC)
You shouldn't blacklist my sites, or indeed anything related to them. I feel there is no need for you to blacklist them. Gawd, I'm not a spammer. What do you take me for?? Oh, and one more thing, these sites should never be blacklisted anywhere. --Paris Hilton 20:25, 6 July 2008 (UTC)
- I have a hard time believing this is a serious request. Do others feel we can disregard this one (note that the account is blocked currently). — Mike.lifeguard | @en.wb 23:18, 6 July 2008 (UTC)
- Domains are not being removed from the spam blacklist in response to site-owners' requests. Instead, sites are de-blacklist when trusted, high-volume editors request the use of blacklisted links because of their encyclopaedic value in support of our encyclopaedia pages. If such an editor asks to use your links, I'm sure the request will be carefully considered and your links may well be removed. --Jorunn 23:58, 6 July 2008 (UTC)
- User now blocked I see. Equally they appear to have been active on de wikis & I am not aware of that linguistic ability for this person. Declined --Herby talk thyme 09:35, 7 July 2008 (UTC)
Google.az
Why is the azeri version of Google on the blacklist ? Is it possible to remove it ? I noticed it while trying to edit fr:Discuter:Heydär Äliyev. Moumine 13:29, 8 July 2008 (UTC)
cais-soas.com
"The Circle of Ancient Iranian Studies (CAIS) is an independent not-for-profit educational programme, with no affiliation to any political or religious group dedicated to the research, protection, preservation of the pre-Islamic Iranian civilisation.
CAIS was established in 1998 by Shapour Suren-Pahlav and Oric Basirov (Department of Art and Archaeology), under the name of "Ancient Iranian Civilisation at the School of Oriental and African Studies" (AIC at SOAS) and later changed to "The Circle of Ancient Iranian Studies at the School of Oriental and African Studies" (CAIS at SOAS) of the University of London, to act as a forum for the exchange of information about the art, archaeology, culture and civilisation of Iranian peoples. CAIS no longer has any affiliation with SOAS.
The mission of the Circle is to expand understanding and appreciation of pre-Islamic Iranian heritage as achieved through systematic investigation of the archaeological and historical records.
The Circle seeks to promote and increase the existing body of knowledge relating to this important area, laying particular emphasis on providing up-to-date information to students, academics and cultural enthusiasts about current Iranian and international research projects and fieldworks."
So why is it black-listed?--81.105.226.158 13:35, 9 July 2008 (UTC)
cais-soas.com
Has been added in June 2008, per request. Strike that, no that was another one, can't find it yet. --Dirk Beetstra T C (en: U, T) 13:40, 9 July 2008 (UTC)
Ah, here it is:
it was done by Dmcdevit in May 2007, per this evidence, part of this thread. Hope this explains. --Dirk Beetstra T C (en: U, T) 13:45, 9 July 2008 (UTC)
- As far as I recall it was used to support distinctly non-neutral edits and as a source for copyright violating images, I also seem to recall a significant volume of apparently COI link additions but I could be wrong about that. I don't believe there is such a paucity of neutral sources that we would need this one, the request seems largely to address a desire to promote the site rather than a genuine wish to improve the encycloipaedia. JzG 14:27, 12 July 2008 (UTC)
gazeteler.com
gazeteler.com is somehow on the spam list for wikipedia. I don't know why it is there and I don't know or care if there is an article which could link to gazeteler.com. All I know is that gazeteler.com is not a spam website and does not deserve to be listed in a spam list. Either name that section "sites we do not like" or take gazeteler.com out of that spam list, since it negatively effects the reputation of an almost 10 year old project. — The preceding unsigned comment was added by 85.100.45.232 (talk)
- The link was blacklisted after I requested it here. It was blacklisted because someone had started a project inserting links to the mentioned website, and its sister websites, into Wikipedia articles about newspapers in various countries.
- There is a proposal to rename the list. --Jorunn 11:50, 9 July 2008 (UTC)
suite101.com
Hello, I am Swusr on English Wikipedia, and I want to use articles from this website as references for the article Vasili Eroshenko on English Wikipedia. But when I try to do so, I cannot link to it because it is blacklisted. Swusr 20:58, 9 July 2008 (UTC)
faeroerne.dk
Hi! About a year ago i spammed wikipedia with a link to a site about Faroe Islands (Denmark) in the 'Faroe Islands category' in many languages. At that time I did not think of it as spam. Now, a year later, the site has got a 'Virtual Tour' from the Islands. This is really useful for tourists. So I kindly ask you to remove it from the blacklist? I would like to add the link in http://da.wikipedia.org/wiki/F%C3%A6r%C3%B8erne
The site has got several incoming links from great official sites too. Some of the most important links: VisitDenmark.com - http://visitdenmark.com/sverige/sv-se/menu/turist/turistinformation/regionaleturistsites/regioner.htm Atlantic Airways - http://atlanticairways.com/ Smyyril Line (Ferry to Faroe Islands): http://www.smyril-line.fo/NORR%C3%96NA/360%C2%B0/M%C3%B3tt%C3%B8kan.aspx
/Kasper Solberg
- The link was blacklisted after I requested it here, 7 November 2007. I requested the link blacklisted because of the widespread insertion of the link, including reinserting it sevral places after it had been removed (the IP you used was blocked in 3 wikis for spamming).
- Domains are not being removed from the spam blacklist in response to site-owners' requests. Instead, sites are de-blacklist when trusted, high-volume editors request the use of blacklisted links because of their encyclopaedic value in support of our encyclopaedia pages. If such an editor asks to use your links, I'm sure the request will be carefully considered and your links may well be removed.
- If the link is wanted on da.wikipedia it can be locally whitelisted there, on da:MediaWiki:Spam-whitelist. (Siden den lokale hvitlisten ikke er i bruk på da.wikipedia ennå er det kanskje lurest å henstille om hvitlisting på da:Wikipedia:Landsbybrønden. Men vær klar over at mange Wikipediautgaver har regler som sier at man ikke skal legge inn lenker til egne nettsteder. Retningslinjene fro eksterne lenker på da.wikipedia ligger her.) --Jorunn 23:40, 10 July 2008 (UTC)
Troubleshooting and problems
Discussion
COIBot
poking COIBot
I notice that sometimes people who are not active on IRC need some link reports. Admins here can now add {{LinkSummary|domain}}
to User:COIBot/Poke, when COIBot picks up the edit to that page (and it should), it will put the domains into its reporting queue (high priority, which is, only behind waiting XWiki reports) and create a report on the link(s). The first report should be saved within about 5 minutes, if it takes longer than 15 minutes there is probably something wrong, and it may be useful to add the template with the link again (it reads the added part of the diffs (the right column)), or poke me or another person who is active on IRC personally. Hope this is of help. --Dirk Beetstra T C (en: U, T) 12:46, 4 June 2008 (UTC)
- P.S. Please don't overuse the functionality, everything still needs to be saved. --Dirk Beetstra T C (en: U, T) 12:54, 4 June 2008 (UTC)
- It had some startup problems, but all seems to work fine now. --Dirk Beetstra T C (en: U, T) 17:28, 4 June 2008 (UTC)
Sorting - UTC
The COIBot reports now are/should be sorted by time, newest records at the bottom. The newer records are now stored in UTC, and there is a bot busy with converting the time of the old records to UTC. When the time is in UTC, it will show ' (UTC)' behind the timestamp. --Dirk Beetstra T C (en: U, T) 13:56, 5 June 2008 (UTC)
Already reverted?
OK, here's another thing for lazy people: tools:~erwin85/xwiki.php. If you enter the title of a report, or link to it directly like xwiki.php?report=User:COIBot/XWiki/automovil.cc, it gets the diffs from the report and checks the database to see whether or not it's the top edit and how many pages still link to the domain. Could make reverting a bit less work. Beetstra, could you add a link in new reports? Suggestions are welcome. --Erwin(85) 19:55, 25 June 2008 (UTC)
- Certainly looks useful, but using it on User:COIBot/XWiki/battiatoweb.com it has the sentence "Retrieved 7 edits from User:COIBot/XWiki/battiatoweb.com. Searching for 'http://com.battiatoweb.%'." -- the domain is incorrect, but it appears to be searching for the correct domain. — Mike.lifeguard | @en.wb 22:07, 25 June 2008 (UTC)
- The database uses that format for el_index, see database structure. The search value should be 'http://com.example.%' for example.com and 'http://com.example.sub.%' for sub.example.com. I guess it allows for faster indexing. The value is shown so you can check whether it actually searches for the correct value. --Erwin(85) 07:56, 26 June 2008 (UTC)
- Hmm .. why not link it in the {{LinkSummary}} for the appropriate linkrecords? --Dirk Beetstra T C (en: U, T) 08:45, 26 June 2008 (UTC)
- Great work Erwin & appreciated. I think worth bearing in mind that the fact that the link placement is not the last edit doesn't actually mean it has been reverted though. I've found times when that was not the case (particularly when some language bot passes by). I'll look at the tool more later or tomorrow. --Herby talk thyme 09:18, 26 June 2008 (UTC)
- Indeed, you should check the link count and not only the edits. All edits since the reported one are now listed, thanks to Beetstra for the suggestion, and it works for all three bot reports. --Erwin(85) 10:13, 26 June 2008 (UTC)
- Great work Erwin & appreciated. I think worth bearing in mind that the fact that the link placement is not the last edit doesn't actually mean it has been reverted though. I've found times when that was not the case (particularly when some language bot passes by). I'll look at the tool more later or tomorrow. --Herby talk thyme 09:18, 26 June 2008 (UTC)
- And I adapted the {{LinkSummary}} accordingly .. someone suggestions for something which is more appropriate than '(clear?)'? --Dirk Beetstra T C (en: U, T) 10:19, 26 June 2008 (UTC)
- Would it be possible instead of listing edits since the domain was added to have it do a linksearch on the apporpriate wiki, and return whether or not the page in question currently has the link? That would be much easier than looking at a bunch of edits since the diff provided. — Mike.lifeguard | @en.wb 21:39, 26 June 2008 (UTC)
- Also your edit to
{{linksummary}}
makes things not fit :( I will fix it shortly. — Mike.lifeguard | @en.wb 21:43, 26 June 2008 (UTC) - OK, I think it already does a linksearch on each relevant wiki, which makes this ridiculously useful :D — Mike.lifeguard | @en.wb 02:33, 27 June 2008 (UTC)
- It indeed counts the number of links. It now also returns the pages linking to the domain and marks reported pages with links with (L). --Erwin(85) 10:08, 27 June 2008 (UTC)
- Hmm .. why not link it in the {{LinkSummary}} for the appropriate linkrecords? --Dirk Beetstra T C (en: U, T) 08:45, 26 June 2008 (UTC)
- The database uses that format for el_index, see database structure. The search value should be 'http://com.example.%' for example.com and 'http://com.example.sub.%' for sub.example.com. I guess it allows for faster indexing. The value is shown so you can check whether it actually searches for the correct value. --Erwin(85) 07:56, 26 June 2008 (UTC)
- That is so good. BIG thanks Erwin (& we've just been chasing the same one :)). --Herby talk thyme 10:19, 27 June 2008 (UTC)
- I added support for User:Mike.lifeguard/removespam.js. That script does a search and replace for the given domain, set a default or given summary and show the changes. All you need to do is click save if you agree. Of course, you need to add the script to your monobook.js for each wiki. --Erwin(85) 11:59, 29 June 2008 (UTC)
- If you look at User:Mike.lifeguard/thing there is a script to load a script from meta, which you can use to load a standard script for every wiki. This lets you manage that script in one location. — Mike.lifeguard | @en.wb 01:20, 30 June 2008 (UTC)
Renaming the blacklist
For some time, folks all over have been wanting to rename the blacklist to remove the S-word. This would reduce the ability of people to complain about cases where a domain is legitimately blacklisted, but is not spam. Currently we get "Hey! You're calling me a spammer - that's defamation/libel!" even in cases where the complainant didn't link that site, so we are not calling them a spammer (nor are we necessarily calling their site spam). Since we legitimately use the blacklist to block domains which are not actually spam this is not a policy change. We already use the spam blacklist to block domains that are not spammed, and domains which are not spam - this change would reflect the reality and would reduce complaints as described above. Since there has already been discussion in the past, and that discussion was largely in favour, I have created bugzilla:14719. Please comment here, but if you know of places on the mailing lists, or on the wikis where this has already been discussed, it may be worth adding to the bug. — Mike.lifeguard | @en.wb 22:59, 3 July 2008 (UTC)
- As a general note, I think it is a misconception that people define a link as spam, it is in my opinion the way it is added ('pushed'). I concur with renaming, though, as it just generates unnecessery 'problems', and takes the ones out which are neither spam nor spammed (redirect sites). Could we have some poll with some suggestions, e.g. on metapub? --Dirk Beetstra T C (en: U, T) 09:43, 4 July 2008 (UTC)
- Just go vote on the bug Mike has created I think. There was discussion here a while back where all agreed anyway - just nothing happened. --Herby talk thyme 09:49, 4 July 2008 (UTC)
- Agreed - yet another poll or discussion is not really necessary. There is already agreement to do this, it is only the implementation we're waiting for. — Mike.lifeguard | @en.wb 16:05, 4 July 2008 (UTC)
- What is the new name going to be? Then I can prepare the bots for that. --Dirk Beetstra T C (en: U, T) 11:57, 9 July 2008 (UTC)
- How about "Mike's list? --A. B. (talk) 05:10, 10 July 2008 (UTC)
- "Disallowed websites"? (the term blocked may be too provocative) -- Avi 06:21, 10 July 2008 (UTC)
- That sounds good. —Giggy 06:24, 10 July 2008 (UTC)
- Yeah, that sounds fine to me - I'll suggest that on the bugzilla req. — Mike.lifeguard | @en.wb 10:59, 10 July 2008 (UTC)
- I think we should keep "blacklist" in the name, for integrity. So, if we're going to change it, I suggest "Website blacklist" or "URL blacklist" Huji 14:47, 10 July 2008 (UTC)
- "Disallowed websites"? (the term blocked may be too provocative) -- Avi 06:21, 10 July 2008 (UTC)
- How about "Mike's list? --A. B. (talk) 05:10, 10 July 2008 (UTC)
- I personally like "URL blacklist" or "External link blacklist" most. But all is fine with me. --Dirk Beetstra T C (en: U, T) 18:42, 10 July 2008 (UTC)
- I too like "URL blacklist".
- For reference: External link exclusion list discussion from February/March. --Jorunn 08:54, 11 July 2008 (UTC)
Thinking aloud
Bear in mind if we list something now existing links do need removing or they will stay there I believe? Cheers --Herby talk thyme 07:32, 7 July 2008 (UTC)
- Yes, the blacklist no longer prevents you from saving a page with a blacklisted link on it - only if you're adding a link that wasn't there before. So more cleanup is needed, but the threshold for blacklisting is perhaps a bit lower. Since it won't block you saving the page, we can't rely on editors hitting the "spam blocked" screen and then removing the link. I'm actually unhappy about this change - I'd have been happier to see some other changes to the extension before that one. The complaint is that people didn't know what to do when the edit was blocked - so make the message clearer, and figure some way to make section editing when blocked actually work. Not sure why this bug got done and the others didn't. As well, it produced bugzilla:14114.
- As well, the blacklist currently blocks in the edit summary too, which is a nice change. — Mike.lifeguard | @en.wb 11:08, 7 July 2008 (UTC)
- I am actually afraid that things go wrong with this 'fix'. If a vandal decides to blank the page, you can't undo because the link is still there (would be good if admin rollback and bot edits would be exempt from blacklisting). It does indeed give us the possibility to blacklist without removing first (to minimize disruption), but I still believe we have a cleanup job to do, it would be nice if we had a bot for that (but I know how difficult it is to make a bot select what actually needs removing). --Dirk Beetstra T C (en: U, T) 11:12, 7 July 2008 (UTC)
- FYI, the blacklist also now blocks links in the edit summary. (This is not very new - a month or two?) — Mike.lifeguard | @en.wb 01:02, 8 July 2008 (UTC)
double/wrong entries
when i deleted some entries from the german sbl, which are already listed in the meta sbl, i saw, that there are many double entries in the meta sbl, e.g., search for
- top-seo, buy-viagra, powerleveling, cthb, timeyiqi, cnvacation, mendean
and you'll find some of them. if you find it useful, i can try to write a small script (in august), which indicates more entries of this kind.
furthermore i'm wondering about some entries:
- "\zoofilia", for "\z" matches the end of a string.
- "\.us\.ma([\/\]\b\s]|$)", for ([\/\]\b\s]|$) ist the same as simply \b, isn't it? (back-refs are not of interest here)
- "1001nights\.net\free-porn", for \f matches a formfeed, i.e., never
- "\bweb\.archive\.org\[^ \]\{0,50\}", for that seems to be BRE, but php uses ERE, so i guess, this will never match
- "\btranslatedarticles\].com", for \] matches a ']', so will probably never match.
before i go on, i want to know, if you are interested in this information or not. :-) -- seth 22:23, 12 July 2008 (UTC)