Talk:Spam blacklist: Difference between revisions

From Meta, a Wikimedia project coordination wiki
Content deleted Content added
→‎y23 pictures: y23 explains his future COI behaviour (no weblinks, talk first)
+32: pdesigner.net spam ... also, how do I add some other domains to the x-wiki watchlist?
Line 176: Line 176:
::*{{UserSummary|Luckymurda}}
::*{{UserSummary|Luckymurda}}
::--<font face="Futura">[[User:A. B.|A. B.]] [[User talk:A. B.|<sup>(talk)</sup>]] </font> 20:05, 30 January 2009 (UTC)
::--<font face="Futura">[[User:A. B.|A. B.]] [[User talk:A. B.|<sup>(talk)</sup>]] </font> 20:05, 30 January 2009 (UTC)


===pdesigner.net spam===
Azerbaijani web designer and search engine optimization consultant. Spam originates from IPs in Azerbaijan and Italy.


;Domains spammed on one or more projects registered to the spammer:
*{{LinkSummary|ecommerce-designer.com}}
*{{LinkSummary|lalamirr.com}}
*{{LinkSummary|pdesigner.net}}
*{{LinkSummary|3dmax-tutorials.com}}
*{{LinkSummary|archicad-tutorials.info}}
*{{LinkSummary|bloomsandbaskets.com}}
*{{LinkSummary|character-studio.net}}


;Domains spammed on one or more projects by the spammer -- appear to be clients of the spammer's:
*{{LinkSummary|anatoliangifts.com}}
*{{LinkSummary|batik-tulis.com}}
*{{LinkSummary|cosmeproud.com}}
*{{LinkSummary|eastalgarvevilla.com}}
*{{LinkSummary|geminispanishproperties.com}}
*{{LinkSummary|otakuhouse.com}}
*{{LinkSummary|smsmp3.co.uk}}
*{{LinkSummary|stonexitalia.it}} (related to stonexsurveying.com)
*{{LinkSummary|stonexsurveying.com}}
*{{LinkSummary|totalstation.org}} (related to stonexsurveying.com)
*{{LinkSummary|totalstationshop.com}} (related to stonexsurveying.com)
*{{LinkSummary|travelbaku.com}}
The Stonex Surveying spam was especially egregious.


;Related domains registered to the spammer:
*{{LinkSummary|chinawholesale247.com}}
*{{LinkSummary|freelanceprogrammers.org}}
*{{LinkSummary|nxs4.com}}
*{{LinkSummary|en-ru.org}}


;Additional related domains associated with Stonex Surveying:
*{{LinkSummary|automatic-level.com}}
*{{LinkSummary|construction-laser.com}}
*{{LinkSummary|digitaltheodolite.com}}
*{{LinkSummary|distanciometro.com}}
*{{LinkSummary|echosounder.org}}
*{{LinkSummary|measuring-wheels.com}}
*{{LinkSummary|rtks82.com}}
*{{LinkSummary|teodolito.org}}
*{{LinkSummary|topografico.net}}


;Accounts:
*{{IPSummary|217.25.31.2}}
*{{IPSummary|81.21.81.117}}
*{{IPSummary|213.156.61.155}}
*{{IPSummary|81.208.74.187}}
*{{IPSummary|82.48.104.120}}
*{{IPSummary|81.17.95.183}}


*[[:az:Xüsusi:Contributions/81.21.81.117]]
*[[:el:Ειδικό:Συνεισφορές/217.25.31.2]]
*[[:ru:Служебная:Contributions/217.25.31.2]]
*[[:ru:Служебная:Contributions/81.17.94.119]]
*[[:en:Special:Contributions/217.25.31.2]]
*[[:en:Special:Contributions/81.21.81.117]]
*[[:en:Special:Contributions/213.156.61.155]]
*[[:en:Special:Contributions/81.208.74.187]]
*[[:en:Special:Contributions/82.48.104.120]]
*[[:en:Special:Contributions/81.17.95.183]]
*[[:it:Speciale:Contributi/213.156.61.155]]
**[[:it:Discussioni utente:213.156.61.155]]
*[[:da:Speciel:Bidrag/217.64.20.168]]
*[[:bs:Special:Contributions/217.64.20.168]]
*[[:cs:Special:Contributions/217.64.20.168]]
*[[:cy:Special:Contributions/217.64.20.168]]
*[[:de:Special:Contributions/217.64.20.168]]
*[[:eo:Special:Contributions/217.64.20.168]]
*[[:eu:Special:Contributions/217.64.20.168]]
*[[:fi:Special:Contributions/217.64.20.168]]
*[[:id:Special:Contributions/217.64.20.168]]
*[[:io:Special:Contributions/217.64.20.168]]
*[[:it:Special:Contributions/217.64.20.168]]
*[[:lt:Special:Contributions/217.64.20.168]]
*[[:na:Special:Contributions/217.64.20.168]]
*[[:nl:Special:Contributions/217.64.20.168]]
*[[:nn:Special:Contributions/217.64.20.168]]
*[[:no:Special:Contributions/217.64.20.168]]
*[[:pl:Special:Contributions/217.64.20.168]]
*[[:pt:Special:Contributions/217.64.20.168]]
*[[:scn:Special:Contributions/217.64.20.168]]
*[[:simple:Special:Contributions/217.64.20.168]]
*[[:sk:Special:Contributions/217.64.20.168]]
*[[:sv:Special:Contributions/217.64.20.168]]
*[[:tl:Special:Contributions/217.64.20.168]]
*[[:vi:Special:Contributions/217.64.20.168]]
*[[:zh-min-nan:Special:Contributions/217.64.20.168]]


The spammer has a number of other clients; on en.wikipedia, we are monitoring them -- is there a way to also monitor them x-wiki? If so, I think it's a good idea. Here's the list:
*[[:en:MediaWiki talk:Spam-blacklist#www.stonexsurveying.com]] <small>(permanent [http://en.wikipedia.org/w/index.php?title=MediaWiki_talk:Spam-blacklist&oldid=267539052#www.stonexsurveying.com link])</small>
--<font face="Futura">[[User:A. B.|A. B.]] [[User talk:A. B.|<sup>(talk)</sup>]] </font> 18:20, 31 January 2009 (UTC)


== Proposed additions (Bot reported) ==
== Proposed additions (Bot reported) ==

Revision as of 18:20, 31 January 2009

Shortcut:
WM:SPAM
WM:SBL
The associated page is used by the MediaWiki Spam Blacklist extension, and lists strings of text that may not be used in URLs in any page in Wikimedia Foundation projects (as well as many external wikis). Any meta administrator can edit the spam blacklist. There is also a more aggressive way to block spamming through direct use of $wgSpamRegex. Only system administrators can make changes to $wgSpamRegex, and its use is to be avoided whenever possible.

For more information on what the spam blacklist is for, and the processes used here, please see Spam blacklist/About.

Proposed additions
Please provide evidence of spamming on several wikis. Spam that only affects single project should go to that project's local blacklist. Exceptions include malicious domains and URL redirector/shortener services. Please follow this format.
Also, please check back after submitting your report, there could be questions regarding your request.
Proposed removals
Please check our list of requests which repeatedly get declined. Typically, we do not remove domains from the spam blacklist in response to site-owners' requests. Instead, we de-blacklist sites when trusted, high-volume editors request the use of blacklisted links because of their value in support of our projects. Please consider whether requesting whitelisting on a specific wiki for a specific use is more appropriate.

Please sign your posts with ~~~~ after your comment. This leaves a signature and timestamp so conversations are easier to follow.

Completed requests are marked as {{added}}/{{removed}} or {{declined}}, and are generally archived (search) quickly. Additions and removals are logged.

Other discussion
Troubleshooting and problems - If there is an error in the blacklist (i.e. a regex error) which is causing problems, please raise the issue here.
Discussion - Meta-discussion concerning the operation of the blacklist and related pages, and communication among the spam blacklist team.

snippet for logging: {{sbl-log|1363747#{{subst:anchorencode:SectionNameHere}}}}

Proposed additions

This section is for proposing that a website be blacklisted; add new entries at the bottom of the section, using the basic URL so that there is no link (example.com, not http://www.example.com). Provide links demonstrating widespread spamming by multiple users on multiple wikis. Completed requests will be marked as {{added}} or {{declined}} and archived.


Userpage drug spam



{{linksummary|newbluepill.com}}





Probably relates back to the Canadian Pharmacy spammer from 1 or 2 months ago.  — Mike.lifeguard | @en.wb 17:42, 20 January 2009 (UTC)[reply]

More: {{linksummary|professionalmedicines.com}} {{linksummary|phentrimine-shop.com}} {{linksummary|us-ds.com}}























{{linksummary|canadian-meds-shop.com}} {{linksummary|relaxinspain.ru}} {{linksummary|traveltoprague.ru}} {{linksummary|traveltofrance.ru}} {{linksummary|ruicona.ru}} {{linksummary|relaxinitaly.ru}}  — Mike.lifeguard | @en.wb 17:56, 20 January 2009 (UTC)[reply]

This traces back to Pharmacy terrorists - removing the domains already blacklisted.  — Mike.lifeguard | @en.wb 01:37, 21 January 2009 (UTC)[reply]

Added Added \byandex\.ru\b (the only remaining domain)  — Mike.lifeguard | @en.wb 02:27, 21 January 2009 (UTC)[reply]

This site was added by User:Mike.lifeguard in the global spam-list without any comment. Yandex is a search engine with 44% of requests in runet, and has many aditional features like dictionaries, blogs etc. I does not understand why this site is blacklisted. Now there are many references to slovary.yandex.ru (dictionary), to search results, etc in ruwiki, and editing the many pages may be difficult. For more information about this site see en:Yandex. Administrator of ruwiki, Track13 14:53, 21 January 2009 (UTC)[reply]

related talk: #Userpage drug spam. -- seth 17:16, 21 January 2009 (UTC)[reply]
Hmm... I does not see any "bad" edits with this url. May be all of these changes have been removed? or I do not understand something?
In any case, blocking one of the biggest portals of runet totally is not a good idea.I apologize for my English. I can invite anyone from our admins with more advanced English, if you want. =) Track13 18:31, 21 January 2009 (UTC)[reply]
Merged these sections. Upon closer inspection it seems the spammers were using yandex.com/redir (which is a 404 but should perhaps be blacklisted anyways). So, the whole domain shouldn't be blacklisted, it should be that specific section. I'll take a closer look shortly.  — Mike.lifeguard | @en.wb 19:51, 21 January 2009 (UTC)[reply]
COIBot reports only two link placings in non-ru-wiki: http://company.yandex.com/general_info/yandex_today.xml and http://company.yandex.com/press_center/press_releases/2008/2008-09-09.xml. Where did you get the information about yandex.com/redir? -- seth 20:45, 21 January 2009 (UTC)[reply]
Deleted edits on enwikibooks COIBot didn't catch. I'm not sure COIBot parses & saves diffs of link additions as fast as Beetstra thinks...  — Mike.lifeguard | @en.wb 21:09, 21 January 2009 (UTC)[reply]
Replaced by \byandex\.ru/redir\b  — Mike.lifeguard | @en.wb 18:07, 22 January 2009 (UTC)[reply]
Thanks! Track13 11:28, 23 January 2009 (UTC)[reply]

muselius.com



See e.g. http://www.muselius.com/api/. To be sure I asked at irc-channel #wikipedia-es whether they consider the link to be spam. They do. So do we at de-wiki. At pt-wiki the domain is blocked already. For that I Added blocked that domain now globally. -- seth 01:31, 21 January 2009 (UTC)[reply]

Hi seth!
What is the problem with this site? The page you mention http://www.muselius.com/api is just a page with documentation about how to use the API of this site. This has nothing to do with spam. I think :that muselius add value to some pages about art and museums with information, maps, etc. What is wrong with it? This info is not in wikipedia and this site is also editable, like a wiki, and has a community behind :maintaining the data it shows. Hope you can consider your decission. -- 85.0.208.95 22:11, 30 January 2009 (UTC)[reply]

ymap.ru

Added Added



which was spammed by





--Erwin(85) 15:37, 24 January 2009 (UTC)[reply]

viagra-billig.com





Already Added Added — Mike.lifeguard | @en.wb 15:07, 26 January 2009 (UTC)[reply]

Also involved:











 — Mike.lifeguard | @en.wb 15:38, 26 January 2009 (UTC)[reply]

I suggest to group some of the viagra spam by the following to entries
(?:buy|cheap)-?viagra
viagra-?(?:drugs|online|billig)
But this would block more sites than the present entries. -- seth 18:53, 26 January 2009 (UTC)[reply]

Redirects for clickbank.net













Originally submitted to en:Wikipedia_talk:WikiProject_Spam#earth-energy4home.tk
On the first three, if you view the source at these domains, they contain a script to redirect users to megaman27.earth4.hop.clickbank.net
The fourth entry is a redirect to www.file-factory.co.uk/energy.html, which itself redirects to landers85.earth4.hop.clickbank.net/
The fifth entry is a "blog", which is merely an advertisement whose only link is to taraff1.earth4.hop.clickbank.net
Note that "clickbank.net" is already on the meta blacklist, so the above URLs are simply redirects to attempt to bypass blacklisting of the referal tracking link. --- Barek (talkcontribs) - 21:17, 26 January 2009 (UTC)[reply]
OK, Added adding the following:
\bearth4energy(z|-solarhome.blogspot)?\.(tk|com|net)\b
\bhome-energy\.tk\b
\bearth-energy4home\.tk\b
\bfile-factory\.co\.uk/energy\.html\b
 — Mike.lifeguard | @en.wb 21:37, 26 January 2009 (UTC)[reply]

Additional clickbank.net redirects



Above entry redirects to megaman27.wwwtv.hop.clickbank.net?id=unlimitedtvonline


Above entry redirects to megaman27.homemadeen.hop.clickbank.net


Above entry redirects to megaman27.efitness.hop.clickbank.net?id=smartfatburning_com

The above links were added by en:User:86.163.142.60, all are being used as redirects to bypass the existing blacklisting of "clickbank.net". --- Barek (talkcontribs) - 05:12, 27 January 2009 (UTC)[reply]

Added Added  — Mike.lifeguard | @en.wb 04:46, 29 January 2009 (UTC)[reply]


annneftin.com spam

Spam domains









Deleted article


Spam accounts




--A. B. (talk) 14:46, 29 January 2009 (UTC)[reply]

Added Added  — Mike.lifeguard | @en.wb 14:56, 29 January 2009 (UTC)[reply]


Spammers attempting spam blacklist deletions often uncover more domains. Luckymurda's attempt to delete this entry has turned up another Neftin domain:[1]




--A. B. (talk) 20:05, 30 January 2009 (UTC)[reply]


pdesigner.net spam

Azerbaijani web designer and search engine optimization consultant. Spam originates from IPs in Azerbaijan and Italy.


Domains spammed on one or more projects registered to the spammer















Domains spammed on one or more projects by the spammer -- appear to be clients of the spammer's
















(related to stonexsurveying.com)





(related to stonexsurveying.com)



(related to stonexsurveying.com)



The Stonex Surveying spam was especially egregious.


Related domains registered to the spammer









Additional related domains associated with Stonex Surveying



















Accounts














The spammer has a number of other clients; on en.wikipedia, we are monitoring them -- is there a way to also monitor them x-wiki? If so, I think it's a good idea. Here's the list:

--A. B. (talk) 18:20, 31 January 2009 (UTC)[reply]

Proposed additions (Bot reported)

This section is for domains which have been added to multiple wikis as observed by a bot.

These are automated reports, please check the records and the link thoroughly, it may report good links! For some more info, see Spam blacklist/Help#COIBot_reports. Reports will automatically be archived by the bot when they get stale (less than 5 links reported, which have not been edited in the last 7 days, and where the last editor is COIBot).

Sysops
  • If the report contains links to less than 5 wikis, then only add it when it is really spam
  • Otherwise just revert the link-additions, and close the report; closed reports will be reopened when spamming continues
  • To close a report, change the LinkStatus template to closed ({{LinkStatus|closed}})
  • Please place any notes in the discussion section below the HTML comment

COIBot

The LinkWatchers report domains meeting the following criteria:

  • When a user mainly adds this link, and the link has not been used too much, and this user adds the link to more than 2 wikis
  • When a user mainly adds links on one server, and links on the server have not been used too much, and this user adds the links to more than 2 wikis
  • If ALL links are added by IPs, and the link is added to more than 1 wiki
  • If a small range of IPs have a preference for this link (but it may also have been added by other users), and the link is added to more than 1 wiki.
List Last update By Site IP R Last user Last link addition User Link User - Link User - Link - Wikis Link - Wikis
vrsystems.ru 2023-06-27 15:51:16 COIBot 195.24.68.17 192.36.57.94
193.46.56.178
194.71.126.227
93.99.104.93
2070-01-01 05:00:00 4 4

Proposed removals

This section is for proposing that a website be unlisted; please add new entries at the bottom of the section.

Remember to provide the specific domain blacklisted, links to the articles they are used in or useful to, and arguments in favour of unlisting. Completed requests will be marked as {{removed}} or {{declined}} and archived.

See also /recurring requests for repeatedly proposed (and refused) removals.

The addition or removal of a domain from the blacklist is not a vote; please do not bold the first words in statements.

newenergytimes.com



More an explanation request than a removal request (although the owner would probably be delighted if it were removed). The owner has contacted me with a query why his domain was blacklisted. Not really my area, so if there's a readily accessible archive please provide a link to it. Otherwise, would much appreciate an explanation of the decision for the domain owner's peace of mind. Thanks much, Durova 04:52, 12 January 2009 (UTC)[reply]

It's blacklisted on enwiki, though I couldn't immediately find the log entry there.  — Mike.lifeguard | @en.wb 04:55, 12 January 2009 (UTC)[reply]
Heh, shows how seldom I deal with blacklist issues. Thanks very much, Mike. :) Durova 05:10, 12 January 2009 (UTC)[reply]
Additional, it was cross-wiki spammed, it might even have a place here. I can provide full data if needed. --Dirk Beetstra T C (en: U, T) 11:17, 12 January 2009 (UTC)[reply]
I have not investigated newenergytimes.com, but I suspect that this is related to the en wikipedia blacklisting of lenr-canr.org by admin JzG, which he then brought here for blacklisting, which was done. There is discussion of this here under the proposed listing for lenr-canr.org. I do know that newenergytimes.com is a notable publication in the field of Low Energy Nuclear Reactions, popularly known as cold fusion, and that an editor who believes that this is fringe science would have a motive to get it blacklisted. However, from what I've seen so far with lenr-canr.org, there wasn't linkspamming, there was use of sources by an editor or editors, not banned, possibly controversial, and which would normally be resolved by editorial consensus in the article in question, rather than by administrative fiat.--Abd 18:15, 12 January 2009 (UTC)[reply]
Here is the insertion. As I suspected, it was JzG. He didn't make this easy to find.... He didn't log this entry nor the previous. He "proposed" lenr-canr.org, but only with his addition, not to discuss it. Here is the proposal. There is other discussion below at here.--Abd 18:35, 12 January 2009 (UTC)[reply]
  • And do I need to point out that this was on enWP? Or that independent admins have concurred with my diagnosis of abuse in respect of lenr-canr? We are dealing here with a long-term campaign of POV-pushing which has elements of the free energy suppression conspiracy theory. Of course those involved in running these sites perceive themselves as fearless bearers of TruthTM, and of course they will argue tenaciously for links to their site from Wikipedia, as such links are their lifeblood, but that doies not make their goals any less incompatible with ours. JzG 21:34, 15 January 2009 (UTC)[reply]
This certain has a lot of ferment on en.wikipedia. It appears that there have been at least some COI problems:
--A. B. (talk) 23:25, 19 January 2009 (UTC)[reply]

asphost4free.com



Free webhosting service, may contain material of many kinds. I see no reason for a general block. Presumably the web hotel removes sites which violate normal terms of service. Jacob Lundberg 23:32, 19 January 2009 (UTC)[reply]

Was added by M7: asphost4free\.com M7 spambot adding nonsense pages. May I ask what utility the domain has for our projects?  — Mike.lifeguard | @en.wb 18:16, 22 January 2009 (UTC)[reply]
At that time, some bot created nonsense pages (those ending with "/" or "index.php"), full with "free" spam links. An now I dot see any reason to de-list a free-hosting container, sources required for articles should come from established and well known sites. --M/ 12:56, 25 January 2009 (UTC)[reply]
Due to past problems with excessive linking to this domain, I do not believe this request should be fulfilled. We de-blacklist sites when trusted, high-volume editors request the use of blacklisted links because of their value in support of our projects. If such a situation arises, I'm sure the request will be carefully considered and this domain may well be removed.
Until such time, this request is Declined. — Mike.lifeguard | @en.wb 15:47, 25 January 2009 (UTC)[reply]

y23 pictures

http://meta.wikimedia.org/wiki/User:COIBot/XWiki/www-2.net

HELP!!! I am trying to donate to wikipedia the best of my pictures and you REMOVE the lot? Am I too quick to add pictures AT THE APROPRIATE PLACES?

You OBVIOUSLY have not looked at the pictures!!! They are all TOPICAL and totally kosher, even illuminating!!

Look, Do you know how much work it was to put all these pictures online? I am very disappointed with wikipedia.

buuuuhhuuuuuuu huuu huuuu !!! How mean!! Please revert!!

I can not possibly recreate all the pictures... can you undo the robot?— The preceding unsigned comment was added by 60.234.144.222 (talk)

Well, I see the majority of the links is to en and de wikipedia. I know that the German wikipedia has strict rules about the number and what should be linked, for the en wikipedia, you might want to review: en:Wikipedia:External Links, 'what wikipedia is not, repository', 'what wikipedia is not, directory', and you seem to have a conflict of interest (see the conflict of interest guideline. Both wikis, actually all of them, have an upload function, where you can upload images that improves the wikipedia more than mere links. Also, we are writing an encyclopedia, based on content.
Hence,  Declined --Dirk Beetstra T C (en: U, T) 11:03, 29 January 2009 (UTC)[reply]

Please see also my talk page.  — Mike.lifeguard | @en.wb 14:04, 29 January 2009 (UTC)[reply]

Thanks! It is OK to donate my pictures to wikipedia but it is forbidden to link to the source-page where other versions of the picture can be found? E.g. my donated picture of (my hero) Don Merton had a link to the source removed. But sources of reliable information is what wikipedia is all about, right? I took great care to donate illustrative pictures and have resigned to having them mercilessly commercialized through wikipedia. Yes, I own the domain, yes I have thousands of pictures online, yes I love wikipedia and yes I want to have my donated pictures sourced. Wikipedia is adamant about sourcing. So am I. RE: CONFLICT OF INTEREST: Where I technically promoted my website -- through an entry on the WEBLINKS section -- I did so because I do not own the copyright of the photo and cannot donate it. MY RATIONALE: Yoffe's batik (also on my server!) of www-2.net/y23.stock.pictures/200501_yoffe-batik/ very rare 16th century designs is good info for the next wikipedia content updator -- people more qualified than I -- because Yoffe has given the names and origin (!) of the batik designs on that webpage! It is a nice start to a new encyclopedic page about the origins of the designs. I will try to get Yoffe to add content to the batik page but the man is 80 years old -- so things are slow (I have to hurry and be motivated). Have a heart and remove the spam-block, so I can source my photos. I promise to obey the conflict of interest rules to the letter. They are very clear. Y23 21:59, 29 January 2009 (UTC)[reply]
Aha! The proper way to source-reference pictures is ONLY on the actual picture page. Will do so. Y23 03:39, 31 January 2009 (UTC)[reply]
You claim not to have copyright of these pictures, which means that you're not allowed to upload them to Wikipedia. So what do you plan on doing if we should remove this URL from the blacklist? As Beetstra told you the projects have a strict policy about external links. --Erwin(85) 09:44, 31 January 2009 (UTC)[reply]
Thanks. I will henceforth refrain from adding any weblinks that go to my server. If I still feel the urgent need -- I will first discuss it on the talk page as suggested in en:WP:COI. Y23 12:37, 31 January 2009 (UTC)[reply]

ecomarchenews.com



this site contains no spam or illegal material, I am the admin and I can certify that, I ask it is cacelled from spam list

Was added after cross-wiki linking which was deemed inappropriate.  — Mike.lifeguard | @en.wb 14:47, 29 January 2009 (UTC)[reply]

Troubleshooting and problems

This section is for comments related to problems with the blacklist (such as incorrect syntax or entries not being blocked), or problems saving a page because of a blacklisted link. This is not the section to request that an entry be unlisted (see Proposed removals above).

User: namespace abuse

This section is for reporting abuse of userpages for promotional purposes; add new entries at the bottom of the section, using the basic URL so that there is no link (example.com, not http://www.example.com). Abuse across several wikis should be reported here; please provide links to example behaviour. Completed requests will be marked as {{added}} or {{declined}} and archived.


Discussion

Open reports

You can check the development of the number of open reports at tools:~erwin85/stats/. It's been gathering data for just over a week, so I hope the trend we see now won't last. --Erwin(85) 19:37, 1 December 2008 (UTC)[reply]

Erwin, others. This is strongly dependent on how strong I put my detection, etc. Some of them are still quite weak. E.g. it detects when links are only added by IPs, which misses the ones which get put on the more visible pages where one of the antivandalism bots reverts an edit to re-include the link the bots don't report it here (and that are probably the more visible ones). I am thinking to make it something like >90% by IPs (or >90% by not-whitelisted users?), but that will probably give even more ..
One of the problems is that for the moment I don't know yet what to do with links which get spammed to 2 projects, technically they would be better to be evaluated by the two projects locally. We really, really need more manpower here.
Are there many that are detected falsely?
Erwin, coul you count also the ones that COIBot marks as stale, and the ones that get ignored/closed (and possibly even those who get actually added) .. May give me a bit more a feeling of where I should look at the thresholds??
I think we should perhaps raise the bar on reporting. For example, we want to know about link additions which occur several times (say, 3) on one page (ie it keeps getting removed, and it keeps getting re-added). And we want to know about patterns affecting several wikis (say, 4 or 5). Less than that is too much to handle, it seems. Of course, the best solution is to add manpower. I'm not sure how best to go about doing that.  — Mike.lifeguard | @en.wb 02:09, 3 December 2008 (UTC)[reply]
Problem with increasing thresholds could make us miss some awful stuff. I am adapting the autostale function of COIBot:
...
if ((($timedifference > $settings{'closingtime'}) && ($lasteditor eq "COIBot")) && ((($fulllinkcount < 5) && ($fulllinkcount > 0) && ($fulllinklangcount > 2)) || ((($fulllinkcount < 10) && ($fulllinkcount > 0)) && ($fulllinklangcount < 3)))) {
     &closeoldone($pages);
} else {
...

$timedifference the difference between now and the last edit to the report
$settings{'closingtime'} at this moment 1 week.
$lasteditor the last editor of the report
$fulllinkcount the number of additions of the link
$fulllinklangcount the number of wikis the domain was added to
$pages the XWiki report pagename
That should keep it smaller, while we should not miss anything. --Dirk Beetstra T C (en: U, T) 11:41, 3 December 2008 (UTC)[reply]
I'll add Category:Closed XWiki reports, Category:Ignore XWiki reports and Category:Stale XWiki reports in the near future. It's more difficult to count the number of added links. The best way I can think of is counting the number of page titles starting with User:COIBot/XWiki/ which use {{Added}}. Any suggestions? I'll probably end up using a SQL query for it, so it'll be affected by the replag. In any case, it would be nice to see what effect the settings have on the number of reports and what we do with the reports. --Erwin(85) 20:55, 3 December 2008 (UTC)[reply]
The best count would be to see which reportnames are 'caught' by the meta blacklist. Thanks for making these statistics, Erwin. --Dirk Beetstra T C (en: U, T) 10:56, 4 December 2008 (UTC)[reply]
That would indeed be better, but I decided to use a category instead. The stats are updated every 5 minutes and I don't want to check > 1.000 closed reports each time. I want to keep it as simple as possible, so I guess we'll just have to use {{Added}}. The graphs now use a logarithmic scale because the range is quite big and I can't add another Y axis with different scale. I'd like to have a linear scale, but that would be dominated by the number of closed reports. Having a separate graph for each count is even less informative and more work, so I don't think that's an option. Does any know of a better way to present the data dynamically using RRDtool? --Erwin(85) 21:47, 4 December 2008 (UTC)[reply]
Do we really care about the number of closed reports? If not, would removing that make a linear scale possible?  — Mike.lifeguard | @en.wb 22:19, 4 December 2008 (UTC)[reply]
Beetstra, do you care? A linear scale is indeed possible without the closed reports. I'll add a fraction of the closed reports, e.g. closed/10, first though. That should work as well even though I can't add a second Y axis for it. --Erwin(85) 08:58, 5 December 2008 (UTC)[reply]
Nah, I don't care about that graph of closed, just include the number somewhere for reference? --Dirk Beetstra T C (en: U, T) 11:26, 5 December 2008 (UTC)[reply]
OK, maybe I'll remove it in the future, but for now it includes closed/10 as a graph. It fits nicely in the middle of the range that way. --Erwin(85) 10:46, 6 December 2008 (UTC)[reply]

Can we add a graph for:

  • average and max age in days (since last link addition)
  • average and max for Link-Wikis
  • average and max for Link

I think those would be useful for tracking the backlog & how serious it is. We already know how many reports of various types we want, but we also want to know how old they are, and how serious they are (the last 2 being a measure of how serious). As well, the page seems to waste a lot of space -- perhaps modelling it after tools:~bryan/stats/replag would be useful?  — Mike.lifeguard | @en.wb 01:26, 9 December 2008 (UTC)[reply]

Any chance we can track users by the number of edits they make to the User:COIBot/XWiki "namespace"? As reports that get closed are archived, it's tough to know who's helping otherwise.  — Mike.lifeguard | @en.wb 18:20, 9 December 2008 (UTC)[reply]
I missed your previous message before. What graphs do you want to add? Age is a time difference between what? --Erwin(85) 10:03, 14 December 2008 (UTC)[reply]
Age is the time from the last link addition to now, in days. So, track the average and max age of currently-open reports. Also, the average and max for Link-Wikis and Link (ie the columns listed on User:COIBot/XWiki)... if that makes sense :)  — Mike.lifeguard | @en.wb 04:26, 20 December 2008 (UTC)[reply]

Screencasts

I was thinking of ways to get new contributors to the SBL & decided that simply not knowing what to do and how to do it is probably a big barrier for people. So, I'm going to make a series of screencasts to cover some of the tasks we do here, which should hopefully be helpful for recruiting new people.

Your comments about the one I've already done are welcome. Mardetanha has said this is helpful, so I think this will be a worthwhile project to continue. I've already learned a few things that should make the end result higher quality for the next ones.  — Mike.lifeguard | @en.wb 18:32, 9 December 2008 (UTC)[reply]

Toolserver replication halted

FYI: cluster s3 won't be replicated to the toolserver for the time being, see mailarchive:toolserver-l/2009-January/001766.html. This affects the results of the toolserver's tools for most wiki's, e.g. tools:~erwin85/xwiki.php won't be up-to-date. The English Wikipedia is on cluster 1 and the German Wikipedia, Dutch Wikipedia and a handful of other projects are on cluster 2. --Erwin(85) 21:07, 6 January 2009 (UTC)[reply]

New hardware has been ordered & should arrive in about 2 weeks. s1 will be moved to the new server at that point, and s3 will be re-imported on it's own server.  — Mike.lifeguard | @en.wb 01:59, 18 January 2009 (UTC)[reply]