Jump to content

Talk:Spam blacklist

From Meta, a Wikimedia project coordination wiki
This is an archived version of this page, as edited by John Vandenberg (talk | contribs) at 07:17, 10 September 2008 (→‎Highhi: also user Kisspig). It may differ significantly from the current version.

Latest comment: 15 years ago by Jayvdb in topic User: namespace abuse
Shortcut:
WM:SPAM
The associated page is used by the Mediawiki Spam Blacklist extension, and lists strings of text that may not be used in URLs in any page in Wikimedia Foundation projects (as well as many external wikis). Any meta administrator can edit the spam blacklist. There is also a more aggressive way to block spamming through direct use of $wgSpamRegex. Only developers can make changes to $wgSpamRegex, and its use is to be avoided whenever possible.

For more information on what the spam blacklist is for, and the processes used here, please see Spam blacklist/About.

Please post comments to the appropriate section below: Proposed additions, Proposed removals, or Troubleshooting and problems, read the messageboxes at the top of each section for an explanation. Also, please check back some time after submitting, there could be questions regarding your request. Per-project whitelists are discussed at MediaWiki talk:Spam-whitelist. In addition to that, please sign your posts with ~~~~ after your comment. Other discussions related to this last, but that are not a problem with a particular link please see, Spam blacklist policy discussion.

Completed requests are archived (list, search), additions and removal are logged.

snippet for logging: {{sbl-log|1174410#{{subst:anchorencode:SectionNameHere}}}}

If you cannot find your remark below, please do a search for the URL in question with this Archive Search tool.

Spam that only affects single project should go to that project's local blacklist

Proposed additions

This section is for proposing that a website be blacklisted; add new entries at the bottom of the section, using the basic URL so that there is no link (example.com, not http://www.example.com). Provide links demonstrating widespread spamming by multiple users on multiple wikis. Completed requests will be marked as {{added}} or {{declined}} and archived.







Cosmoetica.com



See [1] and [2], a sockfarm of users all adding links to work by Dan Schneider, mostly from his website cosmoetica.com. 120 links cleaned from enWP and a number from other wikis (I'm on it but it's slow as I have to cross-check who added them). Where the links are added by anons, it is a stable subnet. Definitely a candidate for blacklisting on enWP and probably a candidate for meta blacklisting due to cross-wiki issues, albeit fairly limited by comparison wiht the extensive enWP abuse. JzG 21:54, 1 September 2008 (UTC)Reply

Is this really something we want to be linking to regardless of who added it? Furthermore, it seems from the links you've provided that several domains are involved. Has blacklisting been discussed on enwiki yet?  — Mike.lifeguard | @en.wb 18:22, 2 September 2008 (UTC)Reply
Yes, it's now blacklisted on enWP (and one of Schneider's socks promptly requested whitelisting). The list of socks is up to about 40 now [3], but primarily an enWP issue. Still, there is some cross-wiki activity and the guy is very determined to use Wikipedia to make his the "The Most Widely Read Interview Series In Internet History!" - to which I cynically respond: {{fact}}. JzG 18:05, 4 September 2008 (UTC)Reply
Added Added  — Mike.lifeguard | talk 16:37, 7 September 2008 (UTC)Reply



Franchising spam

{{linksummary|fai.co.in}} {{linksummary|franchise.org.au}} Per luxo:85.121.14.131 and Cross-wiki vandalism report. Perhaps a tad stale. The IP has been reverted since then.  — Mike.lifeguard | talk 14:17, 4 September 2008 (UTC)Reply

In his contributions I see spamming of




and not the domains you mentioned. Blacklist those? --Erwin(85) 08:40, 5 September 2008 (UTC)Reply
Yes... Now to find out who is spamming those domains.  — Mike.lifeguard | talk 11:21, 5 September 2008 (UTC)Reply
Sorry, no useful data in my database .. --Dirk Beetstra T C (en: U, T) 11:24, 5 September 2008 (UTC)Reply

Added Added. --Erwin(85) 11:39, 6 September 2008 (UTC)Reply


ara-bux.com





Please see [4]. (Blocked at Commons) Cirt (talk) 09:55, 5 September 2008 (UTC)Reply

Added Added. Thanks. --Erwin(85) 10:00, 5 September 2008 (UTC)Reply
Oh I think the domain is ara-bux.com, not bara... Cirt (talk) 10:01, 5 September 2008 (UTC)Reply
\b is a special character in regex. It's used to make sure that e.g. \bbar\.com\b matches the spam domain bar.com, but not the good domain foobar.com. --Erwin(85) 10:12, 5 September 2008 (UTC)Reply
Ah okay thank you. Cirt (talk) 10:27, 5 September 2008 (UTC)Reply

Proposed additions (Bot reported)

This section is for websites which have been added to multiple wikis as observed by a bot.

Items there will automatically be archived by the bot when they get stale.

Sysops, please change the LinkStatus template to closed ({{LinkStatus|closed}}) when the report is dealt with, and change to ignore for good links ({{LinkStatus|ignore}}).

These are automated reports, please check the records and the link thoroughly, it may be good links! For some more info, see Spam blacklist/help#SpamReportBot_reports

If the report contains links to less than 5 wikis, then only add it when it is really spam. Otherwise just revert the link-additions, and close the report, closed reports will be reopened when spamming continues.

The bot will automagically mark as stale any reports that have less than 5 links reported, which have not been edited in the last 7 days, and where the last editor is COIBot. They can be found in this category.

Please place suggestions on the automated reports in the discussion section.

COIBot

Running, will report a certain domain shortly after a link is used more than 2 times by one user on more than 2 wikipedia (technically: when more than 66% of this link has been added by this user, and more than 66% of this link were added XWiki). Same system as SpamReportBot (discussions after the remark "<!-- Please put comments after this remark -->" at the bottom; please close reports when reverted/blacklisted/waiting for more or ignore when good link)

List Last update By Site IP R Last user Last link addition User Link User - Link User - Link - Wikis Link - Wikis
vrsystems.ru 2023-06-27 15:51:16 COIBot 195.24.68.17 192.36.57.94
193.46.56.178
194.71.126.227
93.99.104.93
2070-01-01 05:00:00 4 4

Proposed removals

This section is for proposing that a website be unlisted; please add new entries at the bottom of the section.

Remember to provide the specific domain blacklisted, links to the articles they are used in or useful to, and arguments in favour of unlisting. Completed requests will be marked as {{removed}} or {{declined}} and archived.

See also /recurring requests for repeatedly proposed (and refused) removals.

The addition or removal of a domain from the blacklist is not a vote; please do not bold the first words in statements.



x.y.z.info

Concerning regexp [0-9]+\.[-\w\d]+\.info/?[-\w\d]+[0-9]+[-\w\d]*\].
A few days ago I removed this entry, but was told afterwards, that every removing needs a de-list discussion. So here I go ([#double/wrong entries|again]).
Short: This entry never worked and does not seem to be needed, so imho it's the best to remove the entry.
Long: In the beginning of 2006 there had been this request, which was added immediately. It was modified some time later. But all versions of the entry never matched anything, because the spamblock extension does not work on link descriptions, but only on the link itself. So there will never be a match on whitespace or square brackets.
Now there are 2 possibilities: 1. fix the regexp or 2. remove it permanently.
The original request said that the urls were something like (integer number).(letter).(name).info (which could perhaps be translated into \d+\.[a-z]\.\w+\.info). But if one looks at the present sbl, one can't see even one entry like this. So probably there's no need to block those domains anylonger. The only possibiliy is that entries like "cinn\.info" and "ephraim\.info" are of this format but were inserted without third-level domains. However, a short look into the history of the sbl discussion does not verify that.
Altogether I suggest to leave the entry removed. -- seth 09:59, 7 September 2008 (UTC)Reply

This entry (if done properly) is probably too broad; recommend removal.  — Mike.lifeguard | talk 19:35, 8 September 2008 (UTC)Reply

Caracal pistol, italian Wiki article





[QUOTE]

Hello Erwin, I would like to know the reason of Caracal info european site being listed on spamlist/blacklist and the removal of the link of the italian article. I am the author of all Wikipedia articles in 16 languages related to the first pistol made in United Arab Emirates known as Caracal pistol and I regularly post the latest news on Caracal-pistol.info website to keep readers informed of the latest developments since day one. Sincerely Edmond HUET Quickload 09:55, 6 September 2008 (UTC)Reply

Hi, as far as I can see there's only a small amount of information available on the web site. Most links point to your Domains for sale section. That and adding it to multiple wiki's caused me to blacklist it. Feel free to request removal from the blacklist at Talk:Spam blacklist. --Erwin(85) 11:38, 6 September 2008 (UTC)Reply

Hi, Small amount of information? Maybe, you should click on the 10 buttons on the left when you are on any one of the pages http://www.caracal-consulting.com/caracal-pistol-datas/caracal-pistol-datas.html There is no other site on the web and all the available infos related to Caracal pistol can be found on this site.

[END QUOTE]

Hello, I request removal from blacklist, above quote explains why. Ask for more info if needed. Quickload 09:12, 7 September 2008 (UTC)Reply

Clearly that domain is not blacklisted at meta.  — Mike.lifeguard | talk 16:23, 7 September 2008 (UTC)Reply
caracal-pistol.info however is. Given your conflict of interest in this case, the cross-wiki additions and our norm of declining de-listing requests from site owners, this request is  Declined.  — Mike.lifeguard | talk 16:30, 7 September 2008 (UTC)Reply


OK, given the related domains, and additions by Quickload, I think this may have turned into a request for listing. I'm normally not a fan of listing related domains, but Quickload seems to have a COI here, and is adding sites cross-wiki. Looking for input here. Related domains listed below.  — Mike.lifeguard | talk 16:35, 7 September 2008 (UTC)Reply

Related domains

















































































 — Mike.lifeguard | talk 16:35, 7 September 2008 (UTC)Reply


Hi,

my site is free-jsf-components.net, has been listed on http://en.wikipedia.org/wiki/JavaServer_Faces http://de.wikipedia.org/wiki/JavaServer_Faces

now it is black listed. Please remove it from black list because it is very useful and non profitable technical site.

Troubleshooting and problems

This section is for comments related to problems with the blacklist (such as incorrect syntax or entries not being blocked), or problems saving a page because of a blacklisted link. This is not the section to request that an entry be unlisted (see Proposed removals above).


nimp.org

Just noticed that we are blocking only

wikipedia\.on\.nimp\.org
\.on\.nimp\.org
\bblocked\.on\.nimp\.org\b

when we might as well block the whole thing:

\bnimp\.org\b

 — Mike.lifeguard | @en.wb 01:44, 3 September 2008 (UTC)Reply

Please do, yes. We're all tired of seeing it in edit summaries by now ... - Alison 05:49, 8 September 2008 (UTC)Reply
I was thinking more along the lines of "Is there some technical reason for using these regexes instead of that one?" but {{done}} just the same!  — Mike.lifeguard | talk 11:45, 8 September 2008 (UTC)Reply
I can't find the 3 regexes I listed above - perhaps my browser isn't loading the whole blacklist? 0.o
Can someone else double-check me?  — Mike.lifeguard | talk 11:55, 8 September 2008 (UTC)Reply
It got removed recently in this diff. Hold on - I'll bop it back in now :) - Alison 12:21, 8 September 2008 (UTC)Reply
And Done - it's kinda already been logged, so I left it alone - Alison 12:27, 8 September 2008 (UTC)Reply
Bah, I just wanted to make sure the other ones were gone. So we're good now.  — Mike.lifeguard | talk 12:29, 8 September 2008 (UTC)Reply
I recently deleted the entries
wikipedia\.on\.nimp\.org
\bblocked\.on\.nimp\.org\b
for they were totally redundant, because of \.on\.nimp\.org. (And I didn't see your request.)
Blocking the whole domain nimp.org is even more restrictive, but I don't see any good pages there, so I guess the changing to nimp\.org is ok. -- seth 12:28, 8 September 2008 (UTC)Reply

Bad backlog on MediaWiki talk:Spam-whitelist

Please pitch in and help whittle this down. We have editors who've been waiting several months.

Thanks, --A. B. (talk) 15:14, 8 September 2008 (UTC)Reply

I see three requests, all three answered, and none of them suitable for whitelisting (as it needs whitelisting on the local projects, if anywhere). Are you sure you link to the right page? --Dirk Beetstra T C (en: U, T) 15:18, 8 September 2008 (UTC)Reply
I think he meant w:en:MediaWiki talk:Spam-whitelist.  — Mike.lifeguard | talk 16:24, 8 September 2008 (UTC)Reply
Ooops -- I meant to post this at en:MediaWiki talk:Spam-blacklist. --A. B. (talk) 18:42, 8 September 2008 (UTC)Reply

User: namespace abuse

Nervenhammer



similar pattern, adding a personal link..--Cometstyles 12:01, 12 August 2008 (UTC)Reply
Thanks Comets - Added Added for now. In passing I see no harm in listing such sites as much to send a message to the user that their behaviour may not be appropriate. Not sure about how lasting teh listing should be our logging immediately - thoughts welcome. --Herby talk thyme 12:12, 12 August 2008 (UTC)Reply
Reviewing this it may well be a good faith de user who has just decided to expand there interests (based on SUL info). In which case I suggest serious consideration for de-listing if we are asked. --Herby talk thyme 12:17, 12 August 2008 (UTC)Reply
Hi guys, I don't understand this, why is my personal website Nervenhammer on this blacklist? Fleshgrinder 09:53, 22 August 2008 (UTC)Reply
Adding the link to your userpage on many wikis where you are not a community member is generally frowned upon. I suggest you instead leave a link to your userpage on your home wiki if you need to create a userpage. If you are an established community member, you would be afforded more leeway with respect to user page content. I'm prepared to de-list this on the condition that the link is not added cross-wiki again.  — Mike.lifeguard | @en.wb 14:20, 31 August 2008 (UTC)Reply
Okay, I'm very sorry about that, it was never my intention to start link building for my website - I only wanted to show my person and what I do. It won't happen again. If I'm not really contributing something I don't create a userpage and if, I set a link to the German Wikipedia (where I'm contributing the most). Thank you for the answer and for elucidating me about this issue. It would be nice if you would enlist the URI, because I don't want my URI to be on a blacklist and I'm definitly not going to post the address again. Kindest regards --Fleshgrinder 09:48, 2 September 2008 (UTC)Reply

Removed Removed  — Mike.lifeguard | @en.wb 14:51, 2 September 2008 (UTC)Reply

Autofinance



Cross wiki spam pages. (autofinance-ez.com is the domain). --Herby talk thyme 12:59, 13 August 2008 (UTC)Reply



Added Added  — Mike.lifeguard | @en.wb 17:18, 1 September 2008 (UTC)Reply

Bestlyriccollection



What is that? I stumbled into it when 84.109.83.73 was vandalizing through the wikis. Best regards, --birdy geimfyglið (:> )=| 10:41, 14 August 2008 (UTC)Reply

Very odd indeed. fr wp didn't like the idea of a "user page for bookmarks". Not sure that it is spam but sure doesn't look like "normal" user pages. Looking some more & other opinions would be good. --Herby talk thyme 11:02, 14 August 2008 (UTC)Reply
They have a point [5]... I don't understand why he needs that in multiple wikis, I mean, if he (miss)uses his userpage for bookmarks, why on many places, --birdy geimfyglið (:> )=| 12:25, 14 August 2008 (UTC)Reply

Jon Awbrey and JonAwbrey





Creates userpages full of external links (and selfpromotion references?) on many wikis. Annabel 19:08, 28 August 2008 (UTC)Reply

Discussion

I placed the same vita on my user page that I use on all the sites where I contribute work and discuss ideas with other interested parties. This does not constitute SPAM (= "unsolicited mass-mailing or posting") in any technical or COI sense of the word. I would appreciate the two variants of my real name that I use on the Internet and Web not being listed on any kind of badlists. Thank you, Jon Awbrey 19:12, 29 August 2008 (UTC)Reply
While it may not be spam, it would seem to be abuse of WMF wikis & as such unwanted. While community members are given leeway with their userpages, such excessive linking is generally frowned upon. Furthermore, I very much doubt you understand all the languages you have posted this to, nor are you active in those wikis. I invite you to fix the problem before it is done for you. The history at enwiki will be of interest to others reviewing this.  — Mike.lifeguard | @en.wb 19:36, 29 August 2008 (UTC)Reply
I would appreciate it if you could point to the relevant WMF Terms of Service, or even a generally accepted standard of etiquette that would justify your calling this user page vita an "Abuse". I am referring to the one now posted here at Meta, which is a copy of the one deleted by Annabel from my Nederlands User Page. By "generally accepted standard of etiquette" I mean one that you could honestly assure me is followed across the board on all WMF User Pages. In addition, I have never seen any notice of Wikipedias being "Encyclopedias that anyone who is fluent in the local language can edit" — but please let me know if I have missed such a restriction somewhere. Jon Awbrey 20:22, 29 August 2008 (UTC)Reply
You misunderstand me crucially. I do not say you need to be fluent in the languages where you contribute. To claim that would be hypocritical; I edit all WMF wikis. The issue is that:
  1. You are not an established member of the community on any wiki where you have a userpage (so far as I can tell).
  2. Your userpage has an excessive amount of links (indeed, links form the only content, and they appear to be placed for self-promotional purposes). This would perhaps be an issue regardless of the above.
 — Mike.lifeguard | @en.wb 20:31, 29 August 2008 (UTC)Reply

[Undent]: Correct me if I am wrong, but I do not think it is customary for newcomers to any of the many-tongued Wikipædiæ to be subjected to the ordeals of this type of entrance exam with regard to the legitimacy of their participation. However, By FYIing my real name, educational background, and ongoing intellectual interests, I have certainly done more than the avarage Anon IP on that score.

Many people post pics on their user pages as a way of providing a friendly introduction to themselves, their current interests, and their personal histories. My old web vita harks back to a day when I was unsure about the propriety of copying pics, so I used links instead, over the years being forced to replace many of them with WayBak links. You can hardly dream that I am collecting revenue off archival links like that, can you?

If and when you personally discover an interest in some of the Active Suggestions Concerning Intellectual Interchange that I enumerated in my web vita — which was my sole purpose in posting it to my NL User Page — then we may find more interesting things to talk about. In the mean time, I can hardly become an "established member of the community on any wiki", much less learn a few bits of the local colour and language, if some Admin deletes my self-introductory user page and blocks my account after the first few edits, now can I? Jon Awbrey 23:45, 29 August 2008 (UTC)Reply

  • Jon, this same sort of Wikilawyering nonsense is what got you banned from enWP and booted from the mailing list. Obviously your rampant sockpuppetry and disruption ensures you remain banned on enWP. I would be the first to help you if you wanted your massive list of socks associated with some other name, to reduce the impact on you, but I don't see why we should help you to pretend that you are here to do anything other than the usual: self-promotion and idiosyncratic original research. JzG 20:50, 4 September 2008 (UTC)Reply
Still placing pages - en wq in the past few hours. Cheers --Herby talk thyme 08:00, 6 September 2008 (UTC)Reply
This is shameless self-promotion, and I would suggest that someone who has the necessary rights removes the pages from all projects on which he is not an active participant. JzG 11:44, 7 September 2008 (UTC)Reply

So, the following links are the ones being used for vanity spamming here:

planetmath.org/encyclopedia/DifferentialPropositionalCalculus.html
www.mywikibiz.com/Directory:Jon_Awbrey
www.mywikibiz.com/User:Jon_Awbrey
www.mywikibiz.com/User_talk:Jon_Awbrey 
http://planetmath.org/?op=getuser&id=15246
knol.google.com/k/-/-/3fkwvf69kridz/1
mathforum.org/kb/accountView.jspa?userID=99854
www.mathweb.org/wiki/User:Jon_Awbrey
www.mathweb.org/wiki/User_talk:Jon_Awbrey
www.research.att.com/~njas/sequences/?q=Awbrey
www.p2pfoundation.net/User:JonAwbrey
www.p2pfoundation.net/User_talk:JonAwbrey
altheim.4java.ca/ceryle/wiki/Wiki.jsp?page=JonAwbrey
forum.wolframscience.com/member.php?s=&action=getinfo&userid=336
www.wikinfo.org/index.php/User:Jon_Awbrey
www.wikinfo.org/index.php/User_talk:Jon_Awbrey
www.getwiki.net/-User:Jon_Awbrey
www.getwiki.net/-UserTalk:Jon_Awbrey
ontolog.cim3.net/cgi-bin/wiki.pl?JonAwbrey
semanticweb.org/wiki/User:Jon_Awbrey
semanticweb.org/wiki/User_talk:Jon_Awbrey
wikipediareview.com/index.php?showuser=5619
wikipediareview.com/index.php?showuser=398
zh.wikipedia.org/wiki/User:Jon_Awbrey
zh.wikipedia.org/wiki/User_talk:Jon_Awbrey
org.sagepub.com/cgi/content/abstract/8/2/269
www.cspeirce.com/menu/library/aboutcsp/awbrey/integrat.htm
www.chss.montclair.edu/inquiry/fall95/awbrey.html
www.abccommunity.org/tmp-a.html
www2.oakland.edu/secs/dispprofile.asp?Fname=Fatma&Lname=Mili
www2.oakland.edu/secs/dispprofile.asp?Fname=Mohamed&Lname=Zohdy
www2.oakland.edu/oakland/ouportal/index.asp?site=87
www.msu.edu/dig/msumap/psychology.html
www.msu.edu/dig/msumap/beaumont.html
quod.lib.umich.edu/cgi/i/image/image-idx?id=S-BHL-X-BL001808%5DBL001808
www.uiuc.edu/navigation/buildings/altgeld.top.html 
www.mth.msu.edu/images/wells_medium.jpg 
www.msu.edu/dig/msumap/phillips.html
www.enolagaia.com/JMC.html
I would like to add a comment. As long as this page edited by this user multiple times on the English Wikipedia still exists, we look absolutely foolish trying to suppress a passive list of vitae links from a USER page, for heaven's sake. No surprise. Given the opportunity to choose two paths, Wikimedians will select the most backward, stupid-looking one. -- Thekohser 18:04, 8 September 2008 (UTC)Reply
To my knowledge, Elonka has not edited her article in a long time; this was a big issue in her several RfAs and she's been severely criticized for this before. If I'm wrong and there remains an ongoing issue with coi edits, let me know. Thanks, --A. B. (talk) 18:58, 8 September 2008 (UTC)Reply
Given the variety of links, and that several may well have legitimate uses, I'm going to remove the links. Pushing links is inappropriate regardless of the namespace.  — Mike.lifeguard | talk 19:09, 8 September 2008 (UTC)Reply
Well, I was pointedly reverted on English Wikiversity. I did attempt an explanation in irc, but that was equally-pointedly rebuffed. Relevant on-wiki discussion is on English Wikibooks. Perhaps someone else would take that on.  — Mike.lifeguard | talk 00:05, 9 September 2008 (UTC)Reply
Comment: I see JonAwbrey has reverted quite a number of linkremovals on userpages. --Dirk Beetstra T C (en: U, T) 13:57, 9 September 2008 (UTC)Reply
More userpages on enwikquote, eswiki, fiwiki, kowiki, ruwiki on top of the reverts.  — Mike.lifeguard | talk 14:38, 9 September 2008 (UTC)Reply
See also a discussion between Moulton and Jon Awbrey on Wikiversity, which seems to contain a threat.  — Mike.lifeguard | talk 14:24, 9 September 2008 (UTC)Reply

[Undent] Mr. Lifeguard, given your acknowledgement that the material in question is not "SPAM", I think that further discussion on this so-called "spam blacklist" page is no longer relevant. So I would like to request, once again, that you remove the listing of my usual Internet names from this page. Thanks in advance, JonAwbrey 17:03, 9 September 2008 (UTC)Reply

It may or may not be spam. To me it certainly is abuse of the facilities that are enjoyed by users provided by the Foundation. Your contributions to many projects are zero other than your overlinked user page. --Herby talk thyme 18:02, 9 September 2008 (UTC)Reply
Your statements are incorrect. Since you appear genuinely interested, I can give you a list of contributions to several projects that may not show up in your cursory scans. For instance, you are probably missing the contributions that come by way of interwiki translations of articles that I wrote for the English Wikipedia. These contributions are, in my humble opinion quite substantial. Indeed, it was in following the search engine traces of these translations that I was brought to many of those non-anglophone Wikipedias. JonAwbrey 18:26, 9 September 2008 (UTC)Reply
As for the rest, surely you must have some sense of how silly it would sound to say that a person cannot be allowed to contribute unless he or she is already an established contributor? Surely? JonAwbrey 18:26, 9 September 2008 (UTC)Reply
Fortunately you are entitled to your opinion & I to mine. Wikis are about collaborative working with consensus among folk - your view would seem at odds with some others and not to be particularly collaborative in their approach. Personally I'm inclined to consider blacklisting the links as I see the excessive linkage to be outside the scope of most projects.
There is nothing silly about suggesting that someone whose only contribution to a project is a personal page which is out of scope is not effectively contributing. I delete many such pages most days. --Herby talk thyme 18:56, 9 September 2008 (UTC)Reply

JzG

The following discussion is closed.


Using the User: namespace to promote the domain:



JonAwbrey 20:42, 8 September 2008 (UTC)Reply


closed, please don't make bulk requests, there is no crosswiki spam, he either links to his enwiki userpage or has no userpage. Thanks, --birdy geimfyglið (:> )=| 20:51, 8 September 2008 (UTC)Reply

Elonka ‎

The following discussion is closed.


Creates userpages full of external links (and self-promotion references!) on several wikis. -- Thekohser 20:45, 8 September 2008 (UTC)Reply

closed, please don't make bulk requests, there is no crosswiki spam, he either links to his enwiki userpage or has no userpage. Thanks, --birdy geimfyglið (:> )=| 20:55, 8 September 2008 (UTC)Reply

Highhi



Is most definitely a spammer who creates "SCM declassified" (www.51icjm.com link) on his userpage and talkpage so it can't be rollbacked :( ..thsi account did a similar thing (same pattern)--Cometstyles 03:24, 9 September 2008 (UTC)Reply

Despite being blocked on Commons they are still spamming their user page. Protected now but I guess other projects will be affected. Cheers --Herby talk thyme 11:11, 9 September 2008 (UTC)Reply
Worth locking the account?  — Mike.lifeguard | talk 18:18, 9 September 2008 (UTC)Reply
The account is not global ;( I would suggest to add the link to the bl, best regards, --birdy geimfyglið (:> )=| 18:19, 9 September 2008 (UTC)Reply
It is added, but they are spamming it in plaintext. This is why we want to give stewards the ability to forcibly merge accounts. By not unifying, spammers and vandals may continue unless we block them on each wiki individually. I will purge the userpages, but without some mechanism to enforce this, I do not see how we may force them to stop.  — Mike.lifeguard | talk 18:24, 9 September 2008 (UTC)Reply
Sorry, they are spamming a new domain, which I'm Added adding.  — Mike.lifeguard | talk 18:30, 9 September 2008 (UTC)Reply

Domain was



 — Mike.lifeguard | talk 18:33, 9 September 2008 (UTC)Reply

Checkuser on English Wikisource reminded of an earlier account that has done this, Kisspig. Please globally block the account. John Vandenberg 07:17, 10 September 2008 (UTC)Reply

Discussion

NOINDEX

Prior discussion at Talk:Spam_blacklist/Archives/2008/06#Excluding_our_work_from_search_engines, among other places

There is now a magic word __NOINDEX__ which we can use to selectively exclude certain pages from being indexed. I suggest having the bots use this magic word in all reports generated immediately. Whether to have this page and it's archives indexed was a point of contention previously, and deserves further discussion.  — Mike.lifeguard | @en.wb 01:33, 4 August 2008 (UTC)Reply

Sorry missed this one. I certainly support the "noindex" of the bot pages. They are somewhat speculative. If we could get the page name changed I would be happier about not using the magic word on this but..... --Herby talk thyme 16:09, 6 August 2008 (UTC)Reply
I have added the keyword to the COIBot generated reports, they should now follow that. --Dirk Beetstra T C (en: U, T) 16:31, 6 August 2008 (UTC)Reply
My bot is flagged now, so I can start adding it to old reports. I will poke a sysadmin first to see if I really must make ~12000 edits before I start though. It will not be all in one go, and I will not start for a day or two.
Any other thoughts on adding it to this page and/or it's archives?  — Mike.lifeguard | @en.wb 18:11, 9 August 2008 (UTC)Reply
Already sort-of done with {{linkstatus}}, so the bot probably won't run. I plan to keep the flag though <evil grin>  — Mike.lifeguard | @en.wb 22:56, 11 August 2008 (UTC)Reply

Renaming the blacklist should be done at some point in the future; we'll have to wait on Brion for that. Until then, I'd like to have this page and it's archives __NOINDEX__ed. Having it indexed causes more issues than it solves & we now have an easy way to remedy the situation. We should review this when the blacklist is renamed.  — Mike.lifeguard | @en.wb 02:55, 14 August 2008 (UTC)Reply

The Logs

log system

I would like to consolidate our logs into one system which uses subpages and transclusions to make things easy. Each month would get a subpage, which is then transcluded onto Spam blacklist/Log so they can easily be searched. This would mean merging Nakons "log entries" into the main log, and including the pre-2008 log. This wouldn't require much change in how we log things.

However, I wonder what people think about also logging removals and/or changes to the regexes. Currently, we don't keep track of those in any systematic way, but I think we should. For example, I consolidated a few regexes a while back, and simply made the old log entries match the new regexes, which is rather Orwellian. Similarly, we simply remove log entries when we remove domains - nothing is added to the log, so we cannot track this easily. This idea (changing the way we log things) is likely going to require some discussion; I don't think there should be any problem moving to transcluded subpages immediately.

 — Mike.lifeguard | @en.wb 14:41, 6 August 2008 (UTC)Reply

I'm all for using one system for the logs. I'm not sure about your second idea though. Is the log intended purely to explain the current entries or also former entries and perhaps even edits? Logging removals would be a good idea to see if a domain was once listed, but logging changes seems too bureaucratic. Matching the log entries with the new regexes might be Orwellian, but it's also pragmatic. What are the advantages of logging changes? Could you perhaps give an example of how you suggest to log changes? --Erwin(85) 18:16, 6 August 2008 (UTC)Reply
I should say I mean "Orwellian" without the connotative value. The denotative value is simply that the current method is "changing history" - not in and of itself a bad thing. Indeed, I've had no issues with this, hence the speculative nature of that part of my suggestion.  — Mike.lifeguard | @en.wb 19:48, 6 August 2008 (UTC)Reply
in de:WP:SBL we do log all new entries, removals and changes on black- and whitelists. logging changes can be useful e.g. for retracing old discussions. -- seth 01:35, 7 August 2008 (UTC)Reply
i think, that the transclusions are a good idea to keep the traffic low. is anybody against that?
concerning the logging of removals/modifications: what do you think about a log system like de:Wikipedia:Spam-blacklist/log#Mai_2008? -- seth 12:12, 13 August 2008 (UTC)Reply
It would be quite some work to link the diffs, but I'm not against using it. I guess that means this is a weak support. --Erwin(85) 09:35, 19 August 2008 (UTC)Reply

if everyone else continues ignoring the suggestions till tomorrow, i will start realizing Miks.lifguards idea by creating subpages like

apart from that i'd like to know...

  1. which components/tools are dependent on the sbl-log-syntax/-format?
  2. am i right, that there is no meta-whitelist? will there ever be one?
  3. would it be ok to switch from the old log-syntax to a new one, whithout converting the old log-entries?

-- seth 10:23, 23 August 2008 (UTC)Reply

Please use subpages; I changed your examples above. There is no global whitelist, no. But in the future? Perhaps something to request. I imagine leaving old logs will be fine. Are we sure we want to log changes to regexes? I'm not sure whether that's really necessary. It also raises the already-high bar to contributing in this area. Our procedures are opaque enough as it is - this is one more hoop we are making potential recruits to the anti-spam team jump through.  — Mike.lifeguard | @en.wb 22:12, 23 August 2008 (UTC)Reply
whitelist: i guess, a global whitelist would not be necessary, because blacklist entries usually can be modified by plain regexp-syntax to match all example.org except example.org/good. such a blacklist entry would be
example\.org(?!/good)
however, there may be cases, where a explicite whitelist entry would be better human-readable.
leaving old logs: if removals shall be logged, how shall they be logged? just by comment?
log changes: the main reasons why i am asking are #double.2Fwrong_entries and #double entries. if it was ok to remove bugs, syntax-optimizations and double entries without logging it, it would be less work for me. ;-) -- seth 22:50, 23 August 2008 (UTC)Reply
I guess so, but if you see something suspicious please check if it can really be removed. Using the new syntax is OK with me. Logging removals like on dewiki looks good. --Erwin(85) 10:19, 24 August 2008 (UTC)Reply

at least the splitting is done. [6] -- seth 09:37, 25 August 2008 (UTC)Reply

Thanks for taking care of the logs; I think that will work much better.
I'm not sure whether I'm happy with having regexes consolidated as you've done. Within each set of additions, one should try to be concise with your regexes, but I don't think merging all the blogspot ones together is necessarily a good idea. This will make future removals more difficult. In case you forget, not all are as proficient with regex as you, myself included!  — Mike.lifeguard | @en.wb 02:14, 29 August 2008 (UTC)Reply
first of all: i did not merge blogspot entries. the very long blogspot line had existed before my "big" edit. ;-)
merging all blogspot-links in one line would probably be not a good idea, because of performance reasons (the extension builds the regexps in 4k-blocks) and because of COIBot, which now allows a maximum line length of 1k chars.
(not to be misunderstood: grouping regexps increases performance, but lines >1k will lead to problems)
i grouped only a few regexps and only if they were "near" together in the SBL and had no different headings. as regexp grouping is used already, i didn't think that would be difficult to read. the largest grouping i did at the beginning and in lines 3300-3500, see [7]. was that too much?
concerning the logging: afaics we all want to log removals, too, don't we? but if i didn't get you wrong, you don't want to change the log-syntax. so i don't understand how you want SBL removals to be logged? :-) -- seth 09:43, 29 August 2008 (UTC)Reply
My mistake on the blogspot one then. I've said nothing about not changing the log format - feel free to do so in order to log both additions and removals - the template you would want to change is {{sbl-log}} and the "snippet" at the top of this page.  — Mike.lifeguard | @en.wb 18:01, 29 August 2008 (UTC)Reply
oh, ok. i missunderstood "I imagine leaving old logs will be fine." -- seth 09:50, 31 August 2008 (UTC)Reply
Afaics sbl-log does not need to be changed. To keep the log syntax somehow downwards compatible, it will suffice to change the syntax like this:
example\.org # name # b+ reason
where "b+" means addition on blacklist, "b-" means removal.
To keep the format more compact we could use the dewiki-style
example\.org # [SBL-diff b+] # reason
which results in something like
example\.org # b+ # reason
But this is a bit more work for the admins and gives just a small additional information (the exact date of addition/removal), so I don't know whether this is really better. Although Erwin said, the dewiki-syntax looked good and Mike.lifeguard told me to feel free, I'm not sure, if any other admin will beat me, if I change the syntax to dewiki-style. :-) -- seth 11:21, 31 August 2008 (UTC)Reply
However, i've been bold. Now we have same syntax as dewiki. -- seth 15:10, 2 September 2008 (UTC)Reply
I adapted COIBot in the XWiki reports, it now (should) say(s) (have to wait for the next report from nowdiff):
 \bexample\.org             # [SBL-diff b+] # see [[User:COIBot/XWiki/example.org]]
with (hopefully) the first # at position 40 (may have miscalculated that). Replace SBL-diff by the diff and save. It is going to be more work, but well, it is also clearer from now what happens. --Dirk Beetstra T C (en: U, T) 15:33, 2 September 2008 (UTC)Reply
Can we please keep the admin's name (and the span which was there previously)? Furthermore, when someone is using the log snippet at the top of this page, it will follow the old format.  — Mike.lifeguard | @en.wb 16:59, 2 September 2008 (UTC)Reply
I've added a snippet for logging on the actual blacklist. Take the snippet after you make an edit.
So, to log an addition, grab the snippet from this page and the snippet from the blacklist page.
For additions, use {{sbl-log|1161258#{{subst:anchorencode:Example}}}} {{sbl-diff|1161261}}
which produces request addition
For removals, use {{sbl-log|1161258#{{subst:anchorencode:Example}}}} {{sbl-diff|1161261|removal}}
which produces request removal
This should make it faster to log things, I think.  — Mike.lifeguard | @en.wb 17:27, 2 September 2008 (UTC)Reply
OK, changed it back .. we are not sure about this implementation yet (for me, it does give extra work, and IMHO does not add, a simple '+' or '-' in the logs without the actual difflink should suffice .. )? --Dirk Beetstra T C (en: U, T) 17:34, 2 September 2008 (UTC) (forgot to sign)Reply
Actually the admins name is redundant, because it is included in the diff. If all (even redundant) information is provided (like now), it makes a lot of work for the admins.
The additional information provided by the difflink is quite small (i.e. exact modification date). The simple '+'/'-' (or 'b+'/'b-') would be enough. (That's why I was asking a few lines above.) The difflink would be fully superfluous, if all admins used the edit summary line of the sbl to inform about the added/removed entry explicitly, but that it unrealistic, I know.
So which syntax shall be used? Afaics its main features must be: 1. provide important information, 2. easy to input for admins, and 3. not too hard to read for machines. I guess all above suggestions will do, so it doesn't really make a big difference, which one will be chosen.
I guess, if nodody answers, we will just continue like now. -- seth 08:22, 3 September 2008 (UTC)Reply
The admin's name isn't redundant - that is information we will want without having to look at the diff. By that logic, we would also not have whether it was an addition or removal, since that is information contained in the diff 0.o  — Mike.lifeguard | @en.wb 14:20, 3 September 2008 (UTC)Reply
The '+'/'-' is redundant, right. But it is a main information about the sbl modification. The admin's name is imho not so important. But we don't need to discuss about that small point. For me the current syntax is no problem. :-) -- seth 19:12, 3 September 2008 (UTC)Reply

tool for log searching

The simpliest way to improve searchability is to write a tool that searches the logs for you. I'm in the middle of doing so, and I'll have a working prototype in a few days. The way this would work is it would load all the pages (really does not matter where the pages are), and apply a few regex to them. This means we really don't have to merge nacon's stuff, I can just add that page to the tool. As long as the logs keep the same pattern of one entry per line, a tool is not difficult.

I don't really think logging removals is smart, we never remove entries from the logs anyway. Simpliest way is to keep the logs write only (only new entries), and have a tool list all matches. (I'm writing the tool in a manner where you will be able to put the domain in "plain", as in google.com, and it will find all the relevant entries, even if it has \bgoogle\.com\b, or some other weirdness. —— nixeagle 20:23, 6 August 2008 (UTC)Reply

lol, by accident i started writing a similar tool 2 hours ago. but i write a cli-perl-script only. until now it greps all sbl-entries (in meta-blacklist, de-blacklist and de-whitelist), which would match a given url. -- seth 01:35, 7 August 2008 (UTC)Reply
Seth, nixeagle: actually, having a tool that searches all blacklists and logs (i.e. cross-wiki) to see if it is blacklisted somewhere, and if there is a log for that would be great. IMHO, it should be 'easy' to write a tool that extracts all regexes from the page, and tries if it is possitive against a certain url that we search (and it could then be incorporated into the {{linksummary}} to easily find it ..). Or is this just what you guys are working on ;-) .. --Dirk Beetstra T C (en: U, T) 09:50, 7 August 2008 (UTC)Reply
beta version. :-) -- seth 14:38, 7 August 2008 (UTC)Reply
WONDERFUL!
one question, can you make it add 'http://' by itself (as we only put the domain in the linksummary as to prevent the blacklist to block it ..). --Dirk Beetstra T C (en: U, T) 14:48, 7 August 2008 (UTC)Reply
Thats about what I was writing. I was putting it in the framework of http://toolserver.org/~eagle/spamArchiveSearch.php where the tool retrieves the section/page and links you directly to where the item was mentioned. For logs I was working on displaying the line entry in the log as one of the results, so you would not even have to view the log page. —— nixeagle 15:11, 7 August 2008 (UTC)Reply
if you want to combine my script with that framework, i can give you the source code. but it is perl-code and it is ugly, about 110 lines. -- seth 17:00, 7 August 2008 (UTC)Reply
some pages

Suggestion for pages:

Thanks! --Dirk Beetstra T C (en: U, T) 14:48, 7 August 2008 (UTC)Reply

i had to cope with a bug in en-sbl. but now it seems to work. further suggestions? (the more lists i include, the slower the script will get.)-- seth 16:44, 7 August 2008 (UTC)Reply
I would suggest to do it progressive, first meta and en blacklist, the rest later (roughly in order of wiki-size), similar to luxo does. --Dirk Beetstra T C (en: U, T) 17:07, 7 August 2008 (UTC)Reply
i used a hash, and those don't care about the order of declaration. now it should be sorted. -- seth 22:11, 7 August 2008 (UTC)Reply

User page advertising

Another "thinking aloud" one!

I guess I come across a commercial orietated user page on Commons once a day on average. The past week has bought a "Buying cars" page, an "Insurance sales" page, a "Pool supplies" page as well as blog/software/marketing pages. I do usually run vvv's SUL tool but quite often there is nothing immediatly (the Pool suplies one cropped up on en wp a couple of days after Commons). I know en wp are often reluctant to delete such pages out of hand (which I find incredible).

I think I am probably saying should we open up a section here to allow others to watch/comment/block/delete or whatever across wikis? --Herby talk thyme 09:51, 10 August 2008 (UTC)Reply

I agree, this is a great idea, as I have also noticed spammers like this go cross-wiki to multiple projects (Wikinews/Commons, etc.) Cirt 11:40, 10 August 2008 (UTC)Reply
Agree. Others may be interested in watching only that part of our work - perhaps a transcluded subpage so it may be watched separately?  — Mike.lifeguard | @en.wb 14:03, 10 August 2008 (UTC)Reply
Sounds like the best way to proceed. Cirt 14:09, 10 August 2008 (UTC)Reply
Thanks so far - good to get other views as well but as an idea of the scale I picked these out from the last few days on Commons (all user names) -
Sungate - design advert
Totalpoolwarehouse - obvious & en wp too
Theamazingsystem - two spamvert pages "The Automated Blogging System is a Powerful SEO Technology"
Adventure show - pdf spam file
Firmefront - fr "Banque, Assurance, Gestion Alternative et Private Equity"
The Car Spy - internet car sales
DownIndustries - clothing sales
Serenityweb1 - Nicaragua tourism & en wp
Macminicover - "Dust Cover or Designer Cover for Apple Mac"
I can't instantly find the insurance sales one & I am sure another user produced a page the same as Theamazingsystem. We could do with working out the best way of presenting the info - whether the standard template is needed or whether just an SUL link would allow us a quick check on cross wiki activity?
It would be good to know if the COI bot excludes User: space and whether that may need rethinking?
Cheers --Herby talk thyme 14:35, 10 August 2008 (UTC)Reply
So far as I know, it watches only the mainspace. But Beetstra above said this could be changed.  — Mike.lifeguard | @en.wb 14:40, 10 August 2008 (UTC)Reply
Not sure what else is in the works but I think an SUL link to check activity cross-projects would be sufficient. Anything else would be above and beyond but would also be nice. Cirt 15:06, 10 August 2008 (UTC)Reply
The standard {{ipsummary}} template is pretty good but (I think) lacks the SUL link which for this kind of stuff would be useful (luxo would be a help tho I guess).
The other thing I guess would be to get agreement to lock the blatantly commercial accounts just so that they do not do a "JackPotte" on us I think. I'll maybe point a couple of people to this section. --Herby talk thyme 16:04, 10 August 2008 (UTC)Reply
As it happens I was just trying to lock an account that wasn't SUL yet. I think the concept is sound, these accounts prolly should be locked and hidden. Not sure about mechanics of implementation. ++Lar: t/c 18:39, 10 August 2008 (UTC)Reply
IPs can't have a unified account, so the SUL tool is useless. We have luxo's for that.  — Mike.lifeguard | @en.wb 16:49, 10 August 2008 (UTC)Reply
Yeah - this type really needs an SUL link I think. And we do nee to look at the best way we can lock overtly commercial accounts I think. --Herby talk thyme 16:51, 10 August 2008 (UTC)Reply
Today I also saw some spamming, by 3 accounts on Commons:Talk:Main Page, I have to say that I do agree with Herby about this here, really a nice idea on how to stop spamming at least some of it. --Kanonkas 18:29, 10 August 2008 (UTC)Reply
Good idea, Herby! If you want I can set up a tool similar to SUL:, i.e. list user pages and blocks, for IPs. Of course, other tools are possible as well. --Erwin(85) 19:33, 10 August 2008 (UTC)Reply

Today :) user:Restaurant-lumiere - restaurant spam - [8]. User page advert, series of images all with plenty of information about the restaurant in the "description". --Herby talk thyme 07:07, 11 August 2008 (UTC)Reply

Well .. enough is enough then. The linkwatchers are from now on also parsing the user namespace. --Dirk Beetstra T C (en: U, T) 10:23, 11 August 2008 (UTC)Reply
Bot is adapted for the new task. Had to tweak en:User:XLinkBot for that, but well, do I also have to add the 'Wikipedia:' namespace? --Dirk Beetstra T C (en: U, T) 10:36, 11 August 2008 (UTC)Reply
Personally I think not but others may vary?
+ User talk:Americarx - online pharmacy ads [9], Commons (images & page) & en wp page (& the en wp one had been there a long time. Caught by Kanonkas so thanks. --Herby talk thyme 10:48, 11 August 2008 (UTC)Reply
Everything that the linkwatchers parse is now getting into the database, and may trigger the XWiki functionality mechanism. We may get more work from this... some more manpower is still necessery (as there are things that I can autocatch which have been excluded this far ..). --Dirk Beetstra T C (en: U, T) 10:53, 11 August 2008 (UTC)Reply
So are we going to make a transcluded subpage etc or this will get difficult :)
+ User:Tbraustralia - spam page - "TBR Australia is the parent company to TBR Calculators and Australian Student" - [10]. --Herby talk thyme 11:09, 11 August 2008 (UTC)Reply
+ user:Housingyou - www.housingyoumakelaars.nl page & image [11]. --Herby talk thyme 17:57, 11 August 2008 (UTC)Reply

I think others already asked about this, but shouldn't this type of listing of problem cross-project spammers/userpages be moved to a subpage? Cirt 23:06, 12 August 2008 (UTC)Reply

For logging purposes, I put it on this page. I think that will work fine.  — Mike.lifeguard | @en.wb 00:46, 13 August 2008 (UTC)Reply
For me it would just be easier to find and check users with the SUL tool if it were in some unified location on a subpage, but either way is probably okay. Cirt 02:21, 13 August 2008 (UTC)Reply
Re-thought this... The original reason I had put things back on this page was for logging purposes (ie you can use the snippet at the top as normal). However, I think we can probably do with a separate page, and when things need to be blacklisted, we can do one of two things:
  1. Start a new section in proposed additions and link to the relevant oldid of the User: namespace abuse page; then log that when blacklisting (more complicated, but more transparent, which is an especially good thing for this set of cases, I think); or
  2. Add a logging snippet to the separate page (perhaps easier than the above, though less transparent).
I welcome comments either here or there.  — Mike.lifeguard | @en.wb 19:35, 20 August 2008 (UTC)Reply

Our spam filter is now blocking spam URLs in edit summaries

FYI: our spam filter now appears to block spam addresses in edit summaries even if the domain is not in the page text. I just learned this the hard way. It's probably a response to all the shock site spam recently left in edit summaries by vandals; some will crash browsers. --A. B. (talk) 07:45, 20 August 2008 (UTC)Reply

it's not a very new feature: see bugzilla:13599. -- seth 07:52, 20 August 2008 (UTC)Reply
Yes, this has been mentioned before, and is quite a nice feature.
On a not-very-related-subject, do we think it would be a good idea to make rollback exempt from the spam blacklist? Removing spam which has been blacklisted should be a task separate from vandal fighting, and vandal fighting shouldn't be hindered by necessitating the removal of blacklisted domains. I'm not sure how difficult this would be to do from a technical standpoint, but it may be worth requesting. Input definitely requested.  — Mike.lifeguard | @en.wb 19:14, 20 August 2008 (UTC)Reply
ehm, i guess, i didn't get your point. do you mean notional vandalists, who delete blacklisted links? -- seth 23:55, 20 August 2008 (UTC)Reply
I mean if a page has a blacklisted link and a vandal blanks it or otherwise vandalizes, we cannot simply revert (which would be "adding" a blacklisted link) - you must instead edit the page to remove the link. This slows down vandal fighting.  — Mike.lifeguard | @en.wb 00:25, 21 August 2008 (UTC)Reply
i see. have there been any vandalisms like that already? or would you just like to protect wikipedia pre-emtively? how should we practically (not technically) try to solve this? i guess that such vandalism can't be avoided. the block-problem could perhaps be reduced, but probably not fully avoided. -- seth 03:18, 21 August 2008 (UTC)Reply

Mike.lifeguard (talk · contribs)'s idea makes a lot of sense and I support it. We should make rollback exempt from the spam blacklist. Cirt (talk) 02:19, 3 September 2008 (UTC)Reply

I guess, if a rollback worked around the sbl, the sbl could be worked around too easy. :-) -- seth 07:50, 3 September 2008 (UTC)Reply
I don't see how. You can use rollback to rollback only - you couldn't use it to spam. Note this has been requested as bugzilla:15450.  — Mike.lifeguard | @en.wb 21:17, 3 September 2008 (UTC)Reply
If user A vandalizes and removes blacklisted (but good) links and if user B tries to undo that, then you are right. But:
If user A spams by an url, and user B adds this url to sbl and undoes user A's edits, then user A (or any other user!) may use the rollback function to respawn the blacklisted links (if rollbacks were not harmed by sbl). -- seth 23:01, 3 September 2008 (UTC)Reply
True, but then the spammer would have to get access to rollback (and I would very quickly indef block that userif he manages to do this ..) .. but I think it is better to remove blacklisted links from revisions, as for other users they do give 'problems', having the link on the blacklist before we clean is merely handy to avoid disruption, it does not mean we don't have to clean anymore. --Dirk Beetstra T C (en: U, T) 09:26, 4 September 2008 (UTC)Reply
There are more reasons for the fix of bug 1505; see e.g. bugzilla:14091.
Another example: In dewiki we had some spamming of an url, say www.example.org/something. Actually this url is good, but shall only be linked in article foo. However, some users posted that link always in some other articles, so this link was put to sbl and deleted from all pages execept talk pages and article foo. Because of the fix of bug 1505 the problem could be solved in that way. -- seth 10:23, 4 September 2008 (UTC)Reply
Yes, that is true, but if 'foo' gets vandalised, and the link gets broken, you can't repair it without removing the link, or whitelisting the specific link. So it still needs removing, even if it is supposed to be there. It would be nice that we could still use rollback to revert then (overriding blacklist notice), but it does not actually solve the problem. The good thing is that we can now first blacklist a link, and then remove, we don't first have to try and keep up with a spammer in removing links before we blacklist, that is the only gain of this feature. --Dirk Beetstra T C (en: U, T) 14:37, 4 September 2008 (UTC)Reply
If a page gets vandalized and a link gets broken (i guess, in dewiki we did not have such a case yet), one can temporarily comment the sbl entry. So we are again at the beginning... All possibilities have their advantages and disadvantages. In dewiki the vandalized blacklisted links are no problem, are they a serious problem in enwiki (or somewhere else)? -- seth 14:57, 4 September 2008 (UTC)Reply
That I don't know. I expect that relatively new editors will simply follow the suggestion and remove the link, not knowing where to complain. But I do think that the proper solution is to whitelist such links anyway, then the problem is simply never occuring. --Dirk Beetstra T C (en: U, T) 15:00, 4 September 2008 (UTC)Reply
No, blacklisted links should be removed in general. In cases where there is legitimate use, whitelisting should be used. However that is a separate issue from reverting vandalism. Removing bad links is a separate task from reverting vandalism - we shouldn't slow vandal fighters doing one job by forcing them to do another, unrelated job as well.  — Mike.lifeguard | talk 15:36, 4 September 2008 (UTC)Reply
The thing is, if the link is blacklisted, but wanted, and a vandal damages the link, you need an administrator to repair the page completely. When the link is on the whitelist that can also be done by non-admins, who do a lot of antivandalism and antispam work as well (and for us cross-wiki spam fighters, I am an admin on en and meta, but if this happens on nl, de, whereever (I am thinking about link-high-jacking, e.g.), I need to find someone on those wikis to repair it). (Whee, I have just found another reason why global admins should be GLOBAL). --Dirk Beetstra T C (en: U, T) 15:41, 4 September 2008 (UTC)Reply

Single wiki spam - opinions?

Over the past couple of days I have dealt with two spammers on Commons (related) & one on en wp. The links they are placing are of no value to the project whatsoever. In the case of the Commons ones the links can be seen here (+deleted ones) & here (sorry - deleted only). For the en wp one here (again - deleted only, all Chinese websites).

I do know that this list is solely for blacklisting cross wiki spamming however these seem to be of no value & I would be surprised if they did not try again (I have not had time to check cross wiki to be fair). I'd be inclined to list them but other opinions would be great. Cheers --Herby talk thyme 07:19, 6 September 2008 (UTC)Reply

Well, it depends. I'm not against using the global blacklist if the link is clearly of no use, has been spammed a lot on a single project and could be spammed on other projects. Cases in which the URL is limited to one project and not expected to be spammed on other projects should be dealt with locally though. If I look at the Dutch Wikipedia's blacklist we have some Dutch URL's which probably won't be used on other projects, so there's no need to blacklist them here. --Erwin(85) 08:31, 6 September 2008 (UTC)Reply
Agreed - when there is some strong indication that the spam will be cross-wiki it is appropriate to consider global blacklisting. I would be cautious in doing so, but yes it is worth discussing on a case-by-case basis.  — Mike.lifeguard | talk 14:25, 6 September 2008 (UTC)Reply
I think there are two times when single-wiki spam should always be blacklisted:
  • URL redirects
  • Site with malware, viruses, etc.
There are some others that are judgement calls. Thousands of non-Wikimedia wikis using Media-Wiki elect to use the meta blacklist in their own filtering. Our list is publicly available and we have neither control over who else chooses to use it nor responsibility for their choice. Just the same, if a site is spammed to nl.wikipedia for instance but is total junk and would be unwelcome on every non-Wikimedia wiki, then it probably makes sense to blacklist it here; it's probably getting spammed to other Dutch wikis outside the Wikimedia world. --A. B. (talk) 02:27, 7 September 2008 (UTC)Reply
Yes, spamming non-WMF sites would be one indication that they are not going to stop after one wiki. In such cases, blacklisting here may well be worthwhile. The other two cases you mention have been common practice for some time now.  — Mike.lifeguard | talk 21:11, 8 September 2008 (UTC)Reply