Re: [Nix-dev] Wiki spam

2013-11-14 Thread Cillian de Róiste
On Wed, Nov 13, 2013 at 4:12 PM, Eelco Dolstra
eelco.dols...@logicblox.comwrote:

 Hi all,

 I've done another purge of spam pages / accounts on the wiki [1].  Let's
 try to
 keep it spam-free this time :-)  If you care about making/keeping the wiki
 useful and want to help, please let me know and I can give you access
 rights to
 block spammers and all that.


This is great, thanks! I would love to join the fight against spam, my wiki
user is: Goibhniu



 I've also enabled the spam blacklist extension
 (http://www.mediawiki.org/wiki/Extension:SpamBlacklist), which might help
 a bit.

 [1] Mostly done by deleting almost all pages that had no incoming links.
 Hopefully I didn't delete any legitimate content :-)  Please make sure
 that when
 you add a page, it is linked from another article.  Otherwise it may get
 nuked
 in the future.

 --
 Eelco Dolstra | LogicBlox, Inc. | http://nixos.org/~eelco/
 ___
 nix-dev mailing list
 nix-dev@lists.science.uu.nl
 http://lists.science.uu.nl/mailman/listinfo/nix-dev

___
nix-dev mailing list
nix-dev@lists.science.uu.nl
http://lists.science.uu.nl/mailman/listinfo/nix-dev


Re: [Nix-dev] Wiki spam

2013-11-14 Thread Philip Carlsen
2013/11/13 Eelco Dolstra eelco.dols...@logicblox.com:
 Hi all,

 I've done another purge of spam pages / accounts on the wiki [1].  Let's try 
 to
 keep it spam-free this time :-)  If you care about making/keeping the wiki
 useful and want to help, please let me know and I can give you access rights 
 to
 block spammers and all that.


Count me in :-) .. I'm plcplc on the wiki.

 I've also enabled the spam blacklist extension
 (http://www.mediawiki.org/wiki/Extension:SpamBlacklist), which might help a 
 bit.

 [1] Mostly done by deleting almost all pages that had no incoming links.
 Hopefully I didn't delete any legitimate content :-)  Please make sure that 
 when
 you add a page, it is linked from another article.  Otherwise it may get nuked
 in the future.

 --
 Eelco Dolstra | LogicBlox, Inc. | http://nixos.org/~eelco/
 ___
 nix-dev mailing list
 nix-dev@lists.science.uu.nl
 http://lists.science.uu.nl/mailman/listinfo/nix-dev
___
nix-dev mailing list
nix-dev@lists.science.uu.nl
http://lists.science.uu.nl/mailman/listinfo/nix-dev


Re: [Nix-dev] Wiki spam

2013-11-13 Thread Marc Weber
Well done (Thanks!).
Pressing random page link 10 times only showed 'nix' content this time.

 Please make sure that when you add a page, it is linked from another article.
This is mentioned here: https://nixos.org/wiki/Contributing briefly now.

 help ..
I'd like to prevent spam rather then deleting spam. Thus if
If there are still problems I'd like to implement what I proposed.
The proposal is mentioned on this updated page:
https://nixos.org/wiki/TODO_-_Cleaning_up_wiki_spam

It was about introducing a custom cookie which must be set in order to
see the standard registration/login forms. That would be set by
redirecting to a simple form whose action would set the cookie.

Thus if there are still problems please let me know.

Marc Weber
___
nix-dev mailing list
nix-dev@lists.science.uu.nl
http://lists.science.uu.nl/mailman/listinfo/nix-dev


Re: [Nix-dev] Wiki spam

2013-10-05 Thread phreedom
On Tuesday, October 01, 2013 03:29:53 PM Marc Weber wrote:
 Excerpts from phreedom's message of Tue Oct 01 14:59:09 +0200 2013:
  Because both are off by default and require whitelisting,
 
 cookies off? If cookies are off how does Mediawiki remember my name
 after logging in?

it doesn't, but it provides a clear message that you need them enabled. There 
are of course better ways to track logins like session IDs, but that's another 
topic.

 Thinking about maintainability: We could also introduce a simple
 stateful proxy.
 
 user - proxy - media wiki
 
 the proxy returns special you're human login test if register new
 user form is requested. No idea which would be the best technology to
 implement it. Probbaly there are many ways.
___
nix-dev mailing list
nix-dev@lists.science.uu.nl
http://lists.science.uu.nl/mailman/listinfo/nix-dev


Re: [Nix-dev] Wiki spam

2013-10-05 Thread Marc Weber
Excerpts from phreedom's message of Sat Oct 05 18:34:13 +0200 2013:
 it doesn't, but it provides a clear message that you need them enabled. There 
 are of course better ways to track logins like session IDs, but that's 
 another 
 topic.
So all we need is Eelco telling me Please test those 20 lines of PHP
code an send me that small patch. Thus if he tells me try it I'l take
care and spend the time.

Note: Only the login page and create new user page will be affected. And
if you login you need cookies anyway.

Marc Weber
___
nix-dev mailing list
nix-dev@lists.science.uu.nl
http://lists.science.uu.nl/mailman/listinfo/nix-dev


Re: [Nix-dev] Wiki spam

2013-10-01 Thread phreedom
On Monday, September 30, 2013 08:58:41 PM Marc Weber wrote:
 Excerpts from Philip Lykke Carlsen's message of Mon Sep 30 20:38:09 +0200 
 2013:
  If it was common practise for wikis to require user registrations lay a
  money deposit as security for constructive behaviour the problem would go
  away overnight.
 
 Sure. but can't we try a less intrusive non standard way first? Such as:
 
 if create user page / edit page is requested test for our own cookie.
 
   If cookie does not exist:
 redirect to custom password protection page, if simple password gets
 typed right, set cookie, redirect to previous page (also passing
 GET/POST vars)
 
   If cookie does exists:
 behave normally?
 
 then at least we'll know whether we get spammed by bots or humans.
 Humans will suceed. Bots should fail, because they were programmed to
 spawn standarrd media wikis only.
 
 Such an implementation is  about 20-30 PHP lines or less which can be
 added to index.php ? Just tell me do it and I'll provide that code.
 We have somebody who is willing to delete the spam, what else do we
 need to win this situation?

All these cookies and javascripts tend to break secure and efficient setups :(

Long-term automated solutions to *prevent* spam don't exist for a growing 
community like 
nixos. Making it unprofitable though might work: pre-moderation(at least when 
links are 
added/modified), marking links as no-follow for search engines.

Having a git-based wiki should help to efficiently delete spam. qgit + git 
cherry-pick should 
be like an order of magnitude faster than any web interface.

___
nix-dev mailing list
nix-dev@lists.science.uu.nl
http://lists.science.uu.nl/mailman/listinfo/nix-dev


Re: [Nix-dev] Wiki spam

2013-10-01 Thread Marc Weber
 All these cookies and javascripts tend to break secure and efficient setups :(
Please make me understand why ?

My change is about adding a simple require_once 'create-user-guard.php' file.
That can be automated. And even if not, it *does solve the problem*
unless I get proven wrong. And if we do, we can be pretty sure that
humans are spamming the wiki. And that would be a step towards solving
the issue, too.

 Long-term automated solutions to *prevent* spam don't exist for a
 growing community like nixos.
Don't overengineer. Don't think today about problems which may happen
in 2 years.

Maybe its even enough to hide the string Media Wiki on each page?
You can do so by adding a simple regex post processing to apache AFAIK.

We have to

  1) document what has been tried

  2) try new cheap things to find out whether bots are spamming - then
there should be simple soultions - or whether humans are spamming.

But I'll shut up. I've offered help. I don't have access, so I cannot
change anything. If you want me to setup a demo how the addiotional non
standard password protection would look like let me know.

But let's consider trying the trivial things first before asking people
to spend money or concluding that changes are not maintainable.

Count the packages in nixpkgs to see how powerful the nixos community
actually is. We should be able to cope with such a simple problem,
unless the attackers spend much more effort than we do.

Another way to solve this issue would be not allowing to create new
users automatically. Setup a simple form:

You want to become a wiki member?
  username [ .. ]
  password [ .. ]

and process this once every 2 weeks.

This should be less effort than deleting spam, too.

Marc Weber
___
nix-dev mailing list
nix-dev@lists.science.uu.nl
http://lists.science.uu.nl/mailman/listinfo/nix-dev


Re: [Nix-dev] Wiki spam

2013-10-01 Thread phreedom
On Tuesday, October 01, 2013 02:52:14 PM Marc Weber wrote:
  All these cookies and javascripts tend to break secure and efficient
  setups :(
 Please make me understand why ?

Because both are off by default and require whitelisting, after you discover 
that something is broken. This should at least print a useful error message. A 
slightly tweaked registration form sounds like a much better idea though.

 My change is about adding a simple require_once 'create-user-guard.php'
 file. That can be automated. And even if not, it *does solve the problem*
 unless I get proven wrong. And if we do, we can be pretty sure that humans
 are spamming the wiki. And that would be a step towards solving the issue,
 too.
 
  Long-term automated solutions to *prevent* spam don't exist for a
  growing community like nixos.
 
 Don't overengineer. Don't think today about problems which may happen
 in 2 years.

I agree that a simple solution should be tried first. We'll still have a 
problem with useless web interface though :/

 Maybe its even enough to hide the string Media Wiki on each page?
 You can do so by adding a simple regex post processing to apache AFAIK.
 
 We have to
 
   1) document what has been tried
 
   2) try new cheap things to find out whether bots are spamming - then
 there should be simple soultions - or whether humans are spamming.
 
 But I'll shut up. I've offered help. I don't have access, so I cannot
 change anything. If you want me to setup a demo how the addiotional non
 standard password protection would look like let me know.
 
 But let's consider trying the trivial things first before asking people
 to spend money or concluding that changes are not maintainable.
 
 Count the packages in nixpkgs to see how powerful the nixos community
 actually is. We should be able to cope with such a simple problem,
 unless the attackers spend much more effort than we do.
 
 Another way to solve this issue would be not allowing to create new
 users automatically. Setup a simple form:
 
 You want to become a wiki member?
   username [ .. ]
   password [ .. ]
 
 and process this once every 2 weeks.
 
 This should be less effort than deleting spam, too.
 
 Marc Weber
 ___
 nix-dev mailing list
 nix-dev@lists.science.uu.nl
 http://lists.science.uu.nl/mailman/listinfo/nix-dev
___
nix-dev mailing list
nix-dev@lists.science.uu.nl
http://lists.science.uu.nl/mailman/listinfo/nix-dev


Re: [Nix-dev] Wiki spam

2013-10-01 Thread Marc Weber
Excerpts from phreedom's message of Tue Oct 01 14:59:09 +0200 2013:
 Because both are off by default and require whitelisting,
cookies off? If cookies are off how does Mediawiki remember my name
after logging in?

Thinking about maintainability: We could also introduce a simple
stateful proxy.

user - proxy - media wiki

the proxy returns special you're human login test if register new
user form is requested. No idea which would be the best technology to
implement it. Probbaly there are many ways.

Marc Weber
___
nix-dev mailing list
nix-dev@lists.science.uu.nl
http://lists.science.uu.nl/mailman/listinfo/nix-dev


Re: [Nix-dev] Wiki spam

2013-09-30 Thread Eelco Dolstra
Hi,

On 30/09/13 06:53, Philip Carlsen wrote:

 I just joined this mailing list because nix(+os) seems like a very promising
 project.
 
 But the wiki is quite filled with spam pages, which I find off-putting as a
 newcomer, as it's usually a sign of a stagnating community.
 
 Now, I could just start manually deleting pages, but surely there is a better
 way. Most of the spam seems to originate from the same 5'ish accounts, so if
 there is a way to delete pages according to user that would seem the most
 effective measure.

Unfortunately all attempts to prevent wiki spam (registration, captchas, etc.)
have been in vain, and I don't really feel like spending an hour each week
deleting spam accounts/pages.  We may be better off switching to the GitHub 
wiki.

-- 
Eelco Dolstra | LogicBlox, Inc. | http://nixos.org/~eelco/
___
nix-dev mailing list
nix-dev@lists.science.uu.nl
http://lists.science.uu.nl/mailman/listinfo/nix-dev


Re: [Nix-dev] Wiki spam

2013-09-30 Thread phreedom
On Monday, September 30, 2013 11:08:00 AM Eelco Dolstra wrote:
 Hi,
 
 On 30/09/13 06:53, Philip Carlsen wrote:
  I just joined this mailing list because nix(+os) seems like a very
  promising project.
  
  But the wiki is quite filled with spam pages, which I find off-putting as
  a
  newcomer, as it's usually a sign of a stagnating community.
  
  Now, I could just start manually deleting pages, but surely there is a
  better way. Most of the spam seems to originate from the same 5'ish
  accounts, so if there is a way to delete pages according to user that
  would seem the most effective measure.
 
 Unfortunately all attempts to prevent wiki spam (registration, captchas,
 etc.) have been in vain, and I don't really feel like spending an hour each
 week deleting spam accounts/pages.  We may be better off switching to the
 GitHub wiki.

and it also means you can edit wiki using sane tools like text editor and git
___
nix-dev mailing list
nix-dev@lists.science.uu.nl
http://lists.science.uu.nl/mailman/listinfo/nix-dev


Re: [Nix-dev] Wiki spam

2013-09-30 Thread Marc Weber
Eelco: Have a look at vim-wiki.mawercer.de I have not yet had any spam.
and if it happens its as easy as git revert. There is *no* password
protection at all. You just have to use special ?get parameter.
I know the design is not that pretty, its just a hack so that I don't
have to type and talk about the same topics again. Maybe its also much
less popular than nixos.org/wiki ...

I seriously think that bots (or humans?) are specialized on media wiki.
I think the real fix is using non standard wikis or doing anything
non-standard, such as protecting login page by htaccess and dummy password.
Something simple such as 'nixonixo' would probably be enough and more
effective than captchas. If irc knowns about it ..

Me too thinks that its bad to host spam. Additional alternatives such as
github wiki has been discussed. And they are options, too.
You can also access those git repositories easily.

The community is strong enough to handle this. We just have to make a
list of features we want and we must get started.

Some additional discussions took place on this mailinglist earlier.
How much spam does exist? Click the random page 10 times, then you'll
know.

click1: Multi-level Marketing Two Schools Of Believed Collide
click2: Women's Manner Trend Principles To Stay By
click3: Ageing Tips That Can Help You In The Process
click4: Meet The Equipment Shop Manager
click5: Picking out Straightforward Advice For beauty salon insurance

I'm stopping here, you get it, all 5 pages are spam.

I'm protecting against arbitrary spammers by adding a 1x1 px input spam
bots are likely to fill, but humans miss. Thus if such a field is non
empty I can disregard the request.

The wiki is one of the weakest parts of this project.
People spend their time on the libraries, packages and the core instead
mostly.

Marc Weber
___
nix-dev mailing list
nix-dev@lists.science.uu.nl
http://lists.science.uu.nl/mailman/listinfo/nix-dev


Re: [Nix-dev] Wiki spam

2013-09-30 Thread Domen Kožar
You don't have spam because nixos.org has pagerank 5, which makes it
attractive to spammers in a way that humans are paid to enter spam links.

Your site has pagerank of 0.


On Mon, Sep 30, 2013 at 3:12 PM, Marc Weber marco-owe...@gmx.de wrote:

 Eelco: Have a look at vim-wiki.mawercer.de I have not yet had any spam.
 and if it happens its as easy as git revert. There is *no* password
 protection at all. You just have to use special ?get parameter.
 I know the design is not that pretty, its just a hack so that I don't
 have to type and talk about the same topics again. Maybe its also much
 less popular than nixos.org/wiki ...

 I seriously think that bots (or humans?) are specialized on media wiki.
 I think the real fix is using non standard wikis or doing anything
 non-standard, such as protecting login page by htaccess and dummy password.
 Something simple such as 'nixonixo' would probably be enough and more
 effective than captchas. If irc knowns about it ..

 Me too thinks that its bad to host spam. Additional alternatives such as
 github wiki has been discussed. And they are options, too.
 You can also access those git repositories easily.

 The community is strong enough to handle this. We just have to make a
 list of features we want and we must get started.

 Some additional discussions took place on this mailinglist earlier.
 How much spam does exist? Click the random page 10 times, then you'll
 know.

 click1: Multi-level Marketing Two Schools Of Believed Collide
 click2: Women's Manner Trend Principles To Stay By
 click3: Ageing Tips That Can Help You In The Process
 click4: Meet The Equipment Shop Manager
 click5: Picking out Straightforward Advice For beauty salon insurance

 I'm stopping here, you get it, all 5 pages are spam.

 I'm protecting against arbitrary spammers by adding a 1x1 px input spam
 bots are likely to fill, but humans miss. Thus if such a field is non
 empty I can disregard the request.

 The wiki is one of the weakest parts of this project.
 People spend their time on the libraries, packages and the core instead
 mostly.

 Marc Weber
 ___
 nix-dev mailing list
 nix-dev@lists.science.uu.nl
 http://lists.science.uu.nl/mailman/listinfo/nix-dev

___
nix-dev mailing list
nix-dev@lists.science.uu.nl
http://lists.science.uu.nl/mailman/listinfo/nix-dev


Re: [Nix-dev] Wiki spam

2013-09-30 Thread Philip Lykke Carlsen
On Monday 30 September 2013 15:18:19 Domen Kožar wrote:
 You don't have spam because nixos.org has pagerank 5, which makes it
 attractive to spammers in a way that humans are paid to enter spam links.
 
 Your site has pagerank of 0.
 
 On Mon, Sep 30, 2013 at 3:12 PM, Marc Weber marco-owe...@gmx.de wrote:
  Eelco: Have a look at vim-wiki.mawercer.de I have not yet had any spam.
  and if it happens its as easy as git revert. There is *no* password
  protection at all. You just have to use special ?get parameter.
  I know the design is not that pretty, its just a hack so that I don't
  have to type and talk about the same topics again. Maybe its also much
  less popular than nixos.org/wiki ...
  
  I seriously think that bots (or humans?) are specialized on media wiki.
  I think the real fix is using non standard wikis or doing anything
  non-standard, such as protecting login page by htaccess and dummy
  password.
  Something simple such as 'nixonixo' would probably be enough and more
  effective than captchas. If irc knowns about it ..
  
  Me too thinks that its bad to host spam. Additional alternatives such as
  github wiki has been discussed. And they are options, too.
  You can also access those git repositories easily.
  
  The community is strong enough to handle this. We just have to make a
  list of features we want and we must get started.
  
  Some additional discussions took place on this mailinglist earlier.
  How much spam does exist? Click the random page 10 times, then you'll
  know.
  
  click1: Multi-level Marketing Two Schools Of Believed Collide
  click2: Women's Manner Trend Principles To Stay By
  click3: Ageing Tips That Can Help You In The Process
  click4: Meet The Equipment Shop Manager
  click5: Picking out Straightforward Advice For beauty salon insurance
  
  I'm stopping here, you get it, all 5 pages are spam.
  
  I'm protecting against arbitrary spammers by adding a 1x1 px input spam
  bots are likely to fill, but humans miss. Thus if such a field is non
  empty I can disregard the request.
  
  The wiki is one of the weakest parts of this project.
  People spend their time on the libraries, packages and the core instead
  mostly.
  
  Marc Weber
  ___
  nix-dev mailing list
  nix-dev@lists.science.uu.nl
  http://lists.science.uu.nl/mailman/listinfo/nix-dev

If it was common practise for wikis to require user registrations lay a money 
deposit as security for constructive behaviour the problem would go away 
overnight.
But it isn't, and implementing it would probably be a rather non-trivial task 
and also raise the bar to entry (due to inconvenience) for new good members 
too.

If there is an effective interface (available to users) I don't mind spending 
some time tonight deleting spam..

-plc
___
nix-dev mailing list
nix-dev@lists.science.uu.nl
http://lists.science.uu.nl/mailman/listinfo/nix-dev


Re: [Nix-dev] Wiki spam

2013-09-30 Thread Marc Weber
Excerpts from Philip Lykke Carlsen's message of Mon Sep 30 20:38:09 +0200 2013:
 If it was common practise for wikis to require user registrations lay a money 
 deposit as security for constructive behaviour the problem would go away 
 overnight.
Sure. but can't we try a less intrusive non standard way first? Such as:

if create user page / edit page is requested test for our own cookie.

  If cookie does not exist:
redirect to custom password protection page, if simple password gets
typed right, set cookie, redirect to previous page (also passing GET/POST
vars)

  If cookie does exists:
behave normally?

then at least we'll know whether we get spammed by bots or humans.
Humans will suceed. Bots should fail, because they were programmed to
spawn standarrd media wikis only.

Such an implementation is  about 20-30 PHP lines or less which can be
added to index.php ? Just tell me do it and I'll provide that code.
We have somebody who is willing to delete the spam, what else do we
need to win this situation?

Marc Weber
___
nix-dev mailing list
nix-dev@lists.science.uu.nl
http://lists.science.uu.nl/mailman/listinfo/nix-dev