[Bug 40672] Abuse filter: Increase 5% limit to allow filtering for very short posts

2012-11-05 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=40672

Matthias Mullie mmul...@wikimedia.org changed:

   What|Removed |Added

 Status|NEW |RESOLVED
 Resolution||FIXED

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 40672] Abuse filter: Increase 5% limit to allow filtering for very short posts

2012-10-23 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=40672

--- Comment #3 from Matthias Mullie mmul...@wikimedia.org 2012-10-23 13:27:08 
UTC ---
I suggest to make this configurable per Filter group.

It makes sense to treat different kinds of text (e.g. articles vs feedback)
differently.

I've pushed a couple of patches:

* https://gerrit.wikimedia.org/r/#/c/29570/ AbuseFilter change: make it
possible for other extensions to define new emergency shutdown values per
filter group
* https://gerrit.wikimedia.org/r/#/c/29569/ ArticleFeedback change: set
different values for AFT then the current AbuseFilter defaults (which are more
conservative)
* https://gerrit.wikimedia.org/r/#/c/29571/ Config change: update WMF config to
use the above method

The emergency shutdown values for regular article submission would remain
unchanged, the values for feedback would become:
- 10% rather than 5%
- sample size from 25 to 50

How does that sound?

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 40672] Abuse filter: Increase 5% limit to allow filtering for very short posts

2012-10-23 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=40672

Fabrice Florin fflo...@wikimedia.org changed:

   What|Removed |Added

 CC||bs...@wikimedia.org,
   ||rkald...@wikimedia.org

--- Comment #4 from Fabrice Florin fflo...@wikimedia.org 2012-10-23 16:40:29 
UTC ---
Thanks, Matthias, this sounds great to me!

Andrew, do these revisions work for you as well?

If so, could you please review them and/or propose edits?

Nicely done!

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 40672] Abuse filter: Increase 5% limit to allow filtering for very short posts

2012-10-22 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=40672

Fabrice Florin fflo...@wikimedia.org changed:

   What|Removed |Added

 CC||agarr...@wikimedia.org,
   ||oke...@wikimedia.org
Summary|Abuse filter for very short |Abuse filter: Increase 5%
   |posts   |limit to allow filtering
   ||for very short posts

--- Comment #2 from Fabrice Florin fflo...@wikimedia.org 2012-10-22 21:29:13 
UTC ---
Hi Andre, 

I believe we will need to make a modification to the Abuse filter extension to
increase the cutoff for disabling articles to a higher value than 5% --
possibly up to 10%. 

Right now, filter 458 only disallows posts with 2 characters or less, because
it gets automatically disabled if we try 3, 4 or 5 characters. We really want
to disallow posts with 5 characters or less ASAP, and ultimately even 10
characters or less. One way to accomplish that is to increase the value for
disabling articles.

To quote extension creator Andrew Garrett: The AbuseFilter has a special
mechanism for new filters in which filters that match more than X% of the
actions that they are compared against are disabled. It is presumed that any
filter that matches more than X% of actions is out of control. The current
value of X is 5. In order to determine whether a filter matching more than X%
of actions is actually out of control or just unlucky, we need a decent sample
size. So the minimum sample size is Y, the variable that we changed from 2 to
25.

The goal would be to have a higher cut-off for feedback than for edits -- so we
don't disrupt the current cutoff used for edits, only increase the cutoff for
feedback posts …

We are now waiting for Andrew Garrett and Matthias Mullie to offer a
recommendation on that point, as well as assess the complexity of this proposed
revision. 

If we're only talking about a couple hours of development, I think we should do
it, so we don't have to keep resetting the filters manually. I suspect that we
will need a higher limit anyway before we can deploy AFT5 to 100%.

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.
___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l