You might try some of the rules they use for identifying email spam in  
SpamAssassin.

-Eric


On Jul 1, 2010, at 12:28 PM, Morten Wang wrote:

> On Thu, Jul 1, 2010 at 12:03 PM, S. Nunes <[email protected]> wrote:
>> I've been working on a vandalism detection tool for Wikipedia and I  
>> am
>> currently looking for a list of spam words.
>> Basically, I am looking for a list of terms typically associated with
>> vandalism or spam.
>> Is anybody aware of such resource?
>
> One place to look would be the ClueBot source, both for a list of
> words as well as some heuristics they use to battle certain typical
> vandalism cases: http://en.wikipedia.org/wiki/User:ClueBot/Source
>
>
>
> Cheers,
> Morten
>
> _______________________________________________
> Wiki-research-l mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/wiki-research-l


_______________________________________________
Wiki-research-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l

Reply via email to