Indeed. The script eliminated some tens of thousands of spam pages among only ~400 actual content pages. It was not perfect (there were still a few pages that had spam on them), but it definitely worked amazingly and did not have any false positives that I am aware of.
*--* *Tyler Romeo* Stevens Institute of Technology, Class of 2015 Major in Computer Science www.whizkidztech.com | [email protected] On Sat, Oct 6, 2012 at 9:51 AM, Yury Katkov <[email protected]> wrote: > I think that it's possible to make the tool more generic. Can you put > the scripts on a repository? I'm giving a talk of fighting spam on > Semantic MediaWiki conference and would be glad to include your bot as > an example of solution of this common problem. > ----- > Yury Katkov > > > > On Sat, Oct 6, 2012 at 5:42 PM, John <[email protected]> wrote: > > His wiki is clean, Ive found that the scripts require tweaking for each > wiki > > > > On Sat, Oct 6, 2012 at 9:21 AM, Yury Katkov <[email protected]> > wrote: > >> Tyler, how are the results? John, can you upload it on some > >> repository? Google code, github? > >> > >> P.S. Sorry for that super-late response, I appreciate your effort to > help! > >> ----- > >> Yury Katkov > >> > >> > >> > >> On Fri, Aug 24, 2012 at 8:56 PM, Tyler Romeo <[email protected]> > wrote: > >>> I do! http://wiki.sittv.com has been building up spam for a number of > >>> months (or longer). > >>> > >>> *--* > >>> *Tyler Romeo* > >>> Stevens Institute of Technology, Class of 2015 > >>> Major in Computer Science > >>> www.whizkidztech.com | [email protected] > >>> > >>> > >>> > >>> On Fri, Aug 24, 2012 at 12:52 PM, John <[email protected]> > wrote: > >>> > >>>> Ive got a script but would like to test it before I make it public. If > >>>> someone has a site with spam and would let me test it, it would be > >>>> appreciated > >>>> > >>>> On Fri, Aug 24, 2012 at 12:20 PM, Derric Atzrott > >>>> <[email protected]> wrote: > >>>> >>Its rather easy to write in pywiki I just need some information from > >>>> >>you about your wiki. (IE are all edits after X date bad, we only > have > >>>> >>Y valid users and here are their names) exc stuff like that allows > me > >>>> >>to tailor the script to your needs. > >>>> >> > >>>> >>Can I get a link to your site? I would love to take a look and write > >>>> >>you that script, (I always love a challenge) > >>>> > > >>>> > If you make your script have some sort of configuration variables or > >>>> something > >>>> > along those lines for these different things, then you could > release it > >>>> and > >>>> > many people could be helped by it. > >>>> > > >>>> > If you do decide to release it. I would cross post to the mailing > list > >>>> for > >>>> > Mediawiki administrators as well. I'm sure someone on there could > use > >>>> it. > >>>> > > >>>> > Thank you, > >>>> > Derric Atzrott > >>>> > > >>>> > > >>>> > _______________________________________________ > >>>> > Wikitech-l mailing list > >>>> > [email protected] > >>>> > https://lists.wikimedia.org/mailman/listinfo/wikitech-l > >>>> > >>>> _______________________________________________ > >>>> Wikitech-l mailing list > >>>> [email protected] > >>>> https://lists.wikimedia.org/mailman/listinfo/wikitech-l > >>>> > >>> _______________________________________________ > >>> Wikitech-l mailing list > >>> [email protected] > >>> https://lists.wikimedia.org/mailman/listinfo/wikitech-l > _______________________________________________ Wikitech-l mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wikitech-l
