Justin Forder wrote:

There are now 744 spammed pages out of 1835. The oldest is 10th January (it's possible that there was spam before that which has now been removed). The spam is still coming, but slowly now.

I have cleaned those now. I used Watir to visit the Edit screen for every page, check the content for spam, and remove it if present.

Before doing this for real, I downloaded the current markup from hundreds of spammed pages to test the detection/correction on. I tested the interaction with the Wiki in the sandbox first, then on a small number of pages, then gradually increasing numbers. Using Watir allowed me to watch the changes being made. By 11:45 GMT today all 1px-high divs and !OK! flags had been removed.

I was driving this from the All Pages list, and it took an hour and a half (including small runs at first, and manual checking). A lot of the time was spent checking pages that contained no spam. In future it will be possible to run from the Recently Revised list, just covering the time since the previous run - this should only need a few minutes per day.

There are a few pages that are uneditable, e.g.

  http://wiki.rubyonrails.com/rails/pages/MacOSX%3Ca+href%3D

...but they will be correspondingly unspammable.

regards

  Justin

_______________________________________________
Rails-core mailing list
Rails-core@lists.rubyonrails.org
http://lists.rubyonrails.org/mailman/listinfo/rails-core

Reply via email to