Justin Forder wrote:
A new signature from yesterday:
<u style="display:none;">
...
</u>
sometimes with no semicolon after 'none'
Another spammer, another signature:
<div style="display: none;">
from 82.200.208.98
with links to online pharmacies.
13 of those.
16 spammed pages from 85.202.154.230
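As a rough illustration, both signatures above could be caught by one
pattern that tolerates optional whitespace and the optional semicolon
after 'none'. This is only a sketch: the names HIDDEN_OPEN_TAG and
has_hidden_block are made up for the example, and it is not the script
actually used for the cleanup.

```python
import re

# Matches an opening <u> or <div> tag whose style attribute hides the element.
# Whitespace around the colon and the trailing semicolon are both optional.
# (Hypothetical helper names; not taken from the original cleanup script.)
HIDDEN_OPEN_TAG = re.compile(
    r'<(u|div)\s+style\s*=\s*"display\s*:\s*none\s*;?\s*"\s*>',
    re.IGNORECASE,
)


def has_hidden_block(page_text):
    """Return True if the page contains one of the known hidden-element openers."""
    return HIDDEN_OPEN_TAG.search(page_text) is not None
```

A pattern this narrow will miss any variant that adds other style
properties or uses single quotes, so it would need widening as new
signatures turn up.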
I cleaned about 160 pages yesterday. Most were cleaned automatically;
some I did by hand while investigating new sources, and some by hand
because they were hard to match reliably. The hard cases were pages
where the pasted data was truncated: it started with a <div> but ended
in mid-URL, with no closing </div> and an incomplete <a> element. These
break the displayed page: the Edit, Back in time etc. links don't
display (Firefox). There were also some pages with the links in plain
view, with no enclosing HTML element; I haven't tried matching those
automatically yet.
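One possible way to separate the truncated pastes from the well-formed
ones is to check whether the hidden opener is ever followed by a
matching closing tag. The sketch below assumes the same two signatures
as above; classify_hidden_block and its return values are invented for
illustration, and it does not try to handle nested elements.

```python
import re

# Opener for the hidden <u>/<div> signatures (assumed pattern, see above).
OPENER = re.compile(
    r'<(u|div)\s+style\s*=\s*"display\s*:\s*none\s*;?\s*"\s*>',
    re.IGNORECASE,
)


def classify_hidden_block(page_text):
    """Return 'clean', 'closed' or 'truncated' for the first hidden opener found."""
    match = OPENER.search(page_text)
    if match is None:
        return "clean"
    tag = match.group(1).lower()
    rest = page_text[match.end():]
    # A closing tag of the same name anywhere after the opener counts as closed.
    if re.search(r"</%s\s*>" % tag, rest, re.IGNORECASE):
        return "closed"
    return "truncated"
```

Pages classified as 'truncated' could then be queued for manual review
rather than edited automatically.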
Counting URLs in the pages is working well - I shall extend this to
looking at the delta between versions, and only counting external URLs.
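A minimal sketch of that extension might look like the following. It
assumes each version is available as a plain string and that the wiki's
own host name is passed in; own_host, external_urls and
added_external_urls are hypothetical names, and the URL pattern is
deliberately crude.

```python
import re
from collections import Counter
from urllib.parse import urlparse

# Crude URL matcher: stops at whitespace, quotes, angle and square brackets.
URL_PATTERN = re.compile(r'https?://[^\s"<>\[\]]+', re.IGNORECASE)


def external_urls(text, own_host):
    """Count URLs in text whose host differs from the wiki's own host."""
    counts = Counter()
    for url in URL_PATTERN.findall(text):
        host = urlparse(url).hostname or ""
        if host.lower() != own_host.lower():
            counts[url] += 1
    return counts


def added_external_urls(old_text, new_text, own_host):
    """Return how many external URL occurrences the new version adds."""
    # Counter subtraction keeps only positive differences, so removed
    # links do not offset newly added ones.
    delta = external_urls(new_text, own_host) - external_urls(old_text, own_host)
    return sum(delta.values())
```

A version that adds more than some threshold of external URLs in a
single edit would then be a candidate for closer inspection; the
threshold itself would have to be tuned against real pages.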
I want to start automatically rolling back, rather than editing out
offending content, but this becomes tricky if the previous version is
also spammed. A single level of rollback is not always enough.
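One way to cope with that is to walk back through the history until a
version passes the spam check, instead of always stepping back exactly
one version. The sketch below assumes the versions are held in a list
ordered oldest to newest and that a spam test is supplied by the
caller; latest_clean_version and is_spammed are illustrative names
only.

```python
def latest_clean_version(versions, is_spammed):
    """Return the index of the newest version that is not spammed, or None.

    versions   -- list of page texts, oldest first (assumed layout)
    is_spammed -- caller-supplied predicate taking one page text
    """
    for index in range(len(versions) - 1, -1, -1):
        if not is_spammed(versions[index]):
            return index
    # Every stored version looks spammed; leave the decision to a human.
    return None
```

Rolling back to the returned index handles the case where several
consecutive versions are spammed, though it would also discard any
legitimate edits made on top of a spammed version.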
There is a fight going on over the contents of the RealWorldUsage page.
It's up to version 406 now, and it's been through 20 or so versions in
the last couple of days. Someone keeps replacing the content with a
bunch of visible links to online pharmacies, and others keep
rolling back to the proper content.
That's all for now.
Justin
_______________________________________________
Rails-core mailing list
Rails-core@lists.rubyonrails.org
http://lists.rubyonrails.org/mailman/listinfo/rails-core