The process of running the conversion script is an excellent opportunity to automatically catch some spam that has crept in.
There is no doubt that we have missed some vandalism cases. We are only a few humans trying to manually catch it. Also remember the problem with the diff notification that only runs every hour and we only get the most recent change. Is it possible to generate a list of vandalised pages? For example one pattern is "emmss.com". On the other hand, we could probably run some 'find | grep' commands on the server-side after the conversion. --David
