will it not be much faster just to make a md5 sum on the image file without
thinking if it a appel or orange ? :-)
Yes, but just taking a straight sum will be sensitive to all of those small pixels which are changed by the spammers so that they have different sums, but the differences aren't visible to the human eye.

That's my point. If you drop out the lower bits of the colors, then you mostly retain what is perceptible (in color ranges) to the human, while losing those parts that a) the human wouldn't have noticed anyway, and b) throw off your sum of the image for comparison to known spam images.
Hi

You're idea is kinda interesting, but what would you do about a pic with white background, black font and some random black noise on it ?


Matt

Reply via email to