Thiago LPS wrote:


On 11/17/06, *Sietse van Zanen* <[EMAIL PROTECTED]> wrote:

    To be more exact, the procedure would be:

    1.       Save the image file, and the message

    2.       Calculate the hash and delete it from the bad hash db
    with the fuzzy-find.pl script

    3.


The body of the mail marked as spam contains the hash value, so I removed that hash from the hash db.
It happened because I had not yet applied the patch that only adds pictures matched as pic-spam to the hash db. After I removed the hash and applied the patch, the picture was no longer added to the hash db.

But the question is: even with the patch applied, if some good picture does end up in the hash db, nothing would solve it better than a whitelist hash db. :D
I'm no Perl expert, but it doesn't sound difficult to do. :D

I'm not sure if I understand you correctly, but FuzzyOcr 3.x already has a whitelist hash db :)


And for all the others, I just checked in revision 40, which contains a modified "fuzzy-find" script, to be found at

http://fuzzyocr.own-hero.net/browser/trunk/devel/Utils/fuzzy-find

Please note that this is bleeding edge. If you want to try it out, go for it, but back up the database first in case something breaks...


The script now features --learn-spam and --learn-ham, which manually add the hash of a given image file, e.g. fuzzy-find --learn-ham somepic.gif
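
For anyone who wants to script this, the "back up first, then learn" step could be wrapped roughly as follows. This is only a sketch in Python, under a few assumptions: the hash db is a single file whose path you supply yourself, fuzzy-find is on your PATH, and the helper names (backup_file, learn_command, learn_image) are made up for illustration. The only fuzzy-find options used are the --learn-ham and --learn-spam mentioned above.

```python
import shutil
import subprocess
import time
from pathlib import Path


def backup_file(db_path: Path) -> Path:
    """Copy db_path next to itself with a timestamp suffix; return the copy."""
    stamp = time.strftime("%Y%m%d-%H%M%S")
    backup = db_path.with_name(f"{db_path.name}.{stamp}.bak")
    shutil.copy2(db_path, backup)  # copy2 also preserves timestamps
    return backup


def learn_command(image: Path, ham: bool = True) -> list:
    """Build the fuzzy-find command line (assumes fuzzy-find is on PATH)."""
    flag = "--learn-ham" if ham else "--learn-spam"
    return ["fuzzy-find", flag, str(image)]


def learn_image(db_path: Path, image: Path, ham: bool = True) -> Path:
    """Back up the hash db (path supplied by the caller), then learn the image."""
    backup = backup_file(db_path)
    # check=True raises CalledProcessError if fuzzy-find exits non-zero
    subprocess.run(learn_command(image, ham), check=True)
    return backup
```

You would call learn_image() with the path to your own hash db and the saved image, and it returns the path of the backup copy. If your db consists of more than one file, back up each of them before learning.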


Best regards,

Chris



    Create an empty wordlist, or fill it with some bogus words that
    don't appear in the image

    4.       Update the FuzzyOcr.cf file to point to the new wordlist.
    If you're using spamd, don't restart it; it'll keep using the correct
    wordlist. Otherwise you might want to stop incoming mail for a
    little while.

    5.       Pipe the message through FuzzyOcr.pm directly; it'll put
    the hash into the known good db.

    6.       Correct the config (and restart maild).

    7.       Send in a feature request to update the fuzzy-find.pl
    script to insert hashes into a db. ;-)

    -Sietse

    *From:* Sietse van Zanen [mailto:[EMAIL PROTECTED]]
    *Sent:* Friday, November 17, 2006 20:09
    *To:* Thiago LPS; users@spamassassin.apache.org
    *Subject:* RE: image exception with FuzzyOCR??

    Of course: save the image, calculate the hash, and then use the
    fuzzy-find.pl script to delete it from the bad hash db.

    Next you'll have to use a little trick to get it into the good
    hash db, as that's not possible from the fuzzy-find.pl script.

    Simply make an empty word list and yank the image through FuzzyOcr
    again. It'll put it into the known good db.

    -Sietse

    *From:* Thiago LPS [mailto:[EMAIL PROTECTED]]
    *Sent:* Friday, November 17, 2006 18:25
    *To:* users@spamassassin.apache.org
    *Subject:* image exception with FuzzyOCR??



    Hello everybody...

    Is there a way to make an exception for an image that isn't
    spam, but that FuzzyOCR thinks is a spam image?

    I really don't want to disable the hash db...





--
--------------------------------------------------
Thiago LPS
C.E.S.A.R - Administrador de Sistemas
msn: [EMAIL PROTECTED]
0xx 81 8735 2591
--------------------------------------------------
