Re: image exception with FuzzyOCR??

decoder Fri, 17 Nov 2006 14:17:19 -0800

Thiago LPS wrote:

On 11/17/06, *Sietse van Zanen* <[EMAIL PROTECTED]<mailto:[EMAIL PROTECTED]>> wrote:
    To be more exact, the procedure would be:
    1.       Save the image file, and the message

    2.       Calculate the hash and delete it from the bad hash db
    with the fuzzy-find.pl script

    3.


In the body of mail marked as spam , i have the hash value...
so.. i removed this hash from hashdb...
it was happen because i didnt yet apply the Patch to only include inhasb db pictures matched as pic-spam..after removed the hash and applied the patch... the picture wasn'tinclude in the hasb db anymore..
but.. the question is: even with patch applied if some good-picture beincluded in the hashdb nothing better than a white-hashdb to solveit.. :D
im not expert with perl.. but it doesnt sounds dificult to do.. :D

I'm not sure if I understand you correctly, but FuzzyOcr 3.x has alreadya whitelist hashdb :)

And for all the others, I just checked in revision 40, which contains amodified "fuzzy-find" script, to be found at


http://fuzzyocr.own-hero.net/browser/trunk/devel/Utils/fuzzy-find

Please note that this is bleeding edge, if you want to try it out, gofor it, but backup the database first in case something breaks...

The script now features --learn-spam, and --learn-ham which willmanually add the hash of a given image file, i.e. fuzzy-find --learn-hamsomepic.gif



Best regards,

Chris

    Create an empty wordlist, or fill it with some bogus words, that
    don't appear in the image

    4.       Update the FuzzyOcr.cf file to point to the new wordlist.
    If you're using spamd don't restart, it'll keep using the correct
    wordlist. Otherwise you might want to stop incoming mail for a
    little while.

    5.       Pipe the message through FuccyOcr.pm directly, it'll put
    the hash into the known good db.

    6.       Correct the config. (and restart maild).

    7.       Send in a feature request to update the fuzzy-find.pl
    script to insert hashes into a db. ;-)

    -Sietse

    *From:* Sietse van Zanen [mailto:[EMAIL PROTECTED]
    <mailto:[EMAIL PROTECTED]>]
    *Sent:* Friday, November 17, 2006 20:09
    *To:* Thiago LPS; users@spamassassin.apache.org
    <mailto:users@spamassassin.apache.org>
    *Subject:* RE: image exception with FuzzyOCR??

    Ofcourse, save the image, calculate the hash and then use the
    fuzzy-find.pl script to delete it from the bad hash db.

    Next you'll have to use a little trick to get it into the good
    hash db, as that's not possible from the fuzzy-find.pl script.

    Simply make an empty word list and yank the image through FuzzyOcr
    again. It'll put it into the known good db.

    -Sietse

    *From:* Thiago LPS [mailto:[EMAIL PROTECTED]
    <mailto:[EMAIL PROTECTED]>]
    *Sent:* Friday, November 17, 2006 18:25
    *To:* users@spamassassin.apache.org
    <mailto:users@spamassassin.apache.org>
    *Subject:* image exception with FuzzyOCR??



    Hello everybody...

    there is a way to do a exception to some image that isn't a
    SPAM... but the FuzzyOCR thinks that it is a spam image??

    i really dont want to disable the Hashdb...





--
--------------------------------------------------
Thiago LPS
C.E.S.A.R - Administrador de Sistemas
msn: [EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]>
0xx 81 8735 2591

--------------------------------------------------

Re: image exception with FuzzyOCR??

Reply via email to