----- Original Message -----
From: "Steve [Spamassasin]" <[EMAIL PROTECTED]>
Jorge Valdes wrote:
There are multiple images in these gifs, and because the first image
is 'junk', sending this image through gocr will yield no results. The
problem is that you have to scan all images to find the text. Try
this with each image:
convert -append News.gif pnm:- | gocr -
That works a treat...
I have an updated version of the FuzzyOcr plugin that has this and
other improvements available here:
http://www.joval.info/proj/FuzzyOcr.html
Version 2.3j works much better... I'd previously been using version
2.3b for which I had an ebuild for gentoo.
One thing I have noticed, however, is a number of errors/warnings which
spamd sticks into /var/log/messages when it is started:
--
Sep 26 17:20:48 server spamd[25563]: Subroutine new redefined at
/etc/mail/spamassassin/FuzzyOcr.pm line 122.
Sep 26 17:20:48 server spamd[25563]: Subroutine parse_config redefined
at /etc/mail/spamassassin/FuzzyOcr.pm line 132.
Sep 26 17:20:49 server spamd[25563]: Subroutine finish_parsing_end
redefined at /etc/mail/spamassassin/FuzzyOcr.pm line 184.
Sep 26 17:20:49 server spamd[25563]: Subroutine dummy_check redefined at
/etc/mail/spamassassin/FuzzyOcr.pm line 288.
Sep 26 17:20:49 server spamd[25563]: Subroutine load_global_words
redefined at /etc/mail/spamassassin/FuzzyOcr.pm line 292.
Sep 26 17:20:49 server spamd[25563]: Subroutine load_personal_words
redefined at /etc/mail/spamassassin/FuzzyOcr.pm line 315.
Sep 26 17:20:49 server spamd[25563]: Subroutine max redefined at
/etc/mail/spamassassin/FuzzyOcr.pm line 343.
Sep 26 17:20:49 server spamd[25563]: Subroutine within_threshold
redefined at /etc/mail/spamassassin/FuzzyOcr.pm line 351.
Sep 26 17:20:49 server spamd[25563]: Subroutine fmt_time redefined at
/etc/mail/spamassassin/FuzzyOcr.pm line 388.
Sep 26 17:20:49 server spamd[25563]: Subroutine check_image_hash_db
redefined at /etc/mail/spamassassin/FuzzyOcr.pm line 414.
Sep 26 17:20:49 server spamd[25563]: Subroutine add_image_hash_db
redefined at /etc/mail/spamassassin/FuzzyOcr.pm line 492.
Sep 26 17:20:49 server spamd[25563]: Subroutine calc_image_hash
redefined at /etc/mail/spamassassin/FuzzyOcr.pm line 539.
Sep 26 17:20:49 server spamd[25563]: Subroutine debuglog redefined at
/etc/mail/spamassassin/FuzzyOcr.pm line 580.
Sep 26 17:20:49 server spamd[25563]: Subroutine wrong_ctype redefined at
/etc/mail/spamassassin/FuzzyOcr.pm line 590.
Sep 26 17:20:49 server spamd[25563]: Subroutine corrupt_img redefined at
/etc/mail/spamassassin/FuzzyOcr.pm line 608.
Sep 26 17:20:49 server spamd[25563]: Subroutine known_img_hash redefined
at /etc/mail/spamassassin/FuzzyOcr.pm line 626.
Sep 26 17:20:49 server spamd[25563]: Subroutine removedir redefined at
/etc/mail/spamassassin/FuzzyOcr.pm line 637.
Sep 26 17:20:49 server spamd[25563]: Subroutine fuzzyocr_check redefined
at /etc/mail/spamassassin/FuzzyOcr.pm line 657.
--
Have I somehow loaded this module twice? I didn't get these messages
until I upgraded to version 2.3j from 2.3b....
No problem here, these are just informational messages that only recently
showed up for me with the more recent versions of the FuzzyOcr plugin, as
well. However, with the two latest versions, it only gets written to the
log once during start-up, not with each image file that gets scanned like I
was seeing a few versions back.
Bill