Steve,
I recommend to disable all image processing in the OCR plugin (only use
the PDF-text parts). How ever, if you want to use it, you have to
controll, that imagemagik and tesseract are working well. You have to
learn tesseract to 'OCR' - the default definition files are not enough.
Keep in mind, that imagemagik will convert all images to an uncompressed
format. So a 2MB image could get a size of 30MB. Running tesseract on such
a large file, will need 100% CPU (per thread) for a long long time.
Because tesseact is called at system level, we don't have any controll
about it - this could lead in a complete stucking system, if the number of
cores is equal to the number of processed images at a time.
What I want to say is, any part of assp and the plugins could be easy
used, except the OCR plugin. Using the OCR needs an extravagant manual
expense in system design and configuring (learning) tesseract - I expect
at least one week or even more. I gave up on tesseract after a week of
work. It was processing well on standard fonts - but spammers are smart,
they never uses standard fonts and they change them too often. It is much
more easy to detect spammers with the other assp filters.
Thomas
Von: Steve Moffat <[email protected]>
An: ASSP development mailing list <[email protected]>
Datum: 25.01.2011 15:00
Betreff: Re: [Assp-test] Multiple messages sent
Ok, I seem to be able to confirm this. If I send a plain email, it gets
sent once. If I attach or embed a picture then the mail is resent
constantly.....assp_ocr is disabled and only one email gets sent.....
-----Original Message-----
From: Steve Moffat [mailto:[email protected]]
Sent: Tuesday, January 25, 2011 9:35 AM
To: '[email protected]'
Subject: [Assp-test] Multiple messages sent
OK, I'm still trying to hunt down why I am sending multiple messages per
email. I may be wrong, but it's looking like an imagemagic issue going by
the Exchange 2010 smtp log. Anyone seen this before? I have many of these
entries, and they all end at the imagemagik decoding....
Again, this is the Exchange 2010 smtp log...
2011-01-25T13:03:13.937Z,ASSP,08CD89649FC93061,15,172.16.16.5:22364,172.16.16.10:25,-,,Local
2011-01-25T13:04:15.003Z,ASSP,08CD89649FC93064,0,,172.16.16.10:25,*,,attempting
to connect
2011-01-25T13:04:15.011Z,ASSP,08CD89649FC93064,1,172.16.16.5:22396,172.16.16.10:25,+,,
2011-01-25T13:04:15.104Z,ASSP,08CD89649FC93064,2,172.16.16.5:22396,172.16.16.10:25,<,"220
mg1.optimum.bm ESMTP MailEnable Service, Version: 5.03-- ready at 01/25/11
09:04:11",
2011-01-25T13:04:15.105Z,ASSP,08CD89649FC93064,3,172.16.16.5:22396,172.16.16.10:25,>,EHLO
mail.optimum.bm,
2011-01-25T13:04:15.282Z,ASSP,08CD89649FC93064,4,172.16.16.5:22396,172.16.16.10:25,<,"250-optimum.bm
[172.16.16.10], this server offers 2 extensions",
2011-01-25T13:04:15.283Z,ASSP,08CD89649FC93064,5,172.16.16.5:22396,172.16.16.10:25,<,250-SIZE
5120000,
2011-01-25T13:04:15.283Z,ASSP,08CD89649FC93064,6,172.16.16.5:22396,172.16.16.10:25,<,250
HELP,
2011-01-25T13:04:15.284Z,ASSP,08CD89649FC93064,7,172.16.16.5:22396,172.16.16.10:25,*,920,sending
message
2011-01-25T13:04:15.284Z,ASSP,08CD89649FC93064,8,172.16.16.5:22396,172.16.16.10:25,>,MAIL
FROM:<[email protected]> SIZE=116346,
2011-01-25T13:04:15.479Z,ASSP,08CD89649FC93064,9,172.16.16.5:22396,172.16.16.10:25,<,"250
Requested mail action okay, completed",
2011-01-25T13:04:15.479Z,ASSP,08CD89649FC93064,10,172.16.16.5:22396,172.16.16.10:25,>,RCPT
TO:<[email protected]>,
2011-01-25T13:04:15.655Z,ASSP,08CD89649FC93064,11,172.16.16.5:22396,172.16.16.10:25,<,"250
Requested mail action okay, completed",
2011-01-25T13:04:15.660Z,ASSP,08CD89649FC93064,12,172.16.16.5:22396,172.16.16.10:25,>,DATA,
2011-01-25T13:04:15.848Z,ASSP,08CD89649FC93064,13,172.16.16.5:22396,172.16.16.10:25,<,354
Start mail input; end with <CRLF>.<CRLF>,
2011-01-25T13:04:16.537Z,ASSP,08CD89649FC93064,14,172.16.16.5:22396,172.16.16.10:25,<,Magick:
no decode delegate for this image format
`c:/assp/tmp/ocr_pl12959606534541image001.png' @
error/constitute.c/ReadImage/532.,
2011-01-25T13:04:16.540Z,ASSP,08CD89649FC93064,15,172.16.16.5:22396,172.16.16.10:25,-,,Local
Thanks
Steve
Steve Moffat
Operations Director
Optimum IT Solutions
Desk: 441 292 8849
Mobile: 441 292 8849
MSN IM: [email protected]
Web: http://www.optimum.bm
------------------------------------------------------------------------------
Special Offer-- Download ArcSight Logger for FREE (a $49 USD value)!
Finally, a world-class log management solution at an even better
price-free!
Download using promo code Free_Logger_4_Dev2Dev. Offer expires February
28th, so secure your free ArcSight Logger TODAY!
http://p.sf.net/sfu/arcsight-sfd2d
_______________________________________________
Assp-test mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/assp-test
------------------------------------------------------------------------------
Special Offer-- Download ArcSight Logger for FREE (a $49 USD value)!
Finally, a world-class log management solution at an even better
price-free!
Download using promo code Free_Logger_4_Dev2Dev. Offer expires
February 28th, so secure your free ArcSight Logger TODAY!
http://p.sf.net/sfu/arcsight-sfd2d
_______________________________________________
Assp-test mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/assp-test
DISCLAIMER:
*******************************************************
This email and any files transmitted with it may be confidential, legally
privileged and protected in law and are intended solely for the use of the
individual to whom it is addressed.
This email was multiple times scanned for viruses. There should be no
known virus in this email!
*******************************************************
------------------------------------------------------------------------------
Special Offer-- Download ArcSight Logger for FREE (a $49 USD value)!
Finally, a world-class log management solution at an even better price-free!
Download using promo code Free_Logger_4_Dev2Dev. Offer expires
February 28th, so secure your free ArcSight Logger TODAY!
http://p.sf.net/sfu/arcsight-sfd2d
_______________________________________________
Assp-test mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/assp-test