This is SO incredibly simple.  I missed the feature for years!  So, in an
effort to help others:

If you want to check text in PDF and TXT attachments, simply install the OCR
plugin.  You can turn the actual OCR part of the plugin off and just do
PDF and TXT extraction.  This won't convert images to text, but that's
awfully processor intensive (which is why I shied away from the whole
plugin for years).
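
As a sketch of what that looks like, using the option names from Thomas's
message quoted below: this assumes 1 switches an option on and 0 switches it
off, and that your install accepts them in this name:=value form.  Check the
plugin section of your own configuration before copying anything.

```
DoSimpleTextASSP_OCR:=1
DoPDFTextASSP_OCR:=1
DoPDFImageASSP_OCR:=0
DoImageASSP_OCR:=0
```

The first two lines turn on TXT and PDF text extraction; the last two keep
the expensive image OCR off.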



On Thu, Mar 15, 2018 at 10:37 AM, K Post <nntp.p...@gmail.com> wrote:

> Terrific.  Thank you.
>
> On Thu, Mar 15, 2018 at 2:25 AM, Thomas Eckardt <
> thomas.ecka...@thockar.com> wrote:
>
>> Do not use the image OCR processing! It consumes too many resources and
>> time. Only use the PDF text extraction.
>> The extracted text is checked against:
>>
>> HMM
>> Bayesian
>> blackRe
>> bombRe
>> ClamAV
>> FileScan
>> scriptRe
>> URIBL
>>
>>
>> process whitelisted mails
>> procWhiteASSP_OCR:=0
>> Whitelisted mails will be processed by this Plugin!
>>
>> extract text from text files
>> DoSimpleTextASSP_OCR:=0
>> The text components of attached text/html or similar files will be
>> extracted!
>>
>> extract text from pdf files
>> DoPDFTextASSP_OCR:=1
>> The text components of attached pdf files will be extracted!
>>
>> extract text from images inside pdf files
>> DoPDFImageASSP_OCR:=0
>> The text components of images inside of attached pdf files will be
>> extracted!
>>
>> extract text from attached image files
>> DoImageASSP_OCR:=0
>> The text components of attached images will be extracted!
>>
>> Before installing the plugin check the following OS calls - and solve
>> possible missing OS components and perl modules.
>>
>> perl -e "use File::Which 'which';"
>> perl -e "use File::Spec;"
>> perl -e "use Thread::Semaphore;"
>> perl -e "use PDF::OCR2;"
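
The four checks above can also be run in one go.  This is a minimal sketch,
assuming a POSIX shell; PERL_BIN and check_perl_modules are names made up for
this example and are not part of ASSP.  It only tests whether each module
loads, which is the same thing the one-liners above test.

```shell
#!/bin/sh
# Report which of the Perl modules listed above can be loaded.
# PERL_BIN is a hypothetical override for this sketch; it defaults to "perl".
PERL_BIN="${PERL_BIN:-perl}"

check_perl_modules() {
    missing=0
    for module in "$@"; do
        # "perl -MFoo::Bar -e 1" exits non-zero when Foo::Bar cannot be loaded
        # (or when the perl binary itself cannot be found).
        if "$PERL_BIN" "-M$module" -e 1 >/dev/null 2>&1; then
            echo "ok       $module"
        else
            echo "MISSING  $module"
            missing=$((missing + 1))
        fi
    done
    # The exit status is the number of modules that failed to load.
    return "$missing"
}

check_perl_modules File::Which File::Spec Thread::Semaphore PDF::OCR2 ||
    echo "Install the missing modules before enabling the plugin."
```

On Windows without a POSIX shell, running the four perl -e lines one by one
gives the same information.
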
>>
>>
>>
>>
>> Thomas
>>
>>
>>
>>
>> From:        "K Post" <nntp.p...@gmail.com>
>> To:          "ASSP development mailing list" <assp-test@lists.sourceforge.net>
>> Date:        15.03.2018 01:12
>> Subject:     Re: [Assp-test] Blocking specific content in PDF
>> ------------------------------
>>
>>
>>
>> Ah, I knew there had to be a way.  Is it stable on Windows?  I remember
>> all sorts of ImageMagick problems on win32 from many YEARS ago.
>>
>> On Wed, Mar 14, 2018 at 4:48 PM, Thomas Eckardt <
>> thomas.ecka...@thockar.com> wrote:
>> ASSP_OCR can do this.
>>
>> Thomas
>>
>>
>>
>>
>>
>> From:        "K Post" <nntp.p...@gmail.com>
>> To:          "ASSP development mailing list" <assp-test@lists.sourceforge.net>
>> Date:        14.03.2018 20:15
>> Subject:     [Assp-test] Blocking specific content in PDF
>> ------------------------------
>>
>>
>>
>>
>> We had a bunch of PDFs with links in them slip through ASSP today.
>> There's not much content in the PDFs, but some of it is consistent.  The
>> bodies of the emails pass HMM/Bayesian and vary from message to message.
>>
>> Is there a way to scan incoming PDF files for specific words and remove
>> the attachment or score if there's a hit?
>>
>> ------------------------------------------------------------------------------
>> Check out the vibrant tech community on one of the world's most
>> engaging tech sites, Slashdot.org! http://sdm.link/slashdot
>> _______________________________________________
>> Assp-test mailing list
>> Assp-test@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/assp-test
>>
>>
>>
>>
>>
>> DISCLAIMER:
>> *******************************************************
>> This email and any files transmitted with it may be confidential, legally
>> privileged and protected in law and are intended solely for the use of the
>> individual to whom it is addressed.
>> This email was multiple times scanned for viruses. There should be no
>> known virus in this email!
>> *******************************************************