On 12/11/19 8:00 PM, Mauricio Tavares wrote: > On Wed, Dec 11, 2019 at 1:58 PM Giovanni Bechis <giova...@paclan.it> wrote: >> >> On 12/11/19 3:17 PM, Bill Cole wrote: >>> On 11 Dec 2019, at 2:39, Giovanni Bechis wrote: >>> >>>> On 12/11/19 6:21 AM, KADAM, SIDDHESH wrote: >>>>> Hi PFA... >>>>> >>>>> On 12/11/2019 12:36 AM, Giovanni Bechis wrote: >>>>>> On 12/10/19 7:49 PM, Michael Storz wrote: >>>>>> [...] >>>>>>> My copy hit >>>>>>> >>>>>>> BODY_SINGLE_WORD=1.347, HTML_IMAGE_ONLY_04=1.172, MPART_ALT_DIFF=0.79 >>>>>>> >>>>>>> not enough to mark it as spammy. >>>>> >>>> FuzzyOcr + bayes is killing this kind of emails for me: >>> >>> FuzzyOcr is unmaintained and doesn't even have an authoritative repository >>> as far as I can tell. It is computationally very expensive, to the degree >>> that it isn't safe to just add it to an existing mail system which does not >>> have a lot of idle CPU and memory capacity. >>> >> it's true that it's unmaintained but I have it running on Perl 5.28 with >> some patches and it's still useful every now and then (if you have some >> spare cpu cycles and you know what you are doing). >> A new ocr plugin could be definetely a better choice. >> Giovanni > > I asked the project owner if I could put fuzzyocr on github. He said > go for it, so it is now at https://github.com/raubvogel/FuzzyOcr. > Cool, you can grab my patches (if they are needed) here: http://cvsweb.openbsd.org/ports/mail/p5-FuzzyOcr/patches/
Giovanni