Re: Memory requirements

2006-08-09 Thread Daryl C. W. O'Shea
James Lay wrote: I have almost 500 megs of swap. And Postfix and SpamAssassin are the only things running on it. Thanks! Swap is pretty much useless for anything but programs you need running but rarely use. You can swap out a *tiny* bit of spamd, but that's about it if you don't want

Re: updates.spamassassin.org.cf overrides local.cf?

2006-08-09 Thread Daryl C. W. O'Shea
DAve wrote: Daryl C. W. O'Shea wrote: Updates are variable... they go in /var. Anywhere else wouldn't be following FHS. http://www.pathname.com/fhs/ Daryl The point is really moot. What files are in what directories doesn't really matter. It seems the idea is that anyone reading all

Re: Memory requirements

2006-08-09 Thread jdow
From: James Lay [EMAIL PROTECTED] On Mon, 7 Aug 2006 20:46:05 -0700 jdow [EMAIL PROTECTED] wrote: From: James Lay [EMAIL PROTECTED] Hey all! Anyone happen to know the memory requirements of SpamAssassin? I have 3.0.4 running on 128 Megs ok... will upgrading to 3.1.4 plus the SARE

Poor gocr results on some pics?

2006-08-09 Thread Matthias Keller
Hi I have some troubles getting good results using gocr on some of the pics that came in. Strangely Chris from the FuzzyOCR Plugin was able to scan them correctly but we didn't find out why there's so much of a difference I'm using gocr-0.40-3 on SuSE 10.1 and netpbm-10.26.12-5.4 (for

Re: Memory requirements

2006-08-09 Thread Nigel Frankcom
On Wed, 9 Aug 2006 00:52:58 -0700, jdow [EMAIL PROTECTED] wrote: From: James Lay [EMAIL PROTECTED] On Mon, 7 Aug 2006 20:46:05 -0700 jdow [EMAIL PROTECTED] wrote: From: James Lay [EMAIL PROTECTED] Hey all! Anyone happen to know the memory requirements of SpamAssassin? I have

Re: new version, please test

2006-08-09 Thread Matthias Keller
decoder wrote: decoder wrote: decoder wrote: Hello there, I have improved the original OcrPlugin (found at http://wiki.apache.org/spamassassin/OcrPlugin), so it contains fuzzy matching. Like that, mistakes made by the OCR recognition or intentional obfuscations in the text don't make

Re: Memory requirements

2006-08-09 Thread jdow
From: Nigel Frankcom [EMAIL PROTECTED] On Wed, 9 Aug 2006 00:52:58 -0700, jdow [EMAIL PROTECTED] wrote: From: James Lay [EMAIL PROTECTED] On Mon, 7 Aug 2006 20:46:05 -0700 jdow [EMAIL PROTECTED] wrote: From: James Lay [EMAIL PROTECTED] Hey all! Anyone happen to know the memory

Re: Memory requirements

2006-08-09 Thread Daryl C. W. O'Shea
Nigel Frankcom wrote: The largest factor to take into consideration is how much mail SA will be dealing with. Running a single child will be limiting, if you are getting anything more than a few hundred mails per day that hardware will be insufficient. You will either hit long delays or mail

Re: Re: Memory requirements

2006-08-09 Thread Nigel Frankcom
On Wed, 9 Aug 2006 01:10:59 -0700, jdow [EMAIL PROTECTED] wrote: From: Nigel Frankcom [EMAIL PROTECTED] On Wed, 9 Aug 2006 00:52:58 -0700, jdow [EMAIL PROTECTED] wrote: From: James Lay [EMAIL PROTECTED] On Mon, 7 Aug 2006 20:46:05 -0700 jdow [EMAIL PROTECTED] wrote: From: James Lay [EMAIL

Using a ramdisk

2006-08-09 Thread Bjorn Jensen
Can spamassassin benefit in any way from a ramdisk ? The server we have for spamassassin, has 3 gigs of ram, and spamd doesn't even use 1 gig of that, so I thought perhaps it would speed things up if I could place something on a ramdisk. But this leads to the question, does spamassassin do any

Re: Re: Memory requirements

2006-08-09 Thread jdow
From: Nigel Frankcom [EMAIL PROTECTED] {^_-} Joanne, who has a bad habit of running numbers. And I note he might be able to run two instances to get SOME benefit from paralleling the DNS lookups. Point conceded :-D jdow That's DEADLY in political arguments when I bother to

Re: Using a ramdisk

2006-08-09 Thread jdow
From: Bjorn Jensen [EMAIL PROTECTED] Can spamassassin benefit in any way from a ramdisk ? The server we have for spamassassin, has 3 gigs of ram, and spamd doesn't even use 1 gig of that, so I thought perhaps it would speed things up if I could place something on a ramdisk. But this leads to

Re: Using a ramdisk

2006-08-09 Thread Daryl C. W. O'Shea
Bjorn Jensen wrote: Can spamassassin benefit in any way from a ramdisk ? The server we have for spamassassin, has 3 gigs of ram, and spamd doesn't even use 1 gig of that, so I thought perhaps it would speed things up if I could place something on a ramdisk. But this leads to the question,

Re: Poor gocr results on some pics?

2006-08-09 Thread Loren Wilton
Here's the pic in question as original gif (I joined the parts to make it easier for gocr): http://www.matthias-keller.ch/ocrmail.gif and converted to pnm: http://www.matthias-keller.ch/ocrmail.pnm And here's what gocr -i ocrmail.pnm spits out in my case:

Re: Poor gocr results on some pics?

2006-08-09 Thread decoder
Loren Wilton wrote: Here's the pic in question as original gif (I joined the parts to make it easier for gocr): http://www.matthias-keller.ch/ocrmail.gif and converted to pnm: http://www.matthias-keller.ch/ocrmail.pnm And here's what gocr -i

Re: Using a ramdisk

2006-08-09 Thread Ramprasad
On Wed, 2006-08-09 at 10:27 +0200, Bjorn Jensen wrote: Can spamassassin benefit in any way from a ramdisk ? The server we have for spamassassin, has 3 gigs of ram, and spamd doesn't even use 1 gig of that, so I thought perhaps it would speed things up if I could place something on a ramdisk.

RE: Using a ramdisk

2006-08-09 Thread Xander
When using amavisd-new a ramdisk. Ralf Hildebrandt has set up a small page about it: http://www.stahl.bau.tu-bs.de/~hildeb/postfix/amavisd_tmpfs.shtml He says it gives some optimization. Regards, Xander From: Bjorn Jensen [EMAIL PROTECTED] Can spamassassin benefit in any way from a

Re: Using a ramdisk

2006-08-09 Thread Bjorn Jensen
Ramprasad wrote: On Wed, 2006-08-09 at 10:27 +0200, Bjorn Jensen wrote: Can spamassassin benefit in any way from a ramdisk ? The server we have for spamassassin, has 3 gigs of ram, and spamd doesn't even use 1 gig of that, so I thought perhaps it would speed things up if I could place

Re: Using a ramdisk

2006-08-09 Thread Bjorn Jensen
Daryl C. W. O'Shea wrote: Bjorn Jensen wrote: Can spamassassin benefit in any way from a ramdisk ? The server we have for spamassassin, has 3 gigs of ram, and spamd doesn't even use 1 gig of that, so I thought perhaps it would speed things up if I could place something on a ramdisk. But this

RE: Memory requirements

2006-08-09 Thread Michel Vaillancourt
jdow's plugged nickel's worth: Based on the bad case I ran his machine should do on the order of 10 to 30 seconds per email depending on the speed of his processor. At 30 seconds per that gives him the capacity, with delays to be sure, for 3000 emails per day. When they come in batched there
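The capacity figure quoted above is plain arithmetic: a scanner that needs a fixed number of seconds per message can handle at most 86,400 divided by that number of messages per day. A minimal Python sketch, assuming strictly serial processing and evenly spread arrivals (the function name `daily_capacity` is illustrative and not part of SpamAssassin):

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400


def daily_capacity(seconds_per_message: float, children: int = 1) -> int:
    """Upper bound on messages scanned per day.

    Assumes each child scans one message at a time and is never idle,
    so this is a best-case ceiling, not a prediction.
    """
    if seconds_per_message <= 0 or children < 1:
        raise ValueError("need a positive scan time and at least one child")
    return int(children * SECONDS_PER_DAY / seconds_per_message)


if __name__ == "__main__":
    for secs in (10, 30):
        print(f"{secs} s/message -> {daily_capacity(secs)} messages/day")
```

At 30 seconds per message this gives 2,880 messages per day, which matches the "3000 emails per day" estimate once rounded. Batched arrivals would queue rather than raise the ceiling.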

Re: HTML-tests good or bad?

2006-08-09 Thread Daryl C. W. O'Shea
jdow wrote: But of the British it takes a Harry Potter to spell correctly. (I know he's Canadian. But if carried too far British spelling smells, or in the past tense is fishy, smelt.) I know, like a sane person, Chris likes hockey (don't worry Chris, it starts real soon), but WTH did he

Re: Poor gocr results on some pics?

2006-08-09 Thread jdow
From: Loren Wilton [EMAIL PROTECTED] Here's the pic in question as original gif (I joined the parts to make it easier for gocr): http://www.matthias-keller.ch/ocrmail.gif and converted to pnm: http://www.matthias-keller.ch/ocrmail.pnm And here's what gocr -i ocrmail.pnm spits out in my

Re: Memory requirements

2006-08-09 Thread jdow
From: Michel Vaillancourt [EMAIL PROTECTED] jdow's plugged nickel's worth: Based on the bad case I ran his machine should do on the order of 10 to 30 seconds per email depending on the speed of his processor. At 30 seconds per that gives him the capacity, with delays to be sure, for 3000 emails

Re: HTML-tests good or bad?

2006-08-09 Thread jdow
For some reason I thought he was. If he's a United Statesian then whainheck is he spelling British fashion? {o.o} - Original Message - From: Daryl C. W. O'Shea [EMAIL PROTECTED] jdow wrote: But of the British it takes a Harry Potter to spell correctly. (I know he's Canadian. But

Spam came from users in whitelist

2006-08-09 Thread Reginaldo Bray Mendoza
Good day. I have spamassassin working with MailScanner in a redhat linux machine. Recently, we are receiving SPAM that claims to be from some users that are on whitelist and, for that reason, spamassassin marks them as NOT SPAM (user in whitelist rule scores -100). What can we do to

Re: HTML-tests good or bad?

2006-08-09 Thread John D. Hardin
On Wed, 9 Aug 2006, jdow wrote: For some reason I thought he was. If he's a United Statesian then whainheck is he spelling British fashion? Maybe he's seeking a little class? -- John Hardin KA7OHZ ICQ#15735746 http://www.impsec.org/~jhardin/ [EMAIL PROTECTED] FALaholic #11174

DEAR_SOMETHING rule scoring issue

2006-08-09 Thread Gregory T Pelle
What is the procedure to have a rule score reviewed? I have been looking over the scoring for version 3.1.x at http://spamassassin.apache.org/tests_3_1_x.html and think that a score of 1.6 is high for the DEAR_SOMETHING rule. I know that our customer support emails have the first line

outlook web access - valid emails getting marked

2006-08-09 Thread Terry
Hello, Our OWA server sits inside of our network. I just tried sending an email to a valid outside recipient. The email wasn't anything crazy but here are the tests that were run:

Re: outlook web access - valid emails getting marked

2006-08-09 Thread Benny Pedersen
On Wed, August 9, 2006 16:14, Terry wrote: Our OWA server sits inside of our network. I just tried sending an email to a valid outside recipient. The email wasn't anything crazy but here are the tests that were run:

Image spam with inline jpeg image

2006-08-09 Thread Ramprasad
All my rulesets and the LARGO rules are for catching inline png and inline gif. Now I am getting stock spams with images like --=_NextPart_001_000C_01C6BBE8.11C02650-- --=_NextPart_000_000B_01C6BBE8.11BB4450 Content-Type: image/jpeg; name=militarism.jpg Content-Transfer-Encoding:

Bayes and SQL and Vpopmail and /user + global bayes

2006-08-09 Thread Szeki - Inc
Hello, SA 3.1.4 exec /usr/bin/spamd -v -m 32 -D -q -u vpopmail -s stderr 21 I am using vpopmail installation, and use /user prefs, for /user bayes and other user conf is stored in SQL. Problem: If a mail comes in, and no real vpopmail user is present (smtproutes), then SA picks a random real

Re: Image spam with inline jpeg image

2006-08-09 Thread Dhawal Doshy
Ramprasad wrote: All my rulesets and the LARGO rules are for catching inline png and inline gif. Now I am getting stock spams with images like --=_NextPart_001_000C_01C6BBE8.11C02650-- --=_NextPart_000_000B_01C6BBE8.11BB4450 Content-Type: image/jpeg; name=militarism.jpg

Re: Image spam with inline jpeg image

2006-08-09 Thread Richard
Are you using the updated version OR the one originally posted? http://www.rulesemporium.com/plugins.htm#imageinfo can the rules_du_jour script be config'd to pickup plugin updates as well? i'd guess more than just an add to

Re: DEAR_SOMETHING rule scoring issue

2006-08-09 Thread Chris Lear
* Gregory T Pelle wrote (09/08/06 15:14): What is the procedure to have a rule score reviewed? I have been looking over the scoring for version 3.1.x at http://spamassassin.apache.org/tests_3_1_x.html and think that a score of 1.6 is high for the DEAR_SOMETHING rule. I know that

Re: updates.spamassassin.org.cf overrides local.cf?

2006-08-09 Thread Theo Van Dinter
On Wed, Aug 09, 2006 at 02:59:30AM -0400, Daryl C. W. O'Shea wrote: The point is really moot. What files are in what directories doesn't really matter. It seems the idea is that anyone reading all the documentation, and the wiki, should be able to discern what will go where, in what order,

RE: modifications done by Courier MTA confusing SpamAssassin?

2006-08-09 Thread Bowie Bailey
Wolfgang Jeltsch wrote: On Tuesday, 8 August 2006 23:04, Bowie Bailey wrote: Wolfgang Jeltsch wrote: I use Courier MTA. Courier MTA replaces certain malformed mails with mails which contain some explanation and the original mail as an attachment. The attachment of the mail

Re: DEAR_SOMETHING rule scoring issue

2006-08-09 Thread Jim Maul
Gregory T Pelle wrote: What is the procedure to have a rule score reviewed? I have been looking over the scoring for version 3.1.x at http://spamassassin.apache.org/tests_3_1_x.html and think that a score of 1.6 is high for the DEAR_SOMETHING rule. I know that our customer support emails

Re: Image spam with inline jpeg image

2006-08-09 Thread Ramprasad
http://www.rulesemporium.com/plugins.htm#imageinfo Updates: - added optimization changes by Theo Van Dinter - added jpeg support - added function image_named() - added function image_size_exact() - added function image_size_range() - added function image_to_text_ratio() - dhawal

Re: Image spam with inline jpeg image

2006-08-09 Thread MennovB
Ramprasad wrote: But still this mail is getting thru http://ecm.netcore.co.in/tmp/imagespam.txt I tested your mail here with the latest imageinfo.pm and it comes through indeed. The exact same one in .gif (same text, same background) was detected though. It was even my first and only

Re: image spam detection idea

2006-08-09 Thread Todd Merritt
I had a similar, less expensive thought; Checking the global color table in the header of all of the gif images in a particular message. I tested a couple of spam cases and the GCTs are identical in all of my limited number of test cases. Logan Shaw wrote: Looks like people have started to
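The global color table idea above can be sketched in a few lines of Python. This is only an illustration of the check being described, under the assumption that the raw GIF bytes of each attachment are already in hand; the function names are hypothetical and this is not code from any SpamAssassin plugin. The layout used is the standard GIF one: a 6-byte signature, then a 7-byte logical screen descriptor whose packed byte flags the table and encodes its size.

```python
import struct
from typing import Iterable, Optional


def global_color_table(data: bytes) -> Optional[bytes]:
    """Return the raw global color table of a GIF, or None if absent/invalid."""
    if len(data) < 13 or data[:6] not in (b"GIF87a", b"GIF89a"):
        return None
    # Logical screen descriptor: width, height, packed flags,
    # background color index, pixel aspect ratio (little-endian).
    _width, _height, packed, _bg, _aspect = struct.unpack("<HHBBB", data[6:13])
    if not packed & 0x80:  # bit 7 clear: no global color table
        return None
    size = 3 * (2 ** ((packed & 0x07) + 1))  # 3 bytes (RGB) per entry
    table = data[13:13 + size]
    return table if len(table) == size else None


def share_color_table(images: Iterable[bytes]) -> bool:
    """True if there are at least two GIFs and all have the same table."""
    tables = [global_color_table(img) for img in images]
    return (
        len(tables) > 1
        and tables[0] is not None
        and all(t == tables[0] for t in tables)
    )
```

Whether identical tables are a reliable spam signal is exactly what the post leaves open; the sketch only shows that the comparison itself is cheap, since it reads a fixed-size header and never decodes pixel data.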

RE: Image spam with inline jpeg image

2006-08-09 Thread Gary Funck
Menno wrote: Ramprasad wrote: But still this mail is getting thru http://ecm.netcore.co.in/tmp/imagespam.txt I tested your mail here with the latest imageinfo.pm and it comes through indeed. The exact same one in .gif (same text, same background) was detected though. It was even my

Re: Image spam with inline jpeg image

2006-08-09 Thread Benny Pedersen
On Wed, August 9, 2006 16:39, Richard wrote: can the rules_du_jour script be config'd to pickup plugin updates as well? i'd guess more than just an add to TRUSTED_RULESETS everyone likes to have sa-update ruledujour now :-) rules_du_jour was done when sa-update did not exist -- Benny

Re: DEAR_SOMETHING rule scoring issue

2006-08-09 Thread Loren Wilton
I could be wrong on this as i am not much of a regex expert, but it doesn't appear that this rule will trigger on normal things like Dear Jim body DEAR_SOMETHING /\bDear (?:IT\W|Internet|candidate|sirs?|madam|investor|travell?er|car shopper|web)\b/i describe DEAR_SOMETHING
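The claim about the DEAR_SOMETHING pattern can be checked directly. The sketch below copies the regular expression from the rule as quoted and runs it with Python's `re` module instead of Perl; for this particular pattern the two engines should behave the same, but that equivalence is an assumption here, and SpamAssassin applies body rules to rendered message text rather than to bare strings like these.

```python
import re

# Pattern taken verbatim from the quoted rule; /i becomes re.IGNORECASE.
DEAR_SOMETHING = re.compile(
    r"\bDear (?:IT\W|Internet|candidate|sirs?|madam|investor"
    r"|travell?er|car shopper|web)\b",
    re.IGNORECASE,
)


def hits(text: str) -> bool:
    """True if the DEAR_SOMETHING pattern matches anywhere in text."""
    return DEAR_SOMETHING.search(text) is not None


if __name__ == "__main__":
    for sample in ("Dear Jim,", "Dear Sir,", "Dear investor", "Dear Webmaster"):
        print(f"{sample!r}: {hits(sample)}")
```

A personal salutation such as "Dear Jim" does not match, because only the listed generic words are accepted after "Dear". "Dear Webmaster" also does not match, since the trailing `\b` requires "web" to end at a word boundary.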

Re: Spam came from users in whitelist

2006-08-09 Thread Stuart Johnston
Reginaldo Bray Mendoza wrote: Good day. I have spamassassin working with MailScanner in a redhat linux machine. Recently, we are receiving SPAM that claims to be from some users that are on whitelist and, for that reason, spamassassin marks them as NOT SPAM (user in whitelist rule scores

Re: Spam came from users in whitelist

2006-08-09 Thread Ninja Dude
Reginaldo Bray Mendoza wrote: What can we do to avoid this??.. is there another way we can secure our whitelist users rather than use spamassassin's whitelist_from rule??? Use whitelist_from_rcvd or whitelist_from_spf instead of just whitelist_from. whitelist_from_rcvd takes both an address

image spam where is plugin directory on FC3 using SA3.1.3

2006-08-09 Thread Obantec Support
Hi i am reading the link http://www.rulesemporium.com/plugins.htm#imageinfo then the .pm file and do not have a plugins directory. where does the .pm file go? i assume the .cf goes in /etc/mail/spamassassin and i edit v310.pre then restart spamd Mark

Re: Image spam with inline jpeg image

2006-08-09 Thread Richard
hi, can the rules_du_jour script be config'd to pickup plugin updates as well? i'd guess more than just an add to TRUSTED_RULESETS everyone likes to have sa-update ruledujour now :-) i'm sorry, i don't understand that sentence.

Re: Image spam with inline jpeg image

2006-08-09 Thread DAve
Richard wrote: hi, can the rules_du_jour script be config'd to pickup plugin updates as well? i'd guess more than just an add to TRUSTED_RULESETS everyone likes to have sa-update ruledujour now :-) i'm sorry, i don't understand that

SA to Outlook built-in junk email filter

2006-08-09 Thread Matthew V
Hi there, Server: qmail vpopmail simscan-1.2 spamassassin-3 clamav Client: Win2k/XP with Office 2003 I've been trying to get Outlook 2003 to automatically deposit mail marked by spamassassin as spam into its junk email folder. What I'm looking for is a built-in junk filter rule for Outlook

Re: Image spam with inline jpeg image

2006-08-09 Thread Theo Van Dinter
On Wed, Aug 09, 2006 at 09:58:19AM -0700, Richard wrote: rules_du_jour was done when sa-update did not exist are you implying that sa-update replaces rules-du-jour? That depends on what you mean by replaces. i thought sa-update updates the SA distro's bundled rules, but NOT any additional

Re: SA to Outlook built-in junk email filter

2006-08-09 Thread Jim Maul
Matthew V wrote: Hi there, Hello. Server: qmail vpopmail simscan-1.2 spamassassin-3 clamav good Client: Win2k/XP with Office 2003 not so good I've been trying to get Outlook 2003 to automatically deposit mail marked by spamassassin as spam into its junk email folder. What I'm

Re: Image spam with inline jpeg image

2006-08-09 Thread Spamassassin List
- Original Message - From: Gary Funck [EMAIL PROTECTED] To: users@spamassassin.apache.org Sent: Thursday, August 10, 2006 12:04 AM Subject: RE: Image spam with inline jpeg image Menno wrote: Ramprasad wrote: But still this mail is getting thru

ImageInfo plugin update via SARE/RulesDuJour ?

2006-08-09 Thread Richard
(just pulling this out of the other thread ... my bad) Are you using the updated version OR the one originally posted? http://www.rulesemporium.com/plugins.htm#imageinfo can the rules_du_jour script be config'd to pickup *plugin* updates as

sa-update vs RDJ

2006-08-09 Thread DAve
Theo Van Dinter wrote: On Wed, Aug 09, 2006 at 09:58:19AM -0700, Richard wrote: rules_du_jour was done when sa-update did not exist are you implying that sa-update replaces rules-du-jour? That depends on what you mean by replaces. i thought sa-update updates the SA distro's bundled rules,

Re: DEAR_SOMETHING rule scoring issue

2006-08-09 Thread Gregory T Pelle
Loren Wilton wrote: I could be wrong on this as i am not much of a regex expert, but it doesn't appear that this rule will trigger on normal things like Dear Jim body DEAR_SOMETHING /\bDear (?:IT\W|Internet|candidate|sirs?|madam|investor|travell?er|car shopper|web)\b/i describe

Unrecognized encodings make text rules painfully slow and give FP

2006-08-09 Thread Mark Martinec
I recently noticed a couple of cases where SA (3.1.4 or earlier) would take over a minute (instead of a few seconds) to check a 500 kB message. Investigation revealed that the cases have one thing in common: these were all message/partial chunks of a longish transfer of some document or other data.

RE: HTML-tests good or bad?

2006-08-09 Thread Chris Santerre
ROFL!! I just saw this thread! I haven't played hockey since January :( And still my spelling hasn't improved! I am in fact, a product of the US skool system ;) And if you're a Boston Bruins Hockey fan, today is a GREAT DAY!

RE: Image spam with inline jpeg image

2006-08-09 Thread Gary Funck
Theo wrote (in part): sa-update is a generic tool that lets users download channels (ie: bundles of rules/plugins) from anywhere that decides to publish them (requires a certain setup, etc.) At the moment, the only published channel that I know of is updates.spamassassin.org. (all

Re: Looking for a good Ebay whitelist

2006-08-09 Thread Logan Shaw
On Tue, 8 Aug 2006, jdow wrote: From: Logan Shaw [EMAIL PROTECTED] On Tue, 8 Aug 2006, jdow wrote: From: Logan Shaw [EMAIL PROTECTED] On Tue, 8 Aug 2006, wrote: I have been having FPs from Ebay in AU and DE, as well as [EMAIL PROTECTED] Does anybody have a good whitelist for these?

Bayesian not scoring at all - help!

2006-08-09 Thread JeanNarH
Hi, I think i've tried everything Bayesian isn't scoring, not only once, never. Never any Bayes_XX score. Here's the results for sa-learn --dump magic|grep token : 0.000 0 3 0 non-token data: bayes db version 0.000 0 4795 0 non-token

Re: Broken images in mails

2006-08-09 Thread John D. Hardin
On Wed, 9 Aug 2006, decoder wrote: Hrm. How much, if any, image processing is duplicated across the imageinfo/OCR/fuzzyOCR plugins? It might be a benefit to merge them and expose some options to control which tests are performed. Well, for example with gif, FuzzyOCR first checks what

Re: Improved OCR Plugin with approximate matching

2006-08-09 Thread Expertsites, Inc.
decoder wrote: See http://wiki.apache.org/spamassassin/FuzzyOcrPlugin Major changes: Replaced imagemagick with netpbm, support png, invoked giffix for broken gifs, detect image format with magic bytes and not by content-type, added various configuration options. Feedback is welcome :) Chris
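Detecting the image format from magic bytes, as the changelog above describes, means trusting the first few bytes of the attachment rather than its declared Content-Type. The Python sketch below illustrates that general technique only; it is not the plugin's own code (the plugin is a Perl module), and the function name and the set of formats covered are assumptions made for the example.

```python
from typing import Optional

# Well-known file signatures for the formats mentioned in these threads.
_MAGIC = (
    (b"GIF87a", "gif"),
    (b"GIF89a", "gif"),
    (b"\x89PNG\r\n\x1a\n", "png"),
    (b"\xff\xd8\xff", "jpeg"),
)


def sniff_image_format(data: bytes) -> Optional[str]:
    """Guess the image format from leading bytes; None if unrecognized."""
    for magic, name in _MAGIC:
        if data.startswith(magic):
            return name
    return None


if __name__ == "__main__":
    # A PNG payload stays a PNG no matter what the MIME header claims.
    print(sniff_image_format(b"\x89PNG\r\n\x1a\n" + b"\x00" * 8))
```

The point of the design is that a sender controls the Content-Type header and the file name, while the decoder downstream only works if the leading bytes are genuine.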

Re: DEAR_SOMETHING rule scoring issue

2006-08-09 Thread Logan Shaw
On Wed, 9 Aug 2006, Gregory T Pelle wrote: Loren Wilton wrote: I could be wrong on this as i am not much of a regex expert, but it doesn't appear that this rule will trigger on normal things like Dear Jim body DEAR_SOMETHING /\bDear

RE: Image spam with inline jpeg image

2006-08-09 Thread Logan Shaw
On Wed, 9 Aug 2006, Gary Funck wrote: Has anyone considered also supplying new rules in the form of rpm's available via a yum-compatible repository? It'd be nice to have the usual versioning and logging support as well as a central update facility. This could be done as a gateway to sa-update,

RE: DEAR_SOMETHING rule scoring issue

2006-08-09 Thread Bowie Bailey
Logan Shaw wrote: On Wed, 9 Aug 2006, Gregory T Pelle wrote: Loren Wilton wrote: I could be wrong on this as i am not much of a regex expert, but it doesn't appear that this rule will trigger on normal things like Dear Jim body DEAR_SOMETHING /\bDear

Re: Broken images in mails

2006-08-09 Thread Logan Shaw
On Wed, 9 Aug 2006, John D. Hardin wrote: Could the image-size calculation stuff from the ImageInfo plugin be merged into this? I was envisioning all of those tests in a single plugin, with configuration options to control whether or not the OCR itself (fuzzy or not) takes place and whether the

Re: Unrecognized encodings make text rules painfully slow and give FP

2006-08-09 Thread Mark Martinec
I recently noticed a couple of cases where SA (3.1.4 or earlier) would take over a minute (instead of few seconds) to check a 500 kB message. ... ...In other words, a single user can, at a click of a button, bring down a dual processor mailer with SA for two hours, by sending a 70 MB video

RE: Image spam with inline jpeg image

2006-08-09 Thread Bowie Bailey
Logan Shaw wrote: On Wed, 9 Aug 2006, Gary Funck wrote: Has anyone considered also supplying new rules in the form of rpm's available via a yum-compatible repository? It'd be nice to have the usual versioning and logging support as well as a central update facility. This could be done

Re: Improved OCR Plugin with approximate matching

2006-08-09 Thread decoder
Expertsites, Inc. wrote: decoder wrote: See http://wiki.apache.org/spamassassin/FuzzyOcrPlugin Major changes: Replaced imagemagick with netpbm, support png, invoked giffix for broken gifs, detect image format with magic bytes and not by

Re: Improved OCR Plugin with approximate matching

2006-08-09 Thread Spamassassin List
decoder wrote: See http://wiki.apache.org/spamassassin/FuzzyOcrPlugin Major changes: Replaced imagemagick with netpbm, support png, invoked giffix for broken gifs, detect image format with magic bytes and not by content-type, added various configuration options. I install the above plugin,

Re: Unrecognized encodings make text rules painfully slow and give FP

2006-08-09 Thread Kris Deugau
Mark Martinec wrote: ...In other words, a single user can, at a click of a button, bring down a dual processor mailer with SA for two hours, by sending a 70 MB video clip, and his Outlook Express will happily do the dirty work for him, splitting a mail into 190 chunks. What does a user know, he

Re: Improved OCR Plugin with approximate matching

2006-08-09 Thread decoder
Spamassassin List wrote: decoder wrote: See http://wiki.apache.org/spamassassin/FuzzyOcrPlugin Major changes: Replaced imagemagick with netpbm, support png, invoked giffix for broken gifs, detect image format with magic bytes and not by

Re: Improved OCR Plugin with approximate matching

2006-08-09 Thread Spamassassin List
Spamassassin List wrote: decoder wrote: See http://wiki.apache.org/spamassassin/FuzzyOcrPlugin Major changes: Replaced imagemagick with netpbm, support png, invoked giffix for broken gifs, detect image format with magic bytes and not by content-type, added various configuration options. I

Re: Improved OCR Plugin with approximate matching

2006-08-09 Thread decoder
Spamassassin List wrote: Spamassassin List wrote: decoder wrote: See http://wiki.apache.org/spamassassin/FuzzyOcrPlugin Major changes: Replaced imagemagick with netpbm, support png, invoked giffix for broken gifs, detect image format with

Re: Broken images in mails

2006-08-09 Thread Stuart Johnston
Logan Shaw wrote: On Wed, 9 Aug 2006, John D. Hardin wrote: Could the image-size calculation stuff from the ImageInfo plugin be merged into this? I was envisioning all of those tests in a single plugin, with configuration options to control whether or not the OCR itself (fuzzy or not) takes

Re: Broken images in mails

2006-08-09 Thread Theo Van Dinter
On Wed, Aug 09, 2006 at 04:42:15PM -0500, Stuart Johnston wrote: which is already handled by SA core modules. I'm assuming that SA only decodes an attachment once and reuses it for any plugin that needs it. Yes -- the decode run happens once and the result is stored in the tree node/object

Re: Broken images in mails

2006-08-09 Thread Logan Shaw
On Wed, 9 Aug 2006, Theo Van Dinter wrote: On Wed, Aug 09, 2006 at 04:42:15PM -0500, Stuart Johnston wrote: which is already handled by SA core modules. I'm assuming that SA only decodes an attachment once and reuses it for any plugin that needs it. Yes -- the decode run happens once and

Re: Memory requirements

2006-08-09 Thread John Thompson
On 2006-08-09, jdow [EMAIL PROTECTED] wrote: (I used to run SA on a 256 meg 66 MHz Pentium that was also the firewall. It was erm ahm slow, VERY slow. But it ran. This was in the 2.6.3 days give or take some.) I run SA on FreeBSD on an IBM M-Pro dual PII-400 with 512MB RAM. Sendmail, IMAP,

Re: Unrecognized encodings make text rules painfully slow and give FP

2006-08-09 Thread Justin Mason
Mark -- can you mail a *real* sample? private mail would be fine. --j. Mark Martinec writes: I recently noticed a couple of cases where SA (3.1.4 or earlier) would take over a minute (instead of a few seconds) to check a 500 kB message. Investigation revealed that the cases have one thing in

Re: Unrecognized encodings make text rules painfully slow and give FP

2006-08-09 Thread Mark Martinec
Justin, Mark -- can you mail a *real* sample? private mail would be fine. It is coming your way (private) in a minute or two. It is a real sample, the only change I made is to replace the From and To header, so that it looks like a mail from me to you, subject: promotional video ... Watch

Re: Broken images in mails

2006-08-09 Thread John D. Hardin
On Wed, 9 Aug 2006, Logan Shaw wrote: On Wed, 9 Aug 2006, Theo Van Dinter wrote: On Wed, Aug 09, 2006 at 04:42:15PM -0500, Stuart Johnston wrote: which is already handled by SA core modules. I'm assuming that SA only decodes an attachment once and reuses it for any plugin that needs it.

Re: Improved OCR Plugin with approximate matching

2006-08-09 Thread Bill Landry
- Original Message - From: Spamassassin List [EMAIL PROTECTED] To: users@spamassassin.apache.org Sent: Wednesday, August 09, 2006 2:26 PM Subject: Re: Improved OCR Plugin with approximate matching Spamassassin List wrote: decoder wrote: See

RE: Improved OCR Plugin with approximate matching

2006-08-09 Thread Rick Cooper
-Original Message- From: decoder [mailto:[EMAIL PROTECTED] Sent: Wednesday, August 09, 2006 5:31 PM To: Spamassassin List; users@spamassassin.apache.org Subject: Re: Improved OCR Plugin with approximate matching [snip] According to google, libungif seems correct for yum... If

Re: HTML-tests good or bad?

2006-08-09 Thread jdow
I've been noticing that this seems to be cropping up in an awful lot in the righting committed by younger folks. It contributes to the impression that even college graduates these days are functionally illiterate. {O.O} So says the old fartess curmudgeonette. - Original Message -

Re: Image spam with inline jpeg image

2006-08-09 Thread jdow
From: Theo Van Dinter [EMAIL PROTECTED] There's nothing stopping the SARE folks from publishing a single or a bunch of channels and getting rid of RDJ in favor of sa-update if they wanted to... There are some benefits either way I suppose, and I'm biased towards sa-update of course. :| Um,

Re: Image spam with inline jpeg image

2006-08-09 Thread jdow
From: Gary Funck [EMAIL PROTECTED] Theo wrote (in part): sa-update is a generic tool that lets users download channels (ie: bundles of rules/plugins) from anywhere that decides to publish them (requires a certain setup, etc.) At the moment, the only published channel that I know of is

Re: HTML-tests good or bad?

2006-08-09 Thread John Rudd
righting? is that the opposite of wronging? :-) On Aug 9, 2006, at 7:19 PM, jdow wrote: I've been noticing that this seems to be cropping up in an awful lot in the righting committed by younger folks. It contributes to the impression that even college graduates these days are functionally

Re: HTML-tests good or bad?

2006-08-09 Thread jdow
I'm younger than the folks that REALLY learned to spell. And I do wonder why I made that typo. Either of my parents would have had my skin for doing that. {O.O} - Original Message - From: John Rudd [EMAIL PROTECTED] To: jdow [EMAIL PROTECTED] Cc: users@spamassassin.apache.org Sent:

Re: Unrecognized encodings make text rules painfully slow and give FP

2006-08-09 Thread Loren Wilton
...In other words, a single user can, at a click of a button, bring down a dual processor mailer with SA for two hours, Sounds to me like it would be worth an entry in Bugzilla. Loren

Re: HTML-tests good or bad?

2006-08-09 Thread John D. Hardin
On Wed, 9 Aug 2006, jdow wrote: I'm younger than the folks that REALLY learned to spell. And I do wonder why I made that typo. Either of my parents would have had my skin for doing that. {O.O} *I* gave you the benefit of the doubt and assumed you'd done it on purpose for illustrative