Re: Failed: PAUSE indexer report PARKER/Mail-SpamAssassin-3.3.2.tar.gz

2011-06-22 Thread Theo Van Dinter
me:
module              userid    fullname             type        owner
Mail::SpamAssassin  DOS       Daryl C. W. O'Shea   co-maint    JMASON
Mail::SpamAssassin  FELICITY  Theo Van Dinter      co-maint    JMASON
Mail::SpamAssassin  JMASON    Justin Mason         modulelist  JMASON
Mail::SpamAssassin

Re: [Bug 6536] reuse DNSWL and IADB Rules in masscheck

2011-01-23 Thread Theo Van Dinter
On Thu, Jan 20, 2011 at 12:03 PM, bugzilla-dae...@issues.apache.org wrote: The way I viewed it is if the committer is still considered active, the sandbox is off-limits without asking them first. However, perhaps this is time to consider moving some committers to non-active status. Hey folks!

Re: Pulling messages out of the masscheck corpa?

2009-06-26 Thread Theo Van Dinter
Typically you'd post the messages (user/paths) in question on dev, or mail the people directly, and they can either get you copies of the mails or otherwise try to help debug a little (if the mails are sensitive, etc.) On Fri, Jun 26, 2009 at 9:12 AM, John Hardin jhar...@impsec.org wrote: Justin

Re: How many people are still using perl 5.6.x?

2009-06-25 Thread Theo Van Dinter
fwiw, the decision to drop support for older perls isn't really about "those folks don't get support for their OS" or "it's old so let's drop it". In the "let's drop 5.0 support and require 5.6" decision process, it was about "we are having a hard time supporting the 5.0 series with numerous regexp

Re: 3.3.0 plans

2009-06-24 Thread Theo Van Dinter
fwiw, the process used to be:
- beta releases to get things stabilized
- use a beta release to do mass-check runs
- generate scores with mass-check data and submit to svn
- rc releases to get wider testing w/ scores
- release after rc appear to work
On Wed, Jun 24, 2009 at 2:11 PM, Quanah

Re: How can a plugin add message-specific headers to an email?

2009-04-13 Thread Theo Van Dinter
Plugins can't directly add headers to the message rewrite process. However, the plugin can create a tag which you can set up to be added w/ add_header. ie: $permsgstatus->{tag_data}->{FOO} = "foo"; then "add_header all Foo _FOO_" will add X-Spam-Foo to all mails. On Thu, Apr 9, 2009 at 10:48 AM,
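To make the tag mechanism above concrete, here is a toy model in Python. It is not SpamAssassin's code: expand_tags and build_header are hypothetical helpers, and the placeholder pattern is an assumption that only covers simple _NAME_ tags. It just shows the idea that a plugin stores a value under a tag name and the header template picks it up.

```python
import re

# Toy model only: these helpers are hypothetical and do not exist in SpamAssassin.
TAG = re.compile(r"_([A-Z][A-Z0-9]*)_")  # assumption: simple _NAME_ placeholders


def expand_tags(template: str, tag_data: dict) -> str:
    """Replace each _NAME_ placeholder with the value stored under NAME."""
    def lookup(match):
        # Unknown tags are left as-is in this sketch.
        return str(tag_data.get(match.group(1), match.group(0)))
    return TAG.sub(lookup, template)


def build_header(name: str, template: str, tag_data: dict) -> str:
    """Mimic 'add_header all <name> <template>' producing an X-Spam-<name> header."""
    return f"X-Spam-{name}: {expand_tags(template, tag_data)}"


# The plugin sets the tag; the config line names the header and the template.
tag_data = {"FOO": "foo"}
print(build_header("Foo", "_FOO_", tag_data))
```

The point of the indirection is that the plugin never touches the rewritten message; it only publishes data, and the site config decides whether and where that data shows up.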

Re: svn commit: r752746 - /spamassassin/site/updates/MIRRORED.BY

2009-03-12 Thread Theo Van Dinter
I had forgotten about some mirrored file bits, but then discovered them again the other day, and even found some new things this morning, so fyi here ... :) If sa-update has not been run for a given channel before, it looks up mirrors.[channel] in DNS

Re: Which Message-ID is supposed to go into Bayes 'seen' database?

2009-02-11 Thread Theo Van Dinter
I'd have to go look at the mail archives, assuming we discussed it in email and not just irc ... but I seem to recall it had to do with mails coming in w/ the same message-id and sa-learn seeing them as the same message, thereby bypassing our ability to learn tokens. Since we already generated

Re: moving JM_SOUGHT to official status

2009-01-07 Thread Theo Van Dinter
fwiw, I'm pretty +1 on the idea. I was a bit skeptical of these rules at the start (concerns about the automated nature more than anything else), but things seem to be going pretty well. I'd like to see several people w/ large corpus publish their own similar channels. The build it and they

Re: updatesd svn cert updated

2008-12-10 Thread Theo Van Dinter
On Wed, Dec 10, 2008 at 09:35:20PM -0500, Daryl C. W. O'Shea wrote: I updated the updatesd cert, but not the release cert, so things still weren't running correctly. Third time's the charm? If it works, can you send a mail about where you updated it (or if you're ambitious document it

Re: updatesd svn cert updated

2008-12-09 Thread Theo Van Dinter
I haven't touched the zones stuff in a while. sigh I updated the updatesd cert, but not the release cert, so things still weren't running correctly. Third time's the charm?

Re: updatesd svn cert updated

2008-12-07 Thread Theo Van Dinter
Apparently attempt #1 didn't take. :( I kicked it again, so hopefully ... tomorrow, woot. Looks like a problem with the rsync area, found rev 724102 but expected rev 722809 On Sat, Dec 06, 2008 at 06:26:52PM -0500, Theo Van Dinter wrote: The nightly runs weren't properly getting set up, so

updatesd svn cert updated

2008-12-06 Thread Theo Van Dinter
The nightly runs weren't properly getting set up, so I went and updated the svn certificate on updatesd. It ought to be functional tomorrow. :) On Sat, Dec 06, 2008 at 05:20:07AM -0500, Cron Daemon wrote: Looks like a problem with the rsync area, found rev 723957 but expected rev 721652

Re: turned off mass-checks, fyi

2008-11-30 Thread Theo Van Dinter
. :( On Fri, Oct 31, 2008 at 12:06:35PM -0400, Theo Van Dinter wrote: This week I noticed that my usual run took around 23h to complete, which is much (2x?) longer than usual. Poking around, it seems that my second machine starts running through its message queue and then stops at some point, leaving

Re: turned off mass-checks, fyi

2008-11-30 Thread Theo Van Dinter
. Fixed, I think, in r721962... :) On Sun, Nov 30, 2008 at 05:22:21PM -0500, Theo Van Dinter wrote: Now I need to go through and find out why the server has so many errors accessing a non-changing corpus. :(

turned off mass-checks, fyi

2008-10-31 Thread Theo Van Dinter
Just in case there's a question about where my results went... This week I noticed that my usual run took around 23h to complete, which is much (2x?) longer than usual. Poking around, it seems that my second machine starts running through its message queue and then stops at some point, leaving

Who broke mass-check?

2008-09-24 Thread Theo Van Dinter
I started seeing these the other day:
status: starting scan stage now: 2008-09-22 05:12:36
Out of memory!
mass-check: no messages to process
It could just be that I need to allocate more memory to mass-check now, but the 256M limit that I've given it has run fine

mailhost.zones.apache.org bounces from zones

2008-08-04 Thread Theo Van Dinter
I got tired of this stuff, so I removed the fallbacksmarthost option in sendmail.cf and restarted sendmail. Hopefully it'll stop mailing constantly now: - Transcript of session follows - 550 5.1.2 [EMAIL PROTECTED] Host unknown (Name server: mailhost.zones.apache.org: host not found)

Re: inconsistent DNS for updates.spamassassin.org

2008-08-04 Thread Theo Van Dinter
On Mon, Aug 04, 2008 at 07:02:13PM +0100, Tony Finch wrote: ns1.kluge.net isn't serving an up-to-date zone for spamassassin.org ns.hyperreal.org. 681717 ns1.kluge.net. 668092 Hrm. Aug 4 15:11:13 eclectic named[9975]: zone spamassassin.org/IN: Transfer started. Aug 4 15:11:13 eclectic

Re: inconsistent DNS for updates.spamassassin.org

2008-08-04 Thread Theo Van Dinter
On Mon, Aug 04, 2008 at 03:54:04PM -0400, Theo Van Dinter wrote: I'm guessing that started failing around the time I upgraded to bind 9.5.0. I'll look into it when I get some time. Aha: Aug 4 19:28:28 eclectic named[9975]: dumping master file: slave/tmp-mVQ0HgIztJ: open: permission denied

Re: [VOTE] proposed release tarballs for 3.2.5

2008-06-11 Thread Theo Van Dinter
+1 looks good here. ran tests, built rpm, installed and ran a bunch of mails through. :) On Tue, Jun 10, 2008 at 10:50:00AM +0100, Justin Mason wrote: - bug 5775: newer gpg versions require keys to be cross-certified (backsig). Did a cross-verify on our sa-update public key and re-exported

Re: ham check, Theo and Daryl

2008-03-28 Thread Theo Van Dinter
On Fri, Mar 28, 2008 at 05:35:03PM +0000, Justin Mason wrote: theo: . 1 /home/corpus/SpamAssassin/corpus/ham/personal/2008/03/05/71efdf24c3 sigh misfiled spam. removed. . 2 /home/corpus/SpamAssassin/corpus/ham/hamtrap/2008/01/29/286160f9fd Valid, I'll send you a copy via other channel.

Re: mass-check'ers need NetAddr::IP (Re: svn commit: r638750 - /spamassassin/trunk/rules/active.list)

2008-03-19 Thread Theo Van Dinter
On Wed, Mar 19, 2008 at 06:15:43AM -0400, Daryl C. W. O'Shea wrote: I'm guessing that a bunch of people need to install the newly required NetAddr::IP. Yeah, of the two mass-check machines I have, only one had it installed already. Whenever we make a required module change, we should alert the

Re: a SpamAssassin summit?

2008-03-14 Thread Theo Van Dinter
On Thu, Mar 13, 2008 at 01:22:02PM +0000, Justin Mason wrote: +1, agreed on all points there. That really is the problem with Apachecon for us. I'm +1 at this point too. At least before I could go to httpd talks and have it be work related, but (relative) $newjob isn't dealing with websites or

So who broke SA updates?

2008-02-21 Thread Theo Van Dinter
[updates]
comment = SpamAssassin updates (updates.spamassassin.org)
path = /var/www/buildbot.spamassassin.org/updatestage
read only = true
$ ls /var/www/buildbot.spamassassin.org/updatestage
/var/www/buildbot.spamassassin.org/updatestage: No such file or directory
It's related to: lrwxrwxrwx

Re: A question on rulesrc/sandbox/

2008-02-07 Thread Theo Van Dinter
On Fri, Feb 08, 2008 at 12:47:11AM +0100, Mark Martinec wrote: Are the rulesrc/sandbox/* directories shared across SVN branches? Yes. Having added my sandbox under spamassassin-trunk, it is now also seen under a spamassassin-3.2 branch. Which is fine, as my rules (e.g. in 20_rpvalid.cf) are

Re: RuleQA - dnswl.org rules

2008-01-25 Thread Theo Van Dinter
On Fri, Jan 25, 2008 at 10:28:28AM +0000, Justin Mason wrote: yep, a spam; a phish, specifically, coming via: [...] Received hdrs look like that's a botnet-infected machine. Ditto. * -0.0 RCVD_IN_DNSWL_MED RBL: Sender listed at http://www.dnswl.org/, * medium trust

Re: Upgrading to use Berkeley V4 databases?

2008-01-23 Thread Theo Van Dinter
On Wed, Jan 23, 2008 at 10:38:50PM -0800, Quanah Gibson-Mount wrote: We've noticed that SpamAssassin still uses the rather inefficient V1 BDB databases for Bayes via DB_File. Is there any move to re-work this to use the more efficient V4 database types, or is all BDB support essentially

Re: 3.2.4 remaining work

2008-01-01 Thread Theo Van Dinter
We should also finally get around to doing 3.1.10. On Tue, Jan 01, 2008 at 11:36:52AM +0000, Justin Mason wrote: ok, I'll do that today or tomorrow, depending on holiday workload ;) On Jan 1, 2008 9:20 AM, Sidney Markowitz [EMAIL PROTECTED] wrote: It looks like we are now all ready to defer

Re: sa-update failure causes rule failure

2007-12-17 Thread Theo Van Dinter
On Mon, Dec 17, 2007 at 02:20:58PM -0500, Warren Togami wrote: https://bugzilla.redhat.com/show_bug.cgi?id=392851 Is this already solved in the upcoming 3.2.4? Unless it was reported to us, I don't think it would have been looked at. It sounds like if the local state dir had to be created, and

Re: [VOTE] Release of SpamAssassin 3.1.10

2007-12-16 Thread Theo Van Dinter
Wow. This has sat around for ages! :( It doesn't look like anything got committed to fix these issues, but we should get that done and get 3.1.10 out. Sidney, do you still have the code you were working on for this? I didn't see any open bugs w/ milestone of 3.1.10, and 3.1.11 only has a

Re: Time is *way* off on SA zone machine

2007-12-12 Thread Theo Van Dinter
Yeah, it looks like it got dealt with in the last day or so. :) On Wed, Dec 12, 2007 at 02:36:15PM +0000, Justin Mason wrote: this is now fixed, right? Theo Van Dinter writes: My nightly/weekly runs have been failing because of mismatched versions (*-versions.txt says version X

Re: Time is *way* off on SA zone machine

2007-12-12 Thread Theo Van Dinter
On Wed, Dec 12, 2007 at 12:37:02PM -0500, Daryl C. W. O'Shea wrote: Really? Wouldn't the rsync code have been updated the same time apart from the version file being updated as before? It depends. :) There are different cronjobs to update the versions and the rsync area, so there's a race

Time is *way* off on SA zone machine

2007-12-09 Thread Theo Van Dinter
My nightly/weekly runs have been failing because of mismatched versions (*-versions.txt says version X, but the rsync code is version Y). I thought it may be related to the clocks, so I went to check and found this:
my machine: $ TZ=GMT date
Sun Dec 9 18:30:08 GMT 2007
zone machine: $ date
Sun

My issues w/ nightly runs, fyi ...

2007-12-06 Thread Theo Van Dinter
Hey folks, A few weeks ago, I noticed my nightly runs were not finishing in a timely manner. I didn't have time to debug the issue before going on vacation, so I shut them off while I was gone. Now that I'm back I took a look at it, and it appears the problem is that both bayes and awl are

Re: spare CPU power for nightly mass-checks?

2007-09-21 Thread Theo Van Dinter
On Fri, Sep 21, 2007 at 08:08:39PM +0100, Justin Mason wrote: Unfortunately the only offer has been a non-committer -- and I'd prefer to keep it committer-only due to confidentiality. Anyone? Yeah, I still have the machine I offered a while ago available. :) I can add a new run after my

Re: net mass-checks triggering (URI)DNSBL provider blocks?

2007-09-06 Thread Theo Van Dinter
On Thu, Sep 06, 2007 at 08:29:24AM -0400, Matt Kettler wrote: I don't think the intent was to allow the_world, merely the_committers. If the idea is to help people doing the weekly/net runs, then it's not the_committers, it's anyone_who_does_the_weekly/net_runs. And that list is an open-ended

Re: net mass-checks triggering (URI)DNSBL provider blocks?

2007-09-05 Thread Theo Van Dinter
On Wed, Sep 05, 2007 at 10:16:21PM -0400, Daryl C. W. O'Shea wrote: If we do more frequent --net mass-checks we may individually run the chance of being blocked by the providers of the (URI)DNSBLs such as Spamhaus. Has anyone been blocked to date? Probably not given the once a week

Re: net mass-checks triggering (URI)DNSBL provider blocks?

2007-09-05 Thread Theo Van Dinter

Something's up with the net run

2007-08-26 Thread Theo Van Dinter
status: starting scan stage now: 2007-08-25 05:12:15
status: completed scan stage, 202620 messages now: 2007-08-25 05:13:06
status: starting run stage now: 2007-08-25 05:13:06
status: server ready for connections

Re: [Bug 5623] New: Implement versioning of plugins

2007-08-23 Thread Theo Van Dinter
On Thu, Aug 23, 2007 at 12:33:04AM -0500, Michael Parker wrote: I wanted to revisit unloadplugin. It's been a while since I messed with the config code but I think we load plugins right away on loadplugin. What if during .pre parsing we didn't load right away, but waited until all the .pre

Re: Access RAW email from plugin in method check_end

2007-08-19 Thread Theo Van Dinter
On Sun, Aug 19, 2007 at 06:09:34PM +0200, Nicolas Fertig wrote: Is there a way to access RAW email (Full email with all headers and mime content) from a plugin in the method check_end ? Sure. sub check_end { my ($self, $params, $fulltext) = @_; # the $fulltext does not exists in this

Re: 3.1.10

2007-08-19 Thread Theo Van Dinter
On Sun, Aug 19, 2007 at 08:56:24PM -0400, Matt Kettler wrote: I think so.. Sidney made a call for Theo to make a new release candidate a week ago (8/11), but nothing happened. Yeah, I got the mails, just been so busy I haven't been able to do anything with them. We should do a 3.1.10 if it's

Re: Easier Rules Work

2007-08-18 Thread Theo Van Dinter
On Sat, Aug 18, 2007 at 02:30:11AM -0400, Kevin A. McGrail wrote: First, I'm wondering if there is an easy way that already exists to re-run spamassassin on all the mails in an mbox format file? Sure, spamassassin --mbox. :) But for rule testing, you'll want to use mass-check/hit-frequencies,

AICache still broken

2007-08-03 Thread Theo Van Dinter
fyi. - Forwarded message - [Using '-90 days' for --after setting] status: starting scan stage now: 2007-08-03 05:12:28 print() on closed filehandle CACHE at /home/corpus/SA/spamassassin-corpora/masses/../lib/Mail/SpamAssassin/AICache.pm line 162. Can't

Re: Votes for Rules Updates?

2007-08-02 Thread Theo Van Dinter
On Thu, Aug 02, 2007 at 05:54:18PM +0100, Justin Mason wrote: as far as I know, if they're rules in the rulesrc tree, it's C-T-R; but rules in the rules dir are still R-T-C. I'd be happy to loosen this up, though. I'm fine with that too. The rules dir (imo) is essentially a snapshot of the

[EMAIL PROTECTED]: Cron [EMAIL PROTECTED] ulimit -s unlimited ; nice -10 /home/felicity/SA/corpus/run-corpora-cs --server]

2007-08-01 Thread Theo Van Dinter
I was going to go through the AICache patches and make some comments, but I haven't had a chance yet. However, here's a definite comment: it's broken in my environment. I run mass-checks as both felicity and corpus. corpus only has write access to the cache file, and read-only access to the

Re: [EMAIL PROTECTED]: Cron [EMAIL PROTECTED] ulimit -s unlimited ; nice -10 /home/felicity/SA/corpus/run-corpora-cs --server]

2007-08-01 Thread Theo Van Dinter
On Wed, Aug 01, 2007 at 07:11:28PM +0100, Justin Mason wrote: you could use --cachedir? Sure, if I wanted to spend time re-rigging my entire corpus/mass-check setup/migrating cache information/etc. I'd be ok with reverting that part of the change back to what it was before, but note that it's

Re: Requirement for perl RFC

2007-07-26 Thread Theo Van Dinter
On Thu, Jul 26, 2007 at 07:08:32PM -0400, Kevin A. McGrail wrote: I would vote +1 if it's the last version we *test* with 5.6.1 as a requirement for release. But I would prefer nothing actively be done to stop working w/5.6.1. Is there some major feature or anything that is causing the

Re: [VOTE] Release of SpamAssassin 3.1.10

2007-07-16 Thread Theo Van Dinter
On Tue, Jul 17, 2007 at 01:41:38PM +1200, Sidney Markowitz wrote: Apache SpamAssassin 3.1.10 is now available! This is a maintenance and security release of the 3.1.x branch. Is the reference to security release a copy and paste error? Hrm. Yes indeedy. It'll be just maintenance in the

[VOTE] Release of SpamAssassin 3.1.10

2007-07-15 Thread Theo Van Dinter
Hi -- 3.1.10 tarballs are available for testing: http://people.apache.org/~felicity/3110/ By the way: per ASF policies, only PMC member votes are binding for a release, but we encourage everyone to download, test, and report any issues! my vote: +1. BTW: my proposed release announcement is as

Re: svn commit: r555646 - /spamassassin/rules/trunk/sandbox/jm/20_basic.cf

2007-07-12 Thread Theo Van Dinter
Please tell me that I'm not seeing new full rules... ;) /me thinks we need to have a new plugin rule type. Perhaps like: rawmatch PDF_1 eval:rawmatch('^application/', '\AJVBERi0xLjMgCjEgMCBvYmoKPDwKPj4KZW5kb2JqCj') and then rawmatch() would simply find_parts(qr/$1/i,1) and return

Re: svn commit: r555062 - /spamassassin/rules/branches/3.1/80_additional.cf

2007-07-10 Thread Theo Van Dinter
Hrm. So where do updates go in the 3.2 branch? In 3.1 it's a very (imo) obviously named 80_additional.cf ... 72_active.cf ? On Tue, Jul 10, 2007 at 04:42:52PM -0400, Daryl C. W. O'Shea wrote: [EMAIL PROTECTED] wrote: Author: felicity Date: Tue Jul 10 12:55:50 2007 New Revision: 555062

Re: Inflated hit-frequencies results?

2007-07-03 Thread Theo Van Dinter
On Tue, Jul 03, 2007 at 10:24:02AM +0100, Justin Mason wrote: no ahas here unfortunately :( -- is this in your own local freqs, or the freqs on the server (with everyone else's logs too)? This is from hit-frequencies off of my net-theo weekly logs. It's very reproducible too:

Re: Inflated hit-frequencies results?

2007-07-03 Thread Theo Van Dinter
, and apparently only for weekly runs, and apparently only for this rule (pcregrep '([A-Z0-9_]+),\1(,|$)', shows only this rule duplicating)? sigh On Tue, Jul 03, 2007 at 12:02:12PM -0400, Theo Van Dinter wrote: On Tue, Jul 03, 2007 at 10:24:02AM +0100, Justin Mason wrote: no ahas here unfortunately
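For anyone without pcregrep handy, the same back-reference check can be sketched in Python. The regex is the one quoted above; the comma-separated input strings are made-up examples, not real mass-check log lines, and duplicated_rules is a hypothetical helper name.

```python
import re

# Same pattern as the pcregrep call: a rule name, a comma, then the same name
# again, followed by another comma or the end of the string.
DUPLICATE = re.compile(r"([A-Z0-9_]+),\1(,|$)")


def duplicated_rules(rule_list: str) -> list:
    """Return the names that appear twice in a row in a comma-separated list."""
    return [match.group(1) for match in DUPLICATE.finditer(rule_list)]


# Hypothetical rule lists for illustration only.
print(duplicated_rules("RULE_A,RULE_B,RULE_B,RULE_C"))  # ['RULE_B']
print(duplicated_rules("RULE_A,RULE_B,RULE_C"))         # []
```

Note that the pattern has no anchor on the left, so a name that is a suffix of its neighbour (FOO_BAR followed by BAR) would also be reported; splitting on commas and comparing adjacent items would be stricter.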

Re: Adding in mirrors for the updates channel ?

2007-06-18 Thread Theo Van Dinter
On Mon, Jun 18, 2007 at 10:40:51AM -0400, Kevin A. McGrail wrote: A month's worth of requests (~535k) right now is ~15G, which breaks down to ~18000 requests/day and ~6KB/sec. Do you really mean 18K requests / day? That seems like a lot. Yeah, that's the current count. I actually expected
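A quick sanity check of the figures quoted above, as a sketch. It assumes "a month" means 30 days and that "15G" means 15 GiB; neither is stated in the thread.

```python
SECONDS_PER_DAY = 86_400
DAYS = 30                      # assumption: a month taken as 30 days

requests = 535_000             # "~535k" requests in the month
total_bytes = 15 * 1024 ** 3   # assumption: "~15G" read as 15 GiB

requests_per_day = requests / DAYS
kib_per_second = total_bytes / (DAYS * SECONDS_PER_DAY) / 1024

print(f"{requests_per_day:.0f} requests/day")  # roughly the quoted ~18000
print(f"{kib_per_second:.1f} KiB/s")           # roughly the quoted ~6KB/sec
```

Under those assumptions both quoted numbers hold up: about 17.8k requests per day and about 6 KiB per second averaged over the month.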

Re: [Bug 5419] kill -HUP `pidof spamd` causes the ps name to change from spamd to perl

2007-06-18 Thread Theo Van Dinter
On Mon, Jun 18, 2007 at 11:26:43AM -0400, Daryl C. W. O'Shea wrote: Target Milestone|3.2.2 |3.2.3 Are we in a hurry to get 3.2.2 out? As I mentioned in bug 5510, I want to put an end to the problems installing via CPAN really soon since there's been a lot of noise

Re: Adding in mirrors for the updates channel ?

2007-06-16 Thread Theo Van Dinter
On Mon, Jun 11, 2007 at 07:36:24AM -0400, Theo Van Dinter wrote: So I'm thinking we should get some more mirrors for the channel setup to distribute the load. Coral sucked, so let's not use them. I also don't want to add in the zones machine since we already use a large amount of resources

Re: sa's SPF plugin suggestion

2007-06-15 Thread Theo Van Dinter
On Sat, Jun 16, 2007 at 12:27:44AM +0200, arni wrote: I've looked into sa's SPF plugin but to me (I'm not a perl person) it looks like it doesn't even see the raw record which would allow modification but leaves it all up to a Mail::SPF object. Any suggestions on how I could achieve this?

Adding in mirrors for the updates channel ?

2007-06-11 Thread Theo Van Dinter
Hey folks, I was having a chat with duncf about sa-update via irc this weekend, and it got me to thinking ... Right now, my machine is the only one serving up the updates channel. I originally did this so that I could see all of the downloads in the logs, and thereby generate statistics about

Re: Please Help

2007-05-09 Thread Theo Van Dinter
Hi, It sounds like you'd want to talk to the Amavis folks since you're using their tools to call SA. We can't help you with that. Also, for general SA questions, please use the users@ list. dev@ is meant for discussions about SA development and not support. fyi. On Wed, May 09, 2007 at

Re: Enabling Plugins

2007-03-30 Thread Theo Van Dinter
On Fri, Mar 30, 2007 at 11:35:25AM -0700, hubby2debbie wrote: the etc/spamassassin/Plugin/ directory, and enabled the TextCat plugin in the user_prefs file. I've also added a directive stating what languages I You can't enable plugins in a user_prefs file. You need to enable them in a *.pre

Re: rsync: link_stat nightly_mass_check (in tagged_builds) failed: No such file or directory (2)

2007-03-01 Thread Theo Van Dinter
On Thu, Mar 01, 2007 at 02:34:30PM +0000, Justin Mason wrote: There's another issue, too -- now SVN uses $HOME instead of the home field from getpwuid() when looking for the user's auth info. This means that 'sudo -u automc svn up' no longer works :( sudo -H -u ... :)

Messed up SVN install on our zone...

2007-02-25 Thread Theo Van Dinter
Hrm. So apparently on spamassassin.zones.apache.org:
$ which svn
/opt/subversion-1.1.4/bin/svn
but
$ svn --version
svn, version 1.4.3 (r23084) compiled Feb 25 2007, 04:34:40 [...]
which is interesting because:
$ svnversion --version
svnversion, version 1.1.4 (r13838) compiled May 2

ImageInfo plugin error

2007-02-15 Thread Theo Van Dinter
fyi: Use of uninitialized value in bitwise and (&) at /home/corpus/SA/spamassassin-corpora/masses/../lib/Mail/SpamAssassin/Plugin/ImageInfo.pm line 100, <$tmpfile> line 1.

Re: ImageInfo plugin error

2007-02-15 Thread Theo Van Dinter
On Thu, Feb 15, 2007 at 01:17:32PM -0500, Kevin A. McGrail wrote: Looks like less of an error perhaps and maybe just an invalid graphic? In the get_details hash procedure, it looks like unpacking the header sometimes returns a $packed that isn't initialized. Does this fail every time

failure in nightly run

2007-02-14 Thread Theo Van Dinter
This is a new one on me... JM, I assume this is related to the score generation changes? Can't use string ("0") as a HASH ref while "strict refs" in use at /home/corpus/SA/spamassassin-corpora/masses/../build/parse-rules-for-masses line 161. parse-rules-for-masses failed! at

Re: svn commit: r507718 - /spamassassin/branches/3.1/build/announcements/3.1.8.txt

2007-02-14 Thread Theo Van Dinter
/branches/3.1/Changes - --- -Randomly Selected Tagline: -You will soon learn the secrets of butterflies. - Their stark, unrelenting malice will drive you gibberingly insane. - (from The Onion's weekly horoscope for Gemini) - -From [EMAIL PROTECTED] Wed Feb 14 15:38:18 2007 -From: Theo Van

ANNOUNCE: Apache SpamAssassin 3.1.8 available!

2007-02-14 Thread Theo Van Dinter
Apache SpamAssassin 3.1.8 is now available! This is a maintenance and security release of the 3.1.x branch. It is highly recommended that people upgrade to this version. Downloads are available from: http://spamassassin.apache.org/downloads.cgi?update=200702131100 The release file will also

[VOTE] SpamAssassin 3.1.8 release

2007-02-13 Thread Theo Van Dinter
Hi -- 3.1.8 tarballs are available for testing: http://people.apache.org/~felicity/318/ by the way: per ASF policies, only PMC member votes are binding for a release, but we encourage everyone to download, test, and report any issues! my vote: +1. BTW: my proposed release announcement is as

GSoC 2007 ...

2007-02-13 Thread Theo Van Dinter
I'm assuming that there will be a Google Summer of Code 2007 going on, and that the ASF will be involved again. So it's a good time to start thinking about things we'd like to put up as possible projects. We still have a number of items from last year that we could use again. Anything else that

Re: Nagios

2007-02-12 Thread Theo Van Dinter
On Mon, Feb 12, 2007 at 10:53:40PM -0500, Duncan Findlay wrote: Any chance we can turn off the nagios notifications? Or at least turn them down in frequency? How about fixing the issues? ;) I haven't had time to figure out what it's monitoring for, so I haven't prodded the box to figure out

Re: sa-update and missing deps

2007-02-10 Thread Theo Van Dinter
On Sat, Feb 10, 2007 at 10:25:37AM -0500, Warren Togami wrote: Is it possible to modify sa-update to output a meaningful error message if it fails due to missing dependencies? (A different non-zero return code would be nice too.) ie: instead of the standard perl can't load module X error,

Re: need horsepower

2007-02-09 Thread Theo Van Dinter
On Fri, Feb 09, 2007 at 07:31:41PM +0000, Justin Mason wrote: actually, it may have been a mistake on my part... let's see how it goes now. ;) Did you forget to send input to STDIN? ;) Does anyone have a well-connected machine with [...] Anyway, yes, I still have systems available we could

Re: mass-check results ready?

2007-02-06 Thread Theo Van Dinter
On Tue, Feb 06, 2007 at 10:57:15AM -0600, Michael Parker wrote: Plus the list of contributors thus far is surprisingly short! Maybe in general we should just send out another announcement and get other folks who have requested rsync access a chance to respond. I'm not surprised. It's

Re: Scoregen round duex?

2007-02-05 Thread Theo Van Dinter
On Mon, Feb 05, 2007 at 03:14:07PM -0600, Doc Schneider wrote: Are we going to need to do another rescoring masscheck for 3.2.0? We should only have to do a single run unless we're going to change code that would have an effect on the outcome. I'm tempted to check in the URL deobfuscation

Re: Please vote on remaining 3.1.8 bugs

2007-02-01 Thread Theo Van Dinter
On Thu, Feb 01, 2007 at 08:45:55AM -0500, Thomas Bolioli wrote: With this release will you be running the scoring analysis before it goes out? No, 3.1.8 has no new scores. That'd be a separate process anyway, going out through the update channel. If so, is the utility provided self

Re: spamassassin rule config

2007-02-01 Thread Theo Van Dinter
On Thu, Feb 01, 2007 at 09:42:31AM -0800, mofturos wrote: should be ok if i just put this test in the local.cf ? First, I'd send this type of question to the users list since it's not a development-related question. Second, yes, your local site configs go in /etc/mail/spamassassin, typically in

Re: rule for new 'image' spam

2007-02-01 Thread Theo Van Dinter
On Thu, Feb 01, 2007 at 10:14:08AM -0500, Kevin A. McGrail wrote: This test (below) looks to me to be designed to catch all spam .gif through .gif? If so I would say the false positives are going to be pretty high even without running it through a corpus. Certainly it has a high

Re: FP check

2007-01-31 Thread Theo Van Dinter
On Wed, Jan 31, 2007 at 01:28:18AM -0500, Daryl C. W. O'Shea wrote: All are Chapters/Indigo order acknowledgment/confirmation/shipping emails. It looks like it's generated by the horrid MS AspMail 3.0 (which is also triggering MISSING_MIMEOLE). Date: Mon, 22 Jan 2007 19:06:22 GMT Date:

Please vote on remaining 3.1.8 bugs

2007-01-31 Thread Theo Van Dinter
I cleaned out a few bugs from the 3.1.8 queue, and put up some patches for the simple ones that didn't have them. Please vote on them so we can get them committed and get 3.1.8 out the door. I'd really like to get it released this week. Thanks! :)

Re: Using SQL for scoregen run is slow as heck!

2007-01-30 Thread Theo Van Dinter
On Tue, Jan 30, 2007 at 08:30:09AM -0600, Doc Schneider wrote: I didn't know it was possible to use MySQL for the scoregen runs. I would have used it instead. I also should have used a smaller corpus. Yeah, you can do whatever you want for the scoregen run, as long as it doesn't change the

FP check

2007-01-30 Thread Theo Van Dinter
DoS and Zmi: Can you check on these messages? If they're ham, can you send me the Date header(s)? ham-dos.log:. /home/dos/Maildir/._Inbox_2007/cur/1169531451.M60074P23070V0302I001C062A_47.cyan.dostech.net,S=4293:2,S ham-dos.log:.

Re: NOTICE: 3.2.0 rescoring mass-checks

2007-01-29 Thread Theo Van Dinter
On Mon, Jan 29, 2007 at 10:32:43PM +0100, Giampaolo Tomassoni wrote: Why do the instructions have bayes auto learning and AWL turned off? I guess because mass-check logs must be based on an absolute basis: two copies of the very same e-mail checked at beginning and at end of the list

Using SQL for scoregen run is slow as heck!

2007-01-29 Thread Theo Van Dinter
I'm mostly just venting, but I have 2 machines trying to do the scoregen run, so I need to share the Bayes DB, and chose SQL. In total across those machines, I'm currently only seeing ~25 msgs/min or 0.5 messages/sec. So for my ~215k messages, that's 430k seconds, or 5 days of processing. Set 1
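The estimate above is easy to re-run. The sketch below uses the numbers from the message; days_needed is a hypothetical helper. It also shows what the unrounded 25 msgs/min rate would give, since 25 per minute is a bit under 0.5 per second.

```python
SECONDS_PER_DAY = 86_400
MESSAGES = 215_000  # "~215k messages"


def days_needed(messages: int, rate_per_second: float) -> float:
    """Days of wall-clock time to process `messages` at a steady rate."""
    return messages / rate_per_second / SECONDS_PER_DAY


rounded = days_needed(MESSAGES, 0.5)      # the rounded rate used in the message
unrounded = days_needed(MESSAGES, 25 / 60)  # 25 msgs/min taken literally

print(f"at 0.5 msgs/sec: {rounded:.1f} days")
print(f"at 25 msgs/min:  {unrounded:.1f} days")
```

At 0.5 msgs/sec the run takes 430,000 seconds, which is the roughly 5 days quoted; taking 25 msgs/min literally pushes that closer to 6 days.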

Re: Using SQL for scoregen run is slow as heck!

2007-01-29 Thread Theo Van Dinter
On Mon, Jan 29, 2007 at 11:21:20PM -0500, Daryl C. W. O'Shea wrote: MySQL isn't hitting swap perhaps? That's the only way I've seen it go that slow (with spamd) and I've done zero tuning of MySQL. Nope, no swap usage on the machine. Is the load on the machine running the DB actually high,

Re: 2 types of rsync users?

2007-01-28 Thread Theo Van Dinter
On Thu, Jan 25, 2007 at 09:29:02AM -0600, Doc Schneider wrote: I read through the masses README's and it looked to me like one account was all that was needed. But, of course, I could be mistaken. In the past there's been 2, though most people wouldn't know the difference. Generally if people

Re: Output from cron command

2007-01-27 Thread Theo Van Dinter
Had to do a svn cleanup . rulesrc on ~release/versions/cvshead, then accept the cert during update. fyi. On Sat, Jan 27, 2007 at 09:27:53AM +0000, [EMAIL PROTECTED] wrote: Your cron job on spamassassin.zones.apache.org $HOME/versions/cvshead/build/nightlymc/corpora_runs

Re: Output from cron command

2007-01-27 Thread Theo Van Dinter
On Sat, Jan 27, 2007 at 11:37:23AM -0600, Doc Schneider wrote: I get an error trying to update the trunk [EMAIL PROTECTED] sadev]$ svn up https://svn.apache.org/repos/asf/spamassassin/trunk/ . Skipped 'https://svn.apache.org/repos/asf/spamassassin/trunk' svn: REPORT request failed on

Re: question about the lists

2007-01-26 Thread Theo Van Dinter
On Fri, Jan 26, 2007 at 11:14:47AM -0500, [EMAIL PROTECTED] wrote: Is this the list appropriate for a discussion on how to use Mail::SpamAssassin in a perl application? It's more of a users@ question IMO, but... my $st = new Mail::SpamAssassin() $st->init(1); I found something like

Re: Mass-check question with mbox's

2007-01-25 Thread Theo Van Dinter
On Thu, Jan 25, 2007 at 03:19:46PM -0500, Fred Tarasevicius wrote: Y 6 /spamassassin/masses/corpus.ham/HAM.2007.mbox.258825 I'm guessing that .258825 in the string above is the starting byte position inside the mbox? Yes, mbx and mbox file specifications are path.byteoffset. Everything
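The path.byteoffset form described above can be illustrated with a small sketch. This is not the mass-check code or the samailoffset script; the helper names are hypothetical, and the reader assumes a plain mbox in which each message begins with a "From " separator line and runs until the next such line (mbx files are laid out differently and are not handled here).

```python
def split_spec(spec):
    """Split a 'path.byteoffset' string into (path, offset).

    Hypothetical helper: the offset is taken to be the digits after
    the last dot, as in the log line quoted in the thread.
    """
    path, _, offset = spec.rpartition(".")
    if not path or not offset.isdigit():
        raise ValueError(f"not a path.byteoffset spec: {spec!r}")
    return path, int(offset)


def read_message_at(path, offset):
    """Return the raw bytes of the mbox message starting at offset.

    Assumes plain mbox: read from the separator line at `offset`
    up to (not including) the next line that starts with 'From '.
    """
    with open(path, "rb") as fh:
        fh.seek(offset)
        lines = [fh.readline()]      # the "From " separator line itself
        for line in fh:
            if line.startswith(b"From "):
                break                # start of the following message
            lines.append(line)
    return b"".join(lines)


path, offset = split_spec(
    "/spamassassin/masses/corpus.ham/HAM.2007.mbox.258825"
)
print(path, offset)
```

With the log entry from the thread, `split_spec` yields the mbox path and the offset 258825, and `read_message_at(path, offset)` would then pull that one message out of the file.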

Re: Mass-check question with mbox's

2007-01-25 Thread Theo Van Dinter
On Thu, Jan 25, 2007 at 03:00:07PM -0600, Doc Schneider wrote: Can you put it somewhere where we all could get it? Say check it into svn under your rules space? I also use mbox format and am always needing to find some ham mixed in with the spam. Sure. For now I put it in contrib

Re: svn commit: r500015 - /spamassassin/trunk/contrib/samailoffset

2007-01-25 Thread Theo Van Dinter
On Thu, Jan 25, 2007 at 11:26:22PM +0000, Justin Mason wrote: doesn't masses/mboxget do that? I mentioned that to folks, but they still requested my script, so I threw it in. In hindsight, I could just have put up an URL to get it from my machine. -- Randomly Selected Tagline: But you have to

Re: VOTE: SpamAssassin 3.2.0 prerelease 1 tarballs

2007-01-23 Thread Theo Van Dinter
On Sun, Jan 21, 2007 at 04:45:10PM +0000, Justin Mason wrote: OK, that didn't go too well. *Still* not enough votes... I've pushed out a not-a-prerelease announcement, instead, since we need to get going on that to avoid having to push out the release schedule before we've even completed

Re: VOTE: SpamAssassin 3.2.0 prerelease 1 tarballs

2007-01-23 Thread Theo Van Dinter
On Tue, Jan 23, 2007 at 06:46:57PM -0600, Doc Schneider wrote: I just noticed something the pre1 tarball release has no masses directory. Ugh. Sort of hard to masscheck without that isn't it? There is one in svn, though. Yes and no. There's nothing that says we have to provide a tarball to

Re: [Spamassassin Wiki] Trivial Update of MassCheck by FredTarasevicius

2007-01-18 Thread Theo Van Dinter
On Thu, Jan 18, 2007 at 04:16:51PM +0000, Justin Mason wrote: Razor/DCC/Pyzor are all use rules. So there is no need to install those for a mass-check, unless you have lots of msgs that were not checked initially. We should only be using the historical data for these rules anyway, so

Re: 3.2.0 release schedule

2007-01-17 Thread Theo Van Dinter
On Mon, Jan 15, 2007 at 10:56:15PM +0000, Justin Mason wrote: ok. so something like this? - T + 0 days: issue prerelease. announce a heads-up mail. clean up our corpora, get ready for mass-checking, try out mass-check to spot any big memory leaks or whatnot, fix remaining bugs

Re: SUBJECT_ENCODED_TWICE

2007-01-16 Thread Theo Van Dinter
On Tue, Jan 16, 2007 at 10:49:36AM -0800, Karl Chen wrote: Hi all, I believe SUBJECT_ENCODED_TWICE can be improved. The rule currently is: Ok. header SUBJECT_ENCODED_TWICE Subject:raw =~ /=\?\S+\?[BQ]\?.*=\?\S+\?[BQ]\?/i As I understand it, this rule is intended to match subject lines
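To see what the rule as quoted matches, here is a small sketch that carries the same pattern over to Python's `re` module. This is an illustration only, not how SpamAssassin evaluates rules; the function name and the sample subjects are made up for the example. The `/i` flag becomes `re.IGNORECASE`, and the pattern is applied to the raw (still encoded) Subject value, as `Subject:raw` implies.

```python
import re

# Same pattern as the quoted rule, with /i expressed as IGNORECASE.
SUBJECT_ENCODED_TWICE = re.compile(
    r"=\?\S+\?[BQ]\?.*=\?\S+\?[BQ]\?", re.IGNORECASE
)


def encoded_twice(raw_subject):
    """True if the raw Subject contains two encoded-word openers."""
    return SUBJECT_ENCODED_TWICE.search(raw_subject) is not None


samples = [
    "=?UTF-8?B?SGVsbG8=?= =?UTF-8?B?V29ybGQ=?=",   # two encoded words
    "=?iso-8859-1?q?Hello?=",                       # one encoded word
    "Plain ASCII subject",                          # no encoding at all
]
for s in samples:
    print(encoded_twice(s), s)
```

As written, the pattern fires on any raw subject that contains two encoded-word openers on the same line, whatever the charsets or encodings are.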

Re: 3.2.0 release schedule

2007-01-15 Thread Theo Van Dinter
On Wed, Jan 03, 2007 at 02:42:44PM +0000, Justin Mason wrote: First step, I think, is to define a schedule. How does this sound? (based approximately on what we did for 3.1.0: http://wiki.apache.org/spamassassin/Release310Schedule ) - T + 0 days: announce a heads-up mail. clean up our

Re: svn commit: r494093 - in /spamassassin/trunk: INSTALL MANIFEST lib/Mail/SpamAssassin/Util/DependencyInfo.pm spamc/config.h.in spamc/configure spamc/configure.in spamc/libspamc.c spamc/libspamc.h s

2007-01-08 Thread Theo Van Dinter
On Mon, Jan 08, 2007 at 03:33:54PM -0000, [EMAIL PROTECTED] wrote: + - Compress::Zlib (from CPAN) + +If you wish to use the optional zlib compression for communication +between spamc and spamd (the -z option to spamc), useful for +long-distance use of spamc over the internet, you
