Anyone have any thoughts on this? the BLANK_LINES rules appear to be
broken.
Michael
Begin forwarded message:
From: Michael Parker <[email protected]>
Date: September 21, 2009 10:22:26 AM CDT
To: SpamAssassin Dev <[email protected]>
Subject: split_into_array_of_short_paragraphs vs short_lines
Howdy,
I was looking at why our old 3.1 instance of SA was hitting a few of
the BLANK_LINES_NN_NN rules where as 3.3 stuff wasn't hitting at
all. I narrowed it down to what get_decoded_body_text_array returns.
For instance, I have a short mail that is 4 lines long. In 3.1
get_decoded_body_text_array would return an array with 4 elements
(ie lines), however in 3.3 that call now returns a single element
array with a "paragraph."
This of course breaks this code in check_blank_line_ratio:
if (scalar @{$fulltext} >= $minlines) {
foreach my $line (@{$fulltext}) {
next if ($line =~ /\S/);
$blank++;
}
$pms->{blank_line_ratio}->{$minlines} = 100 * $blank / scalar
@{$fulltext};
}
else {
$pms->{blank_line_ratio}->{$minlines} = -1; # don't report if
it's a blank message ...
}
Because its looking at array elements and not actual lines.
I suspect there may be other similar eval rules doing that same thing.
Thoughts?
Michael