Eric A. Hall writes:
> I sometimes get SVN notifications that contain lists of files and their
> status. The filenames will often get picked up by the URI matching
> algorithm, each of which end up being processed through numerous lookups
> (URICOUNTRY, my LDAP filter, etc). Sometimes I get very large messages
> with hundreds of file lists, which in turn causes spamassassin to go into
> never-never land while it thinks about the hundreds of "URI" matches.
> 
> For example,
> 
>   A    fpo/reports/perl/nagios_notifications1.pl.bak
>   A    foo/reports/perl/nagios_outages1.pl
>   A    foo/reports/perl/GWIR.pm
> 
> nagios_outages1.pl will be determined as a URI for .pl domain and GWIR.pm
> will be determined as a URI for .pm domain, and so forth. The only way to
> get these messages through is to disable spamassassin...
> 
> I've updated to 3.2.4 just now and it still has the same problem
> 
> I'm guessing the URI analyzer needs to be smarter.

The URI analyzer already is smarter ;)

Changing the URICountry plugin is the way to fix this.

The Mail/SpamAssassin/Plugin/URIDetail plugin is a good example of how
plugins can get metadata about the URIs via the get_uri_detail_list() API.
looking at the POD doc and source for that
in Mail/SpamAssassin/PerMsgStatus, I see that "types" == "parsed" should
mean that the URI was inferred, instead of found in a link or image.
URICountry should ignore URIs of that type.

--j.

Reply via email to