On Wed Feb 04, 2009 at 08:45:19 +1100, Ted Cooper wrote:

> What do you use to get the URI out reliably? I also run most of my
> testing externally via ${readsocket} but have no method other than using
> SpamAssassin to get the URI out and that only gives me the raw domain names.

  I use the URI::Find Perl module, which is passed the message body
 and allows me to retrieve all the links contained in it.  I then
 strip out links which don't begin with https?:// and iterate over
 each one.

    http://search.cpan.org/dist/URI-Find/lib/URI/Find.pm
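  As a rough illustration of that extract-then-filter step, here is a
 minimal sketch in Python.  It is not the URI::Find code itself: a
 deliberately simple regular expression stands in for the module's much
 smarter matching, and the function names (find_links, http_links_only,
 link_hosts) are made up for this example.

```python
import re
from urllib.parse import urlsplit

# Assumption: a naive "scheme://..." pattern stands in for URI::Find,
# which copes with far more edge cases than this does.
CANDIDATE_RE = re.compile(r"\b[a-z][a-z0-9+.-]*://[^\s<>\"']+", re.IGNORECASE)
HTTP_RE = re.compile(r"https?://", re.IGNORECASE)


def find_links(body):
    """Return every scheme://... link in body, in order, without duplicates."""
    links = []
    for match in CANDIDATE_RE.finditer(body):
        # Drop punctuation that usually belongs to the sentence, not the link.
        url = match.group(0).rstrip(".,;:!?)]}")
        if url not in links:
            links.append(url)
    return links


def http_links_only(links):
    """Strip out links which don't begin with http:// or https://."""
    return [url for url in links if HTTP_RE.match(url)]


def link_hosts(links):
    """Return the lower-cased hostname of each link."""
    return [urlsplit(url).hostname for url in links]


if __name__ == "__main__":
    body = (
        "Visit http://example.com/a, or https://Example.org/b?x=1. "
        "Also ftp://example.net/file and mailto:bob@example.com"
    )
    # Iterate over each remaining link; a real test would go here.
    for url in http_links_only(find_links(body)):
        print(url, urlsplit(url).hostname)
```

  What happens to each link inside the loop (a blocklist lookup, for
 instance) is left out, since it depends on the test being run.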

  This plugin for qpsmtpd was what inspired me to use this for testing
 incoming mail:

    http://devin.com/qpsmtpd/uribl

  There's an example of the technique which might be more digestible,
 even though it isn't related to email:

    http://blogspam.repository.steve.org.uk/?file/9ff750c55b37/server/plugins/surbl.pm

Steve
-- 
Managed Anti-Spam Service
http://mail-scanning.com/


-- 
## List details at http://lists.exim.org/mailman/listinfo/exim-users 
## Exim details at http://www.exim.org/
## Please use the Wiki with this list - http://wiki.exim.org/
