http://bugzilla.spamassassin.org/show_bug.cgi?id=3268





------- Additional Comments From [EMAIL PROTECTED]  2004-04-19 12:48 -------
Subject: Re:  RFE: Strip off redirectors for URI tests

On Mon, Apr 19, 2004 at 11:27:49AM -0700, [EMAIL PROTECTED] wrote:
> I need to go through and find out where that yahoo.c parsing is happening 
> (I'm fairly positive I know 
> where).  I can see why it does it though:
> 
> <a 
> href="http://pa.yahoo.com/*http://us.rd.yahoo.com/evt=23765/*http://photos.yahoo.c
> om/ph/print_splash">
> 
> there's a hard newline in there, so part of the parsing sees it as EOL, and 
> the HTML parsing successfully 
> sees it in an href and takes the whole thing, including the newline -- which 
> is why I added in code to 
> strip the newlines out.

Yeah, the problem is that the get_uri_list() was using the decoded
body and parsing for URIs using REs, then using the URI results of the
HTML parser.

I modified the code to use the rendered body for generic RE parsing,
then let the HTML parser do its thing for the HTML sections...





------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.

Reply via email to