--- Geoff Hutchison <[EMAIL PROTECTED]> wrote:
>
> > > where would I change the URL parser?
>
> If you'd like to hack away at the URL parser, it's in htlib/URL.cc.
>
If you're going to hack the parser to handle broken URLs, then please make this configurable.

Personally, I'd rather get warnings/errors in the output of htdig indicating that a URL in a webpage is invalid than break htdig's conformance to standards. Just because IE (and possibly NS) are broken doesn't mean that all web clients (browsers and/or robots) are.

Just my thoughts (I've written URL parsing code a few times, back before there ever was a CPAN or Java 1.0... adhering to standards is a Good Thing).

greg_fenton.

=====
Greg Fenton
[EMAIL PROTECTED]

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

