According to David Adams: > I am also using ht://Dig version 3.1.6 and for me it IS indexing URLs like > > http://www.soton.ac.uk/~lopsoc/gallery.php?gallery=sorcerer1&photo=CNV00023.jpg > > even though I have .jpg in my bad_extensions: list.
Actually, I find this surprising. Upon looking at the code that handles bad_extensions, in both 3.1.6 and 3.2.0b5, it seems to me that there is indeed a bug in the way htdig locates filename extensions in URLs, as Toby described. Can you confirm that you're running vanilla 3.1.6 with no patches to htdig/Retriever.cc which might correct this bug? The fix to the code should be pretty simple, but I haven't had the time to sit down and stare at it long enough to get the fix coded yet. I'll try to get around to it by Friday, so it'll be in the next development snapshot for the 3.2 betas, and posted to the list. > ----- Original Message ----- > From: "Toby Thain" <[EMAIL PROTECTED]> ... > > I noticed today that htdig is not indexing URLs like: > > > > /foo/page.php3?f=bar.jpg > > > > because it notices the URL ends with ".jpg". I am surprised that it's > > not smart enough to realise that the fetched object is actually a > > ".php3", and I definitely want that URL followed. > > > > Is this fixed in a recent version (I am using ht://Dig 3.1.6)? Or is > > there a simple configuration fix? -- Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]> Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/ Dept. Physiology, U. of Manitoba Winnipeg, MB R3E 3J7 (Canada) ------------------------------------------------------- This SF.Net email is sponsored by: IBM Linux Tutorials Free Linux tutorial presented by Daniel Robbins, President and CEO of GenToo technologies. Learn everything from fundamentals to system administration.http://ads.osdn.com/?ad_id=1470&alloc_id=3638&op=click _______________________________________________ ht://Dig general mailing list: <[EMAIL PROTECTED]> ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html List information (subscribe/unsubscribe, etc.) https://lists.sourceforge.net/lists/listinfo/htdig-general

