Yo,
Here's an idea for you:
What if we had a feature that stripped the querystrs from a URL
contained in "bad_querystr" rather than rejecting them?
This would allow htdig to better index php/asp etc pages which may use
the same page for different documents.
Example:
http://www.xxxx.com/document.php?docid=5&session_id=98721491204724
http://www.xxxx.com/document.php?docid=5&session_id=09235783432458
These would 'map' to the same URL
http://www.xxxx.com/document.php?docid=5
But still allow these two URLs to be treated as different pages
http://www.xxxx.com/document.php?docid=5
http://www.xxxx.com/document.php?docid=10
Has this been done? Please hit me with a verbal fying pan if htdig
supports this now.
I've noticed that Google is starting to get smart about beng able to strip
some querystrs that are sessionids while leaving others alone.
There is no 'out-of-the-box' default way to do this automatically since
ndividual webdevelopers can chose any querystr they want to represent
whatever.
Thanks and Happy Valentine's day you female HtDigers lurking out there!
Neal Richter
Knowledgebase Developer
RightNow Technologies, Inc.
Customer Service for Every Web Site
Office: 406-522-1485
-------------------------------------------------------
This SF.NET email is sponsored by: FREE SSL Guide from Thawte
are you planning your Web Server Security? Click here to get a FREE
Thawte SSL guide and find the answers to all your SSL security issues.
http://ads.sourceforge.net/cgi-bin/redirect.pl?thaw0026en
_______________________________________________
htdig-dev mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/htdig-dev