Author: Aaron Hawley
Email: [EMAIL PROTECTED]
Message:
so to follow up:

multiple slashes in a url will not allow it to be matched by string and regex commands 
or robots.txt files.  mnoGoSearch needs to convert urls like this:

^(http://.*/)/+(.*)
to
$1$2

and have that be done repeatedly to a url until all multiple slashes are removed.

aaron


> How do folks handle situations in your string and regular expressions where a web 
>server can handle multiple backslashes in a url?
> 
> example:
> 
> http://www.uvm.edu//foo
> http://www.uvm.edu///foo
> http://www.uvm.edu////foo
> 
> all equal
> http://www.uvm.edu/foo
> 
> which is already handled by a disallow command or the robots file.
> 

Reply: <http://www.mnogosearch.org/board/message.php?id=4293>

___________________________________________
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]

Reply via email to