Apache does not allow a URL to reach above the document root (the public_html location). For example:

http://www.gnu.org/../software/wget/manual/wget.html

will cause a "Bad Request" page to be generated because of the ".." in the URL.

But IIS does not handle ".." the same way. IIS will simply ignore a ".." at the top of the path and produce the page. So the following two URLs reference the same HTML page:

http://www.merseyfire.gov.uk/pages/fire_auth/councillors.htm

and

http://www.merseyfire.gov.uk/../pages/fire_auth/councillors.htm

The reason I bring this up is that if I use wget to access the URL with "..", it will translate ".." into "%2E%2E" like this:

wget -r http://www.merseyfire.gov.uk/../pages/fire_auth/councillors.htm

is saved to

www.merseyfire.gov.uk/%2E%2E/pages/fire_auth/councillors.htm

although it should be saved to

www.merseyfire.gov.uk/pages/fire_auth/councillors.htm

It would be nice if wget could handle these malformed URLs properly, since they do appear from time to time. (In the case of www.merseyfire.gov.uk, they appear very frequently, unfortunately.)
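
To illustrate the behaviour I am asking for, here is a rough Python sketch (not wget's actual code) that resolves "." and ".." in the path and simply drops any ".." that would climb above the root, which is how I understand RFC 3986 dot-segment removal to work. The function names remove_dot_segments and normalize_url are my own, made up for this example.

```python
from urllib.parse import urlsplit, urlunsplit


def remove_dot_segments(path):
    """Resolve "." and ".." in an absolute URL path, clamping at the root.

    Hypothetical helper for illustration only; not taken from wget.
    """
    if not path.startswith("/"):
        # Empty or relative paths are left untouched in this sketch.
        return path

    segments = path.split("/")[1:]  # drop the empty string before the leading "/"
    output = []
    for seg in segments:
        if seg == "..":
            # Go up one level; a ".." at the root is silently ignored.
            if output:
                output.pop()
        elif seg != ".":
            output.append(seg)

    # A trailing "." or ".." refers to a directory, so keep the trailing slash.
    if segments and segments[-1] in (".", ".."):
        output.append("")

    return "/" + "/".join(output)


def normalize_url(url):
    """Return the URL with dot segments removed from its path."""
    parts = urlsplit(url)
    return urlunsplit(parts._replace(path=remove_dot_segments(parts.path)))


if __name__ == "__main__":
    url = "http://www.merseyfire.gov.uk/../pages/fire_auth/councillors.htm"
    print(normalize_url(url))
```

With the path normalized this way before the local file name is built, the download would end up under www.merseyfire.gov.uk/pages/fire_auth/ rather than under a %2E%2E directory.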

Thanks,
Frank
