At 6:22 AM -0800 11/18/02, Bill Moseley wrote:
I'm not clear what URI should do here.  In a spider fetching /../foo then
fetches /../../foo and so on.

 perl -MURI -le 'print  URI->new_abs("../foo","http://root.com";)->as_string'
http://root.com/../foo

I can fix $uri->path, of course.
Some relative links are just wrong. Browsers seem to just stop at the root level, so the page creators never get an error in testing. I would say to swallow any parent links above root, perhaps putting a note into the indexing log.

Avi


--
Complete Guide to Search Engines for Web Sites and Intranets
<http://www.searchtools.com>

Reply via email to