At 6:22 AM -0800 11/18/02, Bill Moseley wrote:
Some relative links are just wrong. Browsers seem to just stop at the root level, so the page creators never get an error in testing. I would say to swallow any parent links above root, perhaps putting a note into the indexing log.I'm not clear what URI should do here. In a spider fetching /../foo then fetches /../../foo and so on.perl -MURI -le 'print URI->new_abs("../foo","http://root.com")->as_string'http://root.com/../foo I can fix $uri->path, of course.
Avi
--
Complete Guide to Search Engines for Web Sites and Intranets
<http://www.searchtools.com>