Oh my, I'm writing yet another spider, for some reason.
I'd like to spider each document only once, so I'm using a hash keyed on
URI->canonical.
Although I realize these *could* be two different docs, they are not on our
server:
http://localhost/path/to/my/file.html
http://localhost/path/to/../to/my/file.html
Any (URI?) tricks for seeing those as the same document?
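
To make the idea concrete, here is a rough sketch of the kind of normalization I mean: collapse the "." and ".." path segments (along the lines of RFC 3986, section 5.2.4) before using the URL as a key. It is written in Python only to illustrate the logic and is not the URI module's API. The names remove_dot_segments and canonical_key are made up for this example, and it assumes absolute paths and a server where ".." really does mean the parent directory.

```python
from urllib.parse import urlsplit, urlunsplit


def remove_dot_segments(path):
    """Collapse '.' and '..' segments in an absolute URL path.

    Hypothetical helper, loosely following RFC 3986 section 5.2.4.
    Assumes the path is empty or starts with '/'.
    """
    segments = path.split("/")
    out = []
    for seg in segments:
        if seg == ".":
            continue  # '.' refers to the current directory: drop it
        if seg == "..":
            # Step up one level, but never above the root (out[0] == "")
            if len(out) > 1:
                out.pop()
            continue
        out.append(seg)
    # A trailing '.' or '..' names a directory, so keep the final slash
    if segments[-1] in (".", ".."):
        out.append("")
    return "/".join(out)


def canonical_key(url):
    """Return the URL with its path normalized, for use as a 'seen' key."""
    parts = urlsplit(url)
    return urlunsplit(parts._replace(path=remove_dot_segments(parts.path)))


# Usage: both spellings from the example map to a single key.
seen = set()
for url in (
    "http://localhost/path/to/my/file.html",
    "http://localhost/path/to/../to/my/file.html",
):
    seen.add(canonical_key(url))
```

With this, the seen set ends up holding one entry for the two URLs above. It only rewrites the path; scheme, host, and query string are passed through untouched.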
Bill Moseley
