Gregory Kozlovsky wrote: > All right, > > I typed in mysql > select url,hops from urlword where > url='http://www.washtimes.com/entertainment/'; > and got > http://www.washtimes.com/entertainment/ | 2 | > Now, I indexed with index -o command line argument which, according to > the description > -o Index documents with less hops first. Here "hops" means the "depth" > value of the document. > If this was true, this document should've have level 1, because it is linked > to the front page. > Here is the reason, why it has level 2 found with index -P: > > 2 200 http://www.washtimes.com/entertainment/ > 1 301 http://www.washtimes.com/entertainment > 0 200 http://www.washtimes.com/ > > Apparently, the redirection from URL without the slash at the end to one > with the > slash is not recognized by the ASPSeek as a special case. Is there any way > around it? >
Unfortunately, there is no way to workaround, you can only set MaxHops to 3. Or you can modify sources, so that redirect will not increase hop value. See file parse.cpp, method CUrl::HTTPGetUrlAndStore, fix line hrID = wordCache.GetHref(str, CurSrv, doc.m_urlID, doc.m_hops + 1, srv); > > Gregory > > -----Original Message----- > From: Alexander F Avdonkin [mailto:[EMAIL PROTECTED]] > Sent: Donnerstag, 13. Juni 2002 16:34 > To: [EMAIL PROTECTED] > Subject: Re: [aseek-users] Indexing with MaxHops > > You told that page is reacheable by 2 clicks. > Check if all intermediate pages are indexed and which hop values are > assigned > to them. > > Gregory Kozlovsky wrote: > > > No, it was not indexed. I checked logging into mysql and using a SELECT > > statement. > > > > -----Original Message----- > > From: Alexander F Avdonkin [mailto:[EMAIL PROTECTED]] > > Sent: Donnerstag, 13. Juni 2002 16:17 > > To: [EMAIL PROTECTED] > > Subject: Re: [aseek-users] Indexing with MaxHops > > > > Hello Gregory, > > > > As for first problem, check if page referring to absent URL is indexed and > > what > > hop value is assigned to it. > > > > Alexander.
