On Wed, 18 Apr 2001, Adam Lewenberg wrote:

> Why does the TITLE sometimes not appear in the
> search results page? 
> 
> Try this: 
> 
> http://helios.library.uiuc.edu:9000/experimental-cgi-bin/htsearch
>
> I am using 3.2.0b4-041501 on Linux RH 6.2

I don't see any problems at this point. There's one page with a
[filename.html] but that one doesn't have a <TITLE> tag.

I'm going to make a guess--you don't run htpurge after running htdig.

Keep in mind that if you stem the indexing in some fashion (e.g. using
server_max_docs, or a hopcount, or whatever), there will be "stubs" for
additional documents from links and so on. They haven't been fetched, but
they're in the database. If you use server_max_docs, you can actually
slowly go through and fill out some of these stubs on additional runs.

You can make sure that all stubs are removed from htpurge using the 3.2
"remove_unretrieved_urls" attribute:
<http://www.htdig.org/dev/htdig-3.2/attrs.html#remove_unretrieved_urls>

--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/



_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to