Hi, Geoff. Answers and queries follow.
> htdig -? (with any program)
> gives the help message and a version number.
I'm not seeing the expected result:
% htdig -?
htdig: No match.
% htmerge -?
htmerge: No match.
% htfuzzy -?
htfuzzy: No match.
Am I missing something here?
> I guess the next question I'd ask is what your max_head_length and
> max_document_length variables are--excerpts are trimmed by the former and
> all documents are trimmed by the latter before indexing.
max_head_length: 10000
max_doc_size is unset, so it's the default (100,000).
The example document ("p4 delete") is exactly 9993 characters, including
the html, so the above values don't seem to be influencing this.
> I'd also ask how your database is updated--has it been updated
> recently? Have you tried rebuilding it from scratch?
I ran htdig -i last night right before I wrote this letter.
Geoff, how would the comment tags affect the results? And does htdig
follow the simple algorithm of ignoring anything between < and >, and
ignoring anything between noindex_start and noindex_end, and counting
everything else as content?
Thanks,
Robert Orenstein
Perforce Software
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html