Hi, Geoff. Answers and queries follow.

> htdig -? (with any program)
> gives the help message and a version number.

I'm not seeing the expected result:

  % htdig -?
  htdig: No match.
  % htmerge -?
  htmerge: No match.
  % htfuzzy -?
  htfuzzy: No match.

Am I missing something here?

> I guess the next question I'd ask is what your max_head_length and
> max_document_length variables are--excerpts are trimmed by the former and
> all documents are trimmed by the latter before indexing.

max_head_length:        10000
max_doc_size is unset, so it's the default (100,000).

The example document ("p4 delete") is exactly 9993 characters, including
the html, so the above values don't seem to be influencing this. 

> I'd also ask how your database is updated--has it been updated
> recently? Have you tried rebuilding it from scratch?

I ran htdig -i last night right before I wrote this letter.

Geoff, how would the comment tags affect the results? And does htdig
follow the simple algorithm of ignoring anything between < and >, and
ignoring anything between noindex_start and noindex_end, and counting
everything else as content?

Thanks,

Robert Orenstein
Perforce Software


_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to