According to s Kyrsa:
> I'm trying to index my website, but I found something strange.
> Sometimes Htdig shows the top of the document, sometimes you can have the 
> part of the document which is countaining the keyword ..
> 
> Do you know why  ???

The excerpt matching that htsearch does is separate from the word
database matching.  When htdig indexes a document, all the words in the
body of the document, as well as those in the title, meta description,
meta keywords, and even the words in the description text of links to this
document all go into the word database.  However, for excerpt matching,
htdig only stores the first "max_head_length" characters of the body of
the document.  See http://www.htdig.org/attrs.html#max_head_length

When htsearch finds a word in the word database, it will display all
matching documents, regardless of where the word was found.  However,
when it tries to find the same word (or words) in the max_head_length
top characters of a document, sometimes this latter match will work
and sometimes it won't because the words aren't there (document too
big, or the search matched a meta tag or link description rather than
a word in the body).  If it can find a match in the excerpt, it will
highlight it (see http://www.htdig.org/attrs.html#start_highlight)
but if it can't find it, it will either show the top of the document
(see http://www.htdig.org/attrs.html#no_excerpt_show_top) or some other
message (see http://www.htdig.org/attrs.html#no_excerpt_text).

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/
Dept. Physiology, U. of Manitoba  Winnipeg, MB  R3E 3J7  (Canada)


-------------------------------------------------------
This SF.net email is sponsored by: SF.net Giveback Program.
SourceForge.net hosts over 70,000 Open Source Projects.
See the people who have HELPED US provide better services:
Click here: http://sourceforge.net/supporters.php
_______________________________________________
ht://Dig general mailing list: <[EMAIL PROTECTED]>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general

Reply via email to