According to s Kyrsa: > I'm trying to index my website, but I found something strange. > Sometimes Htdig shows the top of the document, sometimes you can have the > part of the document which is countaining the keyword .. > > Do you know why ???
The excerpt matching that htsearch does is separate from the word database matching. When htdig indexes a document, all the words in the body of the document, as well as those in the title, meta description, meta keywords, and even the words in the description text of links to this document all go into the word database. However, for excerpt matching, htdig only stores the first "max_head_length" characters of the body of the document. See http://www.htdig.org/attrs.html#max_head_length When htsearch finds a word in the word database, it will display all matching documents, regardless of where the word was found. However, when it tries to find the same word (or words) in the max_head_length top characters of a document, sometimes this latter match will work and sometimes it won't because the words aren't there (document too big, or the search matched a meta tag or link description rather than a word in the body). If it can find a match in the excerpt, it will highlight it (see http://www.htdig.org/attrs.html#start_highlight) but if it can't find it, it will either show the top of the document (see http://www.htdig.org/attrs.html#no_excerpt_show_top) or some other message (see http://www.htdig.org/attrs.html#no_excerpt_text). -- Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]> Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/ Dept. Physiology, U. of Manitoba Winnipeg, MB R3E 3J7 (Canada) ------------------------------------------------------- This SF.net email is sponsored by: SF.net Giveback Program. SourceForge.net hosts over 70,000 Open Source Projects. See the people who have HELPED US provide better services: Click here: http://sourceforge.net/supporters.php _______________________________________________ ht://Dig general mailing list: <[EMAIL PROTECTED]> ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html List information (subscribe/unsubscribe, etc.) https://lists.sourceforge.net/lists/listinfo/htdig-general

