According to [EMAIL PROTECTED]:
> i have encountered a somewhat strange situation with htdig:
> some pages don't show up in the htsearch-result, though they have been
> indexed normaly, what the log (output from -vv) shows clearly.
> 
> thats what it looks like, when i grep for the missing item's article
> numbers in db.wordlist:
> 
> fp11994 i:780   l:0     w:150000
> fp11994 i:1     l:125   w:875
> fp11994 i:780   l:0     w:150000
> 
> and this is one, that is properly found:
> 
> fp6087fsases    i:794   l:0     w:150000
> fp6087  i:1     l:127   w:873
> fp6087fsases    i:1     l:127   w:873
> 
> btw: it seems as if all items that start with fp-1199 (there are 5) suffer
> the same fate, but also some others...
> 
> if someone could explain that to me or give me a clue, that would be
> great!

Well, it doesn't seem to be rigning a bell with anyone, if that's what
you were hoping for, so I think you'd need to give us a little more
information to go on.  htdig does normally crop words in the wordlist
to the length specified by maximum_word_length, which is 12 by default
in 3.1.5.  If that's not the issue here, then you may want to give us a
bit more of a clue as to what a complete article number looks like in one
of your documents, or better yet suggest a URL or two that contain article
numbers that get cropped, so that we could visit and see for ourselves.

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to