According to [EMAIL PROTECTED]:
> i have encountered a somewhat strange situation with htdig:
> some pages don't show up in the htsearch-result, though they have been
> indexed normally, as the log (output from -vv) clearly shows.
>
> that's what it looks like when i grep for the missing item's article
> numbers in db.wordlist:
>
> fp11994 i:780 l:0 w:150000
> fp11994 i:1 l:125 w:875
> fp11994 i:780 l:0 w:150000
>
> and this is one, that is properly found:
>
> fp6087fsases i:794 l:0 w:150000
> fp6087 i:1 l:127 w:873
> fp6087fsases i:1 l:127 w:873
>
> btw: it seems as if all items that start with fp-1199 (there are 5) suffer
> the same fate, but also some others...
>
> if someone could explain that to me or give me a clue, that would be
> great!
Well, it doesn't seem to be ringing a bell with anyone, if that's what
you were hoping for, so I think you'd need to give us a little more
information to go on. htdig does normally crop words in the wordlist
to the length specified by maximum_word_length, which is 12 by default
in 3.1.5. If that's not the issue here, then you may want to give us a
bit more of a clue as to what a complete article number looks like in one
of your documents, or better yet suggest a URL or two that contain article
numbers that get cropped, so that we could visit and see for ourselves.
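
To illustrate what I mean by cropping, here is a minimal Python sketch.
It is not htdig's actual code, only a model of the effect of cutting
each word down to maximum_word_length before it goes into the word list.
The crop_word helper and the two longer article numbers are made up for
the example; the first two words are taken from your db.wordlist output.

```python
# Illustration only: this is NOT htdig's code, just a sketch of what
# cropping words to maximum_word_length would do to a word list.

MAXIMUM_WORD_LENGTH = 12  # default in htdig 3.1.5, as noted above


def crop_word(word, max_len=MAXIMUM_WORD_LENGTH):
    """Return the word as it would be stored after cropping."""
    return word[:max_len]


# The first two words appear in the quoted db.wordlist excerpt; the last
# two are hypothetical article numbers invented for this example.
for word in ["fp11994", "fp6087fsases", "fp6087fsases01", "fp6087fsases02"]:
    print(f"{word!r} -> {crop_word(word)!r}")
```

Words of 12 characters or fewer pass through unchanged, while the two
hypothetical longer ones end up as the same entry. Note that fp6087fsases
in your listing is exactly 12 characters long, which would be consistent
with cropping but doesn't prove it.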
--
Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba Phone: (204)789-3766
Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930