Author: Zenon Panoussis
Email: [EMAIL PROTECTED]
Message:
indexer.conf says:
# Word lengths. You may change default length range of words
# stored in database. By default, words with the length in the
# range from 1 to 32 are stored. Note that setting MaxWordLength more
# than 32 will not work as expected.
#
#MinWordLength 1
#MaxWordLength 32
But what if a word is longer than 32 chars. Will it be truncated
or will not be indexed at all? The difference is rather important
when indexing semi-binary newsgroups, where you have some text
and some MIME-d code, and you want to index the text but not the
garbage. If "words" longer than 32 bytes are completely dropped,
then the text gets indexed and all the MIME-d binary stuff gets
dropped, but if long words are truncated, the database would
get filled by "words" like
M1TE&.#EAV0&&`??_`/___RDI*3DY.5)
M2MZ]M=ZEE*6,A-ZMG(1:2F-22N><>Z5
MC-:,6L9S.81C2K5S0N>M>Y1K2L:,6MZ
etc.
Z
Reply: <http://search.mnogo.ru/board/message.php?id=1637>
___________________________________________
If you want to unsubscribe send "unsubscribe general"
to [EMAIL PROTECTED]