Author: Zenon Panoussis
Email: [EMAIL PROTECTED]
Message:

indexer.conf says:

# Word lengths. You may change default length range of words
# stored in database. By default, words with the length in the
# range from 1 to 32 are stored. Note that setting MaxWordLength more
# than 32 will not work as expected.
#
#MinWordLength 1
#MaxWordLength 32

But what happens if a word is longer than 32 characters? Is it
truncated, or is it not indexed at all? The difference is rather
important when indexing semi-binary newsgroups, where you have some
text and some MIME-encoded data, and you want to index the text but
not the garbage. If "words" longer than 32 characters are dropped
completely, then the text gets indexed and all the encoded binary
stuff gets dropped. But if long words are truncated, the database
would get filled with "words" like

M1TE&.#EAV0&&`??_`/___RDI*3DY.5)
M2MZ]M=ZEE*6,A-ZMG(1:2F-22N><>Z5
MC-:,6L9S.81C2K5S0N>M>Y1K2L:,6MZ

etc. 
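
To make the two possibilities concrete, here is a small Python sketch of what I mean. It is not the indexer's actual code: the function names index_drop and index_truncate are made up for illustration, the tokenizer is a plain whitespace split, and which of the two behaviours the real indexer has is exactly what I am asking.

```python
# Illustration only: two hypothetical ways an indexer could treat words
# longer than MaxWordLength. Neither is claimed to be the real behaviour.

MAX_WORD_LENGTH = 32  # mirrors the default quoted from indexer.conf


def index_drop(words, max_len=MAX_WORD_LENGTH):
    """Hypothetical behaviour A: over-long words are skipped entirely."""
    return [w for w in words if len(w) <= max_len]


def index_truncate(words, max_len=MAX_WORD_LENGTH):
    """Hypothetical behaviour B: over-long words are cut down to max_len."""
    return [w[:max_len] for w in words]


# A made-up posting: two real words plus one 61-character encoded line.
posting = "some text M" + "x" * 60
words = posting.split()  # simplistic whitespace tokenizer (assumption)

dropped = index_drop(words)
truncated = index_truncate(words)

print(dropped)         # only the two real words survive
print(len(truncated))  # the encoded line is still there, cut to 32 chars
```

With behaviour A the encoded line never reaches the database; with behaviour B every encoded line leaves a 32-character junk entry behind.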

Z


Reply: <http://search.mnogo.ru/board/message.php?id=1637>
