Hi,
I was wondering whether there is a table anywhere (similar to Table 1.2, "An
overview of different field types, their characteristics, and their usage", in
Lucene in Action) that lists the possible combinations of field options and
their usage.
I have put one together quickly (so it is incomplete and may contain a lot of
mistakes ;-)
Store | TermVector             | Index        | Reasonable | Usage
YES   | NO                     | NO           | 1          | URLs, telephone numbers
YES   | WITH_OFFSETS           | NO           | 1          | ?
YES   | WITH_POSITIONS         | NO           | 1          | ?
YES   | WITH_POSITIONS_OFFSETS | NO           | 1          | ?
YES   | YES                    | NO           | 1          | ?
NO    | NO                     | NO           | 0          | DO NOT USE
NO    | WITH_OFFSETS           | NO           | ?          | ?
NO    | WITH_POSITIONS         | NO           | ?          | ?
NO    | WITH_POSITIONS_OFFSETS | NO           | ?          | ?
NO    | YES                    | NO           | ?          | ?
YES   | *                      | NO_NORMS     | 0          | no Analyzer, not store
NO    | NO                     | NO_NORMS     | 1          |
NO    | WITH_OFFSETS           | NO_NORMS     | 1          |
NO    | WITH_POSITIONS         | NO_NORMS     | 1          |
NO    | WITH_POSITIONS_OFFSETS | NO_NORMS     | 1          |
NO    | YES                    | NO_NORMS     | 1          |
YES   | NO                     | TOKENIZED    | 1          | Doc content
YES   | WITH_OFFSETS           | TOKENIZED    | 1          |
YES   | WITH_POSITIONS         | TOKENIZED    | 1          |
YES   | WITH_POSITIONS_OFFSETS | TOKENIZED    | 1          |
YES   | YES                    | TOKENIZED    | 1          |
NO    | NO                     | TOKENIZED    | 1          |
NO    | WITH_OFFSETS           | TOKENIZED    | ?          |
NO    | WITH_POSITIONS         | TOKENIZED    | ?          |
NO    | WITH_POSITIONS_OFFSETS | TOKENIZED    | ?          |
NO    | YES                    | TOKENIZED    | ?          |
YES   | *                      | UN_TOKENIZED | 0          | no Analyzer, not store
NO    | NO                     | UN_TOKENIZED | 1          |
NO    | WITH_OFFSETS           | UN_TOKENIZED | 1          |
NO    | WITH_POSITIONS         | UN_TOKENIZED | 1          |
NO    | WITH_POSITIONS_OFFSETS | UN_TOKENIZED | 1          |
NO    | YES                    | UN_TOKENIZED | 1          |
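To avoid typing such a table by hand, the combinations could also be generated. The following is only a minimal sketch: the three enums are hypothetical local stand-ins for Lucene's Field.Store, Field.Index and Field.TermVector constants (so it runs without Lucene), and the two rules in isAccepted are an assumption based on my reading of the Field constructor, namely that a field must be stored or indexed, and that term vectors require an indexed field. They should be checked against the Lucene version in use.

```java
import java.util.ArrayList;
import java.util.List;

final class FieldCombinations {
    // Hypothetical stand-ins for Field.Store, Field.Index and Field.TermVector.
    enum Store { YES, NO, COMPRESS }
    enum Index { NO, NO_NORMS, TOKENIZED, UN_TOKENIZED }
    enum TermVector { NO, YES, WITH_POSITIONS, WITH_OFFSETS, WITH_POSITIONS_OFFSETS }

    // Assumed rules (not verified against a specific Lucene release):
    // 1. a field that is neither stored nor indexed makes no sense;
    // 2. term vectors can only be kept for an indexed field.
    static boolean isAccepted(Store store, Index index, TermVector termVector) {
        if (store == Store.NO && index == Index.NO) {
            return false;
        }
        if (index == Index.NO && termVector != TermVector.NO) {
            return false;
        }
        return true;
    }

    // Builds one table row per combination of the three options.
    static List<String> rows() {
        List<String> rows = new ArrayList<String>();
        for (Store store : Store.values()) {
            for (Index index : Index.values()) {
                for (TermVector termVector : TermVector.values()) {
                    String verdict = isAccepted(store, index, termVector) ? "accepted" : "rejected";
                    rows.add(String.format("%-8s | %-22s | %-12s | %s",
                            store, termVector, index, verdict));
                }
            }
        }
        return rows;
    }

    public static void main(String[] args) {
        System.out.println(String.format("%-8s | %-22s | %-12s | %s",
                "Store", "TermVector", "Index", "Verdict"));
        for (String row : rows()) {
            System.out.println(row);
        }
    }
}
```

If the two assumed rules hold, every row that combines Index NO with a term vector setting would be rejected when the field is constructed, which would settle several of the "?" entries above.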
I think COMPRESS applies analogously to YES, but is meant for binary values and
long documents (x words). (What is the order of magnitude of x?)
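One way to get a feeling for x might be to measure it. The sketch below assumes that COMPRESS does something comparable to a java.util.zip deflate of the stored value, which I have not confirmed; CompressThreshold, deflatedSize and the sample strings are made up for illustration. It only compares the raw and the deflated size of a short and a long value.

```java
import java.nio.charset.StandardCharsets;
import java.util.zip.Deflater;

final class CompressThreshold {
    // Returns the size in bytes of the deflated UTF-8 form of the text.
    static int deflatedSize(String text) {
        byte[] input = text.getBytes(StandardCharsets.UTF_8);
        Deflater deflater = new Deflater(Deflater.BEST_COMPRESSION);
        deflater.setInput(input);
        deflater.finish();
        byte[] buffer = new byte[1024];
        int total = 0;
        // deflate() returns how many bytes it wrote into the buffer.
        while (!deflater.finished()) {
            total += deflater.deflate(buffer);
        }
        deflater.end();
        return total;
    }

    // Builds a long, repetitive sample value (hypothetical test data).
    static String longSample() {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < 1000; i++) {
            sb.append("lucene in action ");
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        String shortValue = "Lucene";
        String longValue = longSample();
        System.out.println("short: raw=" + shortValue.length()
                + " deflated=" + deflatedSize(shortValue));
        System.out.println("long:  raw=" + longValue.length()
                + " deflated=" + deflatedSize(longValue));
    }
}
```

Very short values come out larger than they went in because of the fixed stream overhead, so compression would only pay off above some length that depends on the data. Real documents compress far less well than this repetitive sample.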
Is something similar available anywhere?
Cheers
Christoph