I started to use the new term vector support. Much more efficient than temporarily reindexing documents in a RAMDirectory in order to get their
terms :-)
However, I think it would be more reasonable if the getTerms() method would
return Terms instead of Strings, since this is what at least I need in the
subsequent analysis process. Off course itīs easy to construct a term given the
field and the text. However outside the package only the public constructor of Term can be called, which does the field.intern(). I donīt know how expensive the call to intern() really is. Maybe my worries are irrelevant.
Christoph
--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]