On Aug 30, 2005, at 12:47 PM, Doug Cutting wrote:
Yonik Seeley wrote:
I've been looking around... do you have a pointer to the source
where just the suffix is converted from UTF-8?
I understand the index format, but I'm not sure I understand the
problem that would be posed by the prefix len
Wolfgang Hoschek wrote:
I don't know if it matters for Lucene usage. But if using
CharsetEncoder/CharBuffer/ByteBuffer should turn out to be a
significant problem, it's probably due to startup/init time of these
methods for individually converting many small strings, not inherently
due to
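For illustration, here is a minimal sketch of the per-string setup cost mentioned above, assuming the cost comes from creating a new encoder for every small string. The class name EncoderReuse and its encode method are hypothetical and are not part of Lucene; only the java.nio.charset calls are real JDK API. It creates one CharsetEncoder and reuses it for every string:

```java
import java.nio.ByteBuffer;
import java.nio.CharBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.CharsetEncoder;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not Lucene code: one encoder reused for many small strings.
public class EncoderReuse {
    // Created once; a CharsetEncoder is stateful and not thread-safe,
    // so each thread would need its own instance.
    private final CharsetEncoder encoder = StandardCharsets.UTF_8.newEncoder();

    /** Encodes s to UTF-8 bytes using the shared encoder. */
    public byte[] encode(String s) {
        try {
            // encode(CharBuffer) resets the encoder, encodes the input, and flushes.
            ByteBuffer out = encoder.encode(CharBuffer.wrap(s));
            byte[] bytes = new byte[out.remaining()];
            out.get(bytes);
            return bytes;
        } catch (CharacterCodingException e) {
            // Raised for malformed input such as an unpaired surrogate.
            throw new IllegalArgumentException("cannot encode: " + s, e);
        }
    }

    public static void main(String[] args) {
        EncoderReuse reuse = new EncoderReuse();
        for (String term : new String[] {"lucene", "index", "caf\u00e9"}) {
            System.out.println(term + " -> " + reuse.encode(term).length + " bytes");
        }
    }
}
```

Whether this removes a measurable overhead would have to be checked with a benchmark on the actual term data; the sketch only shows where the setup cost can be moved out of the per-string path.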
Hi,
To delete documents from the index more efficiently during incremental
indexing, I implemented batch deletion at the application level. First I
get the internal document ids of the documents that match the query,
then only delete these documents based on the internal ids whe