date:20050831

Re: Lucene does NOT use UTF-8.

2005-08-31 Thread Wolfgang Hoschek

On Aug 30, 2005, at 12:47 PM, Doug Cutting wrote: Yonik Seeley wrote: I've been looking around... do you have a pointer to the source where just the suffix is converted from UTF-8? I understand the index format, but I'm not sure I understand the problem that would be posed by the prefix len

Re: Lucene does NOT use UTF-8.

2005-08-31 Thread Doug Cutting

Wolfgang Hoschek wrote: I don't know if it matters for Lucene usage. But if using CharsetEncoder/CharBuffer/ByteBuffer should turn out to be a significant problem, it's probably due to startup/init time of these methods for individually converting many small strings, not inherently due to

deleting documents from index

2005-08-31 Thread HUI . OUYANG

Hi, In order to delete the documents in the index more efficiently during the incremental indexing process, I implement the batch deleting process on the application level. First I get the internal document ids based on the query, then only delete these documents based on the internal ids whe

Re: Lucene does NOT use UTF-8.

Re: Lucene does NOT use UTF-8.

deleting documents from index

3 matches

Site Navigation

Mail list logo

Footer information