On Monday 29 August 2005 19:56, Ken Krugler wrote: > "Lucene writes strings as a VInt representing the length of the > string in Java chars (UTF-16 code units), followed by the character > data."
But wouldn't UTF-16 mean 2 bytes per character? That doesn't seem to be the case. Regards Daniel -- http://www.danielnaber.de --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]