Re: Lucene does NOT use UTF-8.

2005-08-28 Thread Marvin Humphrey
Ken Krugler sent a reply to the user list. In an effort to keep all the developers informed, I'm sending my reply to the developer list and including his entire original post below my sig. Ken writes... > Since a null in the > middle of a string is rare, as is a character outside of the BMP, a

RE: Lucene does NOT use UTF-8.

2005-08-28 Thread Robert Engels
Sorry, but I think you are barking up the wrong tree... and your tone is quite bizarre. My personal OPINION is that your "script" language is an abomination, and anyone that develops in it is clearly hurting the advancement of all software - but that is another story, and doesn't matter much to the

Re: Lucene does NOT use UTF-8.

2005-08-28 Thread Otis Gospodnetic
I'm not familiar with UTF-8 enough to follow the details of this discussion. I hope other Lucene developers are, so we can resolve this issue anyone raising a hand? Otis --- Marvin Humphrey <[EMAIL PROTECTED]> wrote: > Ken Krugler sent a reply to the user list. In an effort to keep all > t

Re: Lucene does NOT use UTF-8.

2005-08-28 Thread Ken Krugler
Hi Marvin, Thanks for the detailed response. After spending a bit more time in the code, I think you're right - all strings seem to be funnelled through IndexOutput. The remaining issue is dealing with old-format indexes. I'm going to take this off-list now, since I'm guessing most list rea

Re: Lucene does NOT use UTF-8.

2005-08-28 Thread Marvin Humphrey
Hello, Robert... On Aug 28, 2005, at 7:50 PM, Robert Engels wrote: Sorry, but I think you are barking up the wrong tree... and your tone is quite bizarre. My personal OPINION is that your "script" language is an abomination, and anyone that develops in it is clearly hurting the advancement