Re: Lucene does NOT use UTF-8.

2005-08-28 Thread Marvin Humphrey
Ken Krugler sent a reply to the user list. In an effort to keep all the developers informed, I'm sending my reply to the developer list and including his entire original post below my sig. Ken writes... Since a null in the middle of a string is rare, as is a character outside of the BMP, a

Re: Lucene does NOT use UTF-8.

2005-08-28 Thread Ken Krugler
Hi Marvin, Thanks for the detailed response. After spending a bit more time in the code, I think you're right - all strings seem to be funnelled through IndexOutput. The remaining issue is dealing with old-format indexes. I'm going to take this off-list now, since I'm guessing most list

Re: Lucene does NOT use UTF-8.

2005-08-28 Thread Marvin Humphrey
Hello, Robert... On Aug 28, 2005, at 7:50 PM, Robert Engels wrote: Sorry, but I think you are barking up the wrong tree... and your tone is quite bizarre. My personal OPINION is that your script language is an abomination, and anyone that develops in it is clearly hurting the advancement