Title: Re: utf-8 working code... caution with existing data files.
Pleased to read your comments. Btw, go easy on queries (XPath, XUpdate) for the moment. They won't work at all over CORBA (and never will), but I'm still testing them over XML-RPC.
 
Now a question:
 
Would it be of any use to you if, when using the command-line tools, you could choose the encoding yourself when retrieving files?
 
You say you submitted the original files in something other than UTF-8. I'm suggesting that for the retrieve and export options, you could specify the desired output encoding. Would that be useful, so that you get the files back in the same encoding you put them in? (It's an easy thing to program, which is why I'm asking.)
 
Would anyone else be interested in such a feature?
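
To make the idea concrete, here is a rough sketch in Python of what an output-encoding option would do. This is not Xindice code: the function name reencode and its target parameter are made up for illustration, and the only assumption taken from this thread is that documents are stored as UTF-8.

```python
def reencode(stored: bytes, target: str = "utf-8") -> bytes:
    """Transcode stored UTF-8 bytes into the requested output encoding.

    Hypothetical helper: 'stored' is assumed to be the UTF-8 form kept by
    the database, 'target' is the encoding the user asks for on retrieval.
    """
    text = stored.decode("utf-8")
    # Raises UnicodeEncodeError if the target encoding cannot represent
    # a character in the document.
    return text.encode(target)


# A document submitted as EUC-JP and converted to UTF-8 for storage.
original = "日本語".encode("euc_jp")
stored = original.decode("euc_jp").encode("utf-8")

# Asking for EUC-JP on retrieval gives back the submitted bytes.
assert reencode(stored, "euc_jp") == original
```

One thing a real implementation would also have to deal with: if the document carries an XML declaration with an encoding attribute, that attribute has to agree with whatever encoding is written out.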
 
James
-----Original Message-----
From: Michael Westbay [mailto:[EMAIL PROTECTED]
Sent: Thu 5/9/2002 2:26 PM
To: [EMAIL PROTECTED]
Cc:
Subject: Re: utf-8 working code... caution with existing data files.

Bates-san wrote:

> I have a patch for the current Xindice CVS that supports
> reading/writing files in UTF-8 containing any Unicode characters you
> want into and out of Xindice.

I just checked out the latest CVS code, and confirmed that the latest check-in works with Japanese.  My original document was in EUC-JP encoding, and it was properly converted to UTF-8 for storage.

I've also tested retrieval with the command line tools (the retrieved file is in UTF-8 encoding), and retrieving and updating files in Japanese with YAP.  They all now work.  Very nice job.

Now that Japanese documents work, I can start really pounding on this.  Great job!

--
Michael Westbay
Work: Beacon-IT http://www.beacon-it.co.jp/
Home:           http://www.seaple.icc.ne.jp/~westbay
Commentary:     http://www.japanesebaseball.com/forum/
