On Tuesday, February 26, 2002, at 11:50 AM, James Bates wrote:

Nothing (i.e. looses them). If XML:DB's one is more complete we can of course use their serializer. I'll have a look at it.

What's the parser going to do with them? I wouldn't worry about the XML:DB thing, your implementation is probably better.


James

-----Original Message-----
From: Kimbro Staken [mailto:[EMAIL PROTECTED]
Sent: Tue 2/26/2002 7:39 PM
To: [EMAIL PROTECTED]
Cc:
Subject: Re: Command-line tools reading in UTF-8


On Tuesday, February 26, 2002, at 10:37 AM, James Bates wrote:

> I have just submitted the changes. A new class, StringSerializer was
> added,
> but it is nothing more than a SAX content and lexical handler that
> serializes the XML it receives. I'm pretty sure such a class already
> exists somewhere (maybe not in Xindice), and that, at any rate, it might
> be more
>

There's one that's used by the XML:DB API to implement setContentAsSAX. It'
s part of the XML:DB source tree though.


> suitable in some "tools" package than where I put it for the moment...
> Any views on this?
>
> The standard entities (< > " and ' I think) and comments
> work now though.

What does this do with DTDs?

>
> James
>
>
>> -----Original Message-----
>> From: Kimbro Staken [mailto:[EMAIL PROTECTED]
>> Sent: 26 February 2002 17:39
>> To: [EMAIL PROTECTED]
>> Subject: Re: Command-line tools reading in UTF-8
>>
>>
>> Ok.
>>
>> On Tuesday, February 26, 2002, at 09:34 AM, James Bates wrote:
>>
>>> Just a moment... I have made a solution that does work...
>> Can I still try
>>> & commit it before you build rc2?
>>>
>>> James
>>>
>>>> -----Original Message-----
>>>> From: Kimbro Staken [mailto:[EMAIL PROTECTED]
>>>> Sent: 26 February 2002 17:22
>>>> To: [EMAIL PROTECTED]
>>>> Subject: Re: Command-line tools reading in UTF-8
>>>>
>>>>
>>>> Ok,� I'll back it out and put up new rc2 builds.
>>>>
>>>> On Tuesday, February 26, 2002, at 06:38 AM, James Bates wrote:
>>>>
>>>>> In addition, comments are stripped: probably not
>> desirable either...
>>>>> Definately more work needed...
>>>>>
>>>>>> -----Original Message-----
>>>>>> From: James Bates
>>>>>> Sent: 26 February 2002 14:30
>>>>>> To: [EMAIL PROTECTED]
>>>>>> Subject: Command-line tools reading in UTF-8
>>>>>>
>>>>>>
>>>>>> Bad news I'm afraid:
>>>>>>
>>>>>> The patch I submitted, which uses Xerces/Xalan to read in the
>>>>>> document and send it to Xindice, breaks another aspect.
>>>> This document:
>>>>>>
>>>>>>���� <?xml version="1.0"?>
>>>>>>���� <cartoons>Tom &amp; Jerry</cartoons>
>>>>>>
>>>>>> goes in as:
>>>>>>
>>>>>>���� <?xml version="1.0"?>
>>>>>>���� <cartoons>Tom� Jerry</cartoons>
>>>>>>
>>>>>> I haven't been able to find out why this goes wrong, but I
>>>>>> suspect it might be
>>>>>> a problem with Xalan... reverting to the original code for
>>>>>> AddDocument, the document is added just fine...
>>>>>>
>>>>>> Maybe you should go back to the old code until I find out
>>>>>> what's going on?
>>>>>>
>>>>>> Sorry,
>>>>>> James
>>>>>>
>>>>>
>>>>>
Kimbro Staken - http://www.kstaken.org - http://www.xmldatabases.org
Apache Xindice native XML database http://xml.apache.org/xindice
XML:DB Initiative http://www.xmldb.org
Senior Technologist (Your company name here)
>>>>
>>>>
>>>
>>
>>
>




Reply via email to