On Tuesday, February 26, 2002, at 11:50 AM, James Bates wrote:
Nothing (i.e. loses them). If XML:DB's one is more complete we can of course use their serializer. I'll have a look at it.
What's the parser going to do with them? I wouldn't worry about the XML:DB thing, your implementation is probably better.
James
-----Original Message-----
From: Kimbro Staken [mailto:[EMAIL PROTECTED]
Sent: Tue 2/26/2002 7:39 PM
To: [EMAIL PROTECTED]
Cc:
Subject: Re: Command-line tools reading in UTF-8
On Tuesday, February 26, 2002, at 10:37 AM, James Bates wrote:
> I have just submitted the changes. A new class, StringSerializer was added,
> but it is nothing more than a SAX content and lexical handler that
> serializes the XML it receives. I'm pretty sure such a class already
> exists somewhere (maybe not in Xindice), and that, at any rate, it might
> be more
There's one that's used by the XML:DB API to implement setContentAsSAX. It's part of the XML:DB source tree though.
> suitable in some "tools" package than where I put it for the moment...
> Any views on this?
>
> The standard entities (&lt; &gt; &quot; and &apos; I think) and comments
> work now though.
What does this do with DTDs?
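
For reference, a minimal sketch of the kind of handler described above: a SAX content and lexical handler that writes back the XML it receives, re-escaping the standard entities and keeping comments. This is not the StringSerializer that was committed; the class name SketchSerializer and the serialize() helper are hypothetical, and the DTD events are deliberately ignored, so any DTD is lost.

```java
import java.io.StringReader;
import javax.xml.parsers.SAXParser;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.ext.LexicalHandler;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical sketch; NOT the StringSerializer class from the Xindice tree.
class SketchSerializer extends DefaultHandler implements LexicalHandler {
    private final StringBuilder out = new StringBuilder();

    // The parser hands over decoded text, so markup characters must be re-escaped.
    private static String escape(String s, boolean inAttribute) {
        StringBuilder b = new StringBuilder();
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            switch (c) {
                case '&': b.append("&amp;"); break;
                case '<': b.append("&lt;"); break;
                case '>': b.append("&gt;"); break;
                case '"': b.append(inAttribute ? "&quot;" : "\""); break;
                default: b.append(c);
            }
        }
        return b.toString();
    }

    @Override
    public void startElement(String uri, String local, String qName, Attributes atts) {
        out.append('<').append(qName);
        for (int i = 0; i < atts.getLength(); i++) {
            out.append(' ').append(atts.getQName(i)).append("=\"")
               .append(escape(atts.getValue(i), true)).append('"');
        }
        out.append('>');
    }

    @Override
    public void endElement(String uri, String local, String qName) {
        out.append("</").append(qName).append('>');
    }

    @Override
    public void characters(char[] ch, int start, int length) {
        out.append(escape(new String(ch, start, length), false));
    }

    // LexicalHandler event: comments are written back out instead of being dropped.
    @Override
    public void comment(char[] ch, int start, int length) {
        out.append("<!--").append(ch, start, length).append("-->");
    }

    // The remaining LexicalHandler events are ignored, so a DTD is not serialized.
    @Override public void startDTD(String name, String publicId, String systemId) {}
    @Override public void endDTD() {}
    @Override public void startEntity(String name) {}
    @Override public void endEntity(String name) {}
    @Override public void startCDATA() {}
    @Override public void endCDATA() {}

    // Parses the given XML text and returns what the handler serialized.
    static String serialize(String xml) throws Exception {
        SAXParser parser = SAXParserFactory.newInstance().newSAXParser();
        SketchSerializer handler = new SketchSerializer();
        // Standard SAX2 property for registering a LexicalHandler.
        parser.setProperty("http://xml.org/sax/properties/lexical-handler", handler);
        parser.parse(new InputSource(new StringReader(xml)), handler);
        return handler.out.toString();
    }

    public static void main(String[] args) throws Exception {
        System.out.println(serialize("<cartoons>Tom &amp; Jerry<!-- cat, mouse --></cartoons>"));
    }
}
```

The sketch writes no XML declaration and assumes a parser that is not namespace-aware, so element and attribute names come from qName.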
>
> James
>
>
>> -----Original Message-----
>> From: Kimbro Staken [mailto:[EMAIL PROTECTED]
>> Sent: 26 February 2002 17:39
>> To: [EMAIL PROTECTED]
>> Subject: Re: Command-line tools reading in UTF-8
>>
>>
>> Ok.
>>
>> On Tuesday, February 26, 2002, at 09:34 AM, James Bates wrote:
>>
>>> Just a moment... I have made a solution that does work... Can I still try
>>> & commit it before you build rc2?
>>>
>>> James
>>>
>>>> -----Original Message-----
>>>> From: Kimbro Staken [mailto:[EMAIL PROTECTED]
>>>> Sent: 26 February 2002 17:22
>>>> To: [EMAIL PROTECTED]
>>>> Subject: Re: Command-line tools reading in UTF-8
>>>>
>>>>
>>>> Ok, I'll back it out and put up new rc2 builds.
>>>>
>>>> On Tuesday, February 26, 2002, at 06:38 AM, James Bates wrote:
>>>>
>>>>> In addition, comments are stripped: probably not desirable either...
>>>>> Definitely more work needed...
>>>>>
>>>>>> -----Original Message-----
>>>>>> From: James Bates
>>>>>> Sent: 26 February 2002 14:30
>>>>>> To: [EMAIL PROTECTED]
>>>>>> Subject: Command-line tools reading in UTF-8
>>>>>>
>>>>>>
>>>>>> Bad news I'm afraid:
>>>>>>
>>>>>> The patch I submitted, which uses Xerces/Xalan to read in the
>>>>>> document and send it to Xindice, breaks another aspect. This document:
>>>>>>
>>>>>>     <?xml version="1.0"?>
>>>>>>     <cartoons>Tom &amp; Jerry</cartoons>
>>>>>>
>>>>>> goes in as:
>>>>>>
>>>>>>     <?xml version="1.0"?>
>>>>>>     <cartoons>Tom  Jerry</cartoons>
>>>>>>
>>>>>> I haven't been able to find out why this goes wrong, but I
>>>>>> suspect it might be
>>>>>> a problem with Xalan... reverting to the original code for
>>>>>> AddDocument, the document is added just fine...
>>>>>>
>>>>>> Maybe you should go back to the old code until I find out
>>>>>> what's going on?
>>>>>>
>>>>>> Sorry,
>>>>>> James
>>>>>>
>>>>>
>>>>>
Kimbro Staken - http://www.kstaken.org - http://www.xmldatabases.org
Apache Xindice native XML database http://xml.apache.org/xindice
XML:DB Initiative http://www.xmldb.org
Senior Technologist (Your company name here)
