In addition, comments are stripped: probably not desirable either... Definately more work needed...
> -----Original Message----- > From: James Bates > Sent: 26 February 2002 14:30 > To: [EMAIL PROTECTED] > Subject: Command-line tools reading in UTF-8 > > > Bad news I'm afraid: > > The patch I submitted, which uses Xerces/Xalan to read in the > document and send it to Xindice, breaks another aspect. This document: > > <?xml version="1.0"?> > <cartoons>Tom & Jerry</cartoons> > > goes in as: > > <?xml version="1.0"?> > <cartoons>Tom Jerry</cartoons> > > I haven't been able to find out why this goes wrong, but I > suspect it might be > a problem with Xalan... reverting to the original code for > AddDocument, the document is added just fine... > > Maybe you should go back to the old code until I find out > what's going on? > > Sorry, > James >
