Thank you, i will check this out. Also, what should i get if i try to print the XMLCh variable?
regards Jinesh K J On Nov 28, 2007 8:45 PM, Jesse Pelton <[EMAIL PROTECTED]> wrote: > The problem is probably in the transcoding. XMLString::transcode() > transcodes to whatever native code page your machine is set up with. Unless > that code page allows zwj and zwnj to be represented, your transcoding > results will not be what you expect. You should transcode to an encoding > that can represent any characters you can get (like Xerces' internal UTF-16 > encoding). See XMLTransService. > > ________________________________ > > From: jinesh kj [mailto:[EMAIL PROTECTED] > Sent: Wednesday, November 28, 2007 9:56 AM > To: [email protected] > Subject: Re: reg:[reading data with ZWJ and ZWNJ] > > > hi, > > I actually need the whole text with the zwj. My code i am attaching. Only > the section which does interaction with xml file. Hope its enough. My code > is little big, so it may take a little time for you to understand i havent > commented it properly. If you need explanation on any part please let me > know. > > cheers > > Jinesh K J > > > On Nov 28, 2007 5:43 PM, Alberto Massari <[EMAIL PROTECTED]> wrote: > > > The file you attached is correct, and the same modified DOMPrint > that I > used before return the ZWJ characters in the content of > getTextContent. > Could you show us the code you are using to read the file? > > > Alberto > > jinesh kj wrote: > > hi, > > > > > I dumped using mysql -X command which will give me output as xml > file. > > I dont know whether there is any problem with my xml files. Is > there > > any specific notation to represent the ZWJ and ZWNJ in xml files? > > > > I am attaching an xml file i have. > > > > Thank you for your help, and if you have a better idea what to do > with > > the xml file when i get characters like these, or any links to > those > > details, please point me. > > > > regards > > > > Jinesh K J > > > > On Nov 28, 2007 4:46 PM, Alberto Massari <[EMAIL PROTECTED] > > > <mailto:[EMAIL PROTECTED]>> wrote: > > > > If you can read the original file, but not when you edit it, > I > > would bet > > the reason is in the way you edit your XML files (and dump > from the > > database). What are you using? Could you attach a small > sample file? > > > > Alberto > > > > jinesh kj wrote: > > > hi, > > > > > > I tried reading the file you send. It didnt give any error, > > which means it > > > was reading perfectly. I dont know how to check in the > debugger > > and all, so > > > dont know whether it read 200d or not. But if i try to > edit the > > xml file, > > > with some text data along with, it is not reading the the > text. > > Do i have to > > > do anything for it? Basically i am trying to read through > an xml > > file, which > > > is a dump of mysql database. It have many zwj and all. I > dont > > know whether > > > it is according to specified encoding or so and all.Butsince it > > was dumped > > > from database, using the built in function, i think a > chance for > > error is > > > too low. > > > > > > I am trying to use a similar function only, in my program, > it > > returns > > > nothing when there is a ZWJ in my data. > > > > > > I hope i am clear. I am able to read xml files without ZWJ > easily. > > > > > > regards > > > > > > Jinesh K J > > > > > > On Nov 28, 2007 4:02 PM, Alberto Massari > > > <[EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]>> > wrote: > > > > > > > > >> I am attaching a sample XML that contains a U+200D > character > > between a > > >> --| and |-- pattern; I modified DOMPrint to issue a > > >> > > >> const XMLCh* > > data=doc->getDocumentElement()->getTextContent(); > > >> > > >> and in the debugger I see that data[4] is \x200D > > >> Have you checked your source XML really has that > character? > > Also, is > > >> the representation of the ZWJ character in the XML file > valid > > according > > >> to the specified encoding (e.g. in UTF-8, it's 0xE2 0x80 > 0x8D)? > > >> > > >> Alberto > > >> > > >> jinesh kj wrote: > > >> > > >>> hi, > > >>> > > >>> Actually, getTextContent is not returning any value when > there > > is a Zero > > >>> width joiner. > > >>> > > >>> cheers > > >>> > > >>> Jinesh K J > > >>> > > >>> On Nov 28, 2007 3:28 PM, Alberto Massari > > > < [EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]>> > > > >>> > > >> wrote: > > >> > > >>> > > >>>> Hi Jinesh, > > >>>> which kind of issues are you having? The text returned > by > > >>>> > > >> getTextContent > > >> > > >>>> should contain a \x200D value inside. Or have you > transcoded > > it into > > >>>> chars? > > >>>> > > >>>> Alberto > > >>>> > > >>>> jinesh kj wrote: > > >>>> > > >>>> > > >>>>> hi all, > > >>>>> > > >>>>> I was trying to read from an XML file where some data > have > > ZERO Width > > >>>>> > > >>>>> > > >>>> Joiner > > >>>> > > >>>> > > >>>>> in it. I used the getTextContent in DOMNode. I was able > to > > read the > > >>>>> > > >>>>> > > >>>> contents > > >>>> > > >>>> > > >>>>> without Zero width joiner, but there are some issues > with these > > >>>>> > > >> special > > >> > > >>>>> characters. What do i have to change? Do i have to make > any > > special > > >>>>> settings? Or do i have to use any other function > insttead? > > >>>>> > > >>>>> cheers > > >>>>> Jinesh K J > > >>>>> > > >>>>> > > >>>>> > > >>>>> > > >>> > > >>> > > >> > > > > > > > > > > > > > > > > > > > -- > > My Feelings,Expressions- > > http://logbookofanobserver.blogspot.com > > > > SMC : My computer, My language http://smc.org.in > > സ്വതന്ത്ര മലയാളം കമ്പ്യൂട്ടിങ്ങ്, എന്റെ കമ്പ്യൂട്ടറിന് എന്റെ ഭാഷ > > > > > > > -- > My Feelings,Expressions- > http://logbookofanobserver.blogspot.com > > SMC : My computer, My language http://smc.org.in > സ്വതന്ത്ര മലയാളം കമ്പ്യൂട്ടിങ്ങ്, എന്റെ കമ്പ്യൂട്ടറിന് എന്റെ ഭാഷ > -- My Feelings,Expressions- http://logbookofanobserver.blogspot.com SMC : My computer, My language http://smc.org.in സ്വതന്ത്ര മലയാളം കമ്പ്യൂട്ടിങ്ങ്, എന്റെ കമ്പ്യൂട്ടറിന് എന്റെ ഭാഷ
