Thanks for the tips. I was definitely not reading in the document using the
same encoding that was used to create it. I am now able to read in Euro
signs when I get the encoding right.

The only outstanding issue is that line separators are being interpreted as
spaces. For example, if there is a block of XML that looks like this:

<tagx>Line 1.
Line 2.
Line 3.</tagx>

It is being read in as:

Line1. Line 2. Line 3.

instead of:

Line 1.
Line 2.
Line 3.

as I want it to. I have to assume it is an encoding issue because the line
separators used to be retained after the XML was parsed.

Once again, thank you for the help.

----- Original Message -----
From: "Joseph Kesselman" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: Tuesday, November 19, 2002 11:05 AM
Subject: Re: Parsing XML Containing Euro Sign


> Make sure the encoding declared in your document considers the Euro symbol
> an acceptable character. If it doesn't, (a) change encodings to one that
> does, or (b) use a numeric character escape or (c) if you're in UTF-8, use
> the multibyte sequence which represents that symbol or (d) define an
> entity which maps to (b) or (c), and reference that entity name.
>
> ______________________________________
> Joe Kesselman  / IBM Research
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
>


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to