On Wed, Sep 06, 2000 at 07:07:42PM +0100, Nick Vincent wrote:
> Hi,
> 
> I have a problem with special characters disappearing from my XML when I run
> it through SabloTron.  Consider the cases of two characters I know and love,
> the UK pound (�) and the single quote, or acute (').  Here is a run that
> illustrates this:
> 
...snip...
> <?xml version="1.0" standalone="no"?>
> <!DOCTYPE moreovernews SYSTEM
> "http://p.moreover.com/xml_dtds/moreovernews.dtd">
> 
> <allproducts>
>       <quid>&pound;</quid>
>       <amp>&amp;</amp>
>       <acute>&acute;</acute>
> </allproducts>
...snip...

I think &pound; and &acute; are undefined in XML, unlike in HTML.  If
you want to use them, you must specify so somewhere in your DTD - you
did not.  I did not try if it works if you do (not exactly knowing how
to right now), but you could try to use numeric values instead:

&pound; is &#163;
&acute; is &#180;

This works in your example (outputs ugly unreadable UTF-8 unicode but
this can be changed with xml:output or postprocessing).

-- 
Guenter Radestock, Universitaetsbibliothek Karlsruhe
[EMAIL PROTECTED]
http://www.ubka.uni-karlsruhe.de/~guenter

Reply via email to