Hi Andrew,

I can't speak to the general question.  But on this point . . .

   ü

. . . is an HTML character entity reference, and is not recognized within XML 
documents in general.  To use this, you would need, as the XML parser here is 
saying, a supporting DTD entity reference declaration.

Easier, I think, maybe just to reference it using it's numeric value:

   ü

--Dave


==================
David Walker
Library Web Services Manager
California State University
http://xerxes.calstate.edu


From: Andrew Marlow [[email protected]]
Sent: Monday, December 15, 2008 2:05 PM
To: [email protected]
Subject: [Dspace-tech] structure import problem with French and German accented 
characters


I have created an XML file for a structure import, based on a CSV file I have 
of journal titles. I am converting the CSV to XML using a bit of perl. 
Everything is fine until I introduce journal titles that contain accented 
characters. For example, one title contains the German word 'fur' with u 
umlaut. I get a UTF-8 error if I leave it like that. So in my XML file I change 
this for ü but it doesn't work. It says 'the entity uuml was referenced 
but not declared'. What is going wromg please? How may titles with accented 
characters be imported?
--
Regards,

Andrew M.

------------------------------------------------------------------------------
SF.Net email is Sponsored by MIX09, March 18-20, 2009 in Las Vegas, Nevada.
The future of the web can't happen without you.  Join us at MIX09 to help
pave the way to the Next Web now. Learn more and register at
http://ad.doubleclick.net/clk;208669438;13503038;i?http://2009.visitmix.com/
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech

Reply via email to