Re: [Dspace-tech] structure import problem with French and German accented characters

2008-12-15 Thread Walker, David
Hi Andrew,

I can't speak to the general question.  But on this point . . .

   ü

. . . is an HTML character entity reference, and is not recognized within XML 
documents in general.  To use this, you would need, as the XML parser here is 
saying, a supporting DTD entity reference declaration.

Easier, I think, maybe just to reference it using it's numeric value:

   ü

--Dave


==
David Walker
Library Web Services Manager
California State University
http://xerxes.calstate.edu


From: Andrew Marlow [marlow.and...@googlemail.com]
Sent: Monday, December 15, 2008 2:05 PM
To: dspace-tech@lists.sourceforge.net
Subject: [Dspace-tech] structure import problem with French and German accented 
characters


I have created an XML file for a structure import, based on a CSV file I have 
of journal titles. I am converting the CSV to XML using a bit of perl. 
Everything is fine until I introduce journal titles that contain accented 
characters. For example, one title contains the German word 'fur' with u 
umlaut. I get a UTF-8 error if I leave it like that. So in my XML file I change 
this for ü but it doesn't work. It says 'the entity uuml was referenced 
but not declared'. What is going wromg please? How may titles with accented 
characters be imported?
--
Regards,

Andrew M.

--
SF.Net email is Sponsored by MIX09, March 18-20, 2009 in Las Vegas, Nevada.
The future of the web can't happen without you.  Join us at MIX09 to help
pave the way to the Next Web now. Learn more and register at
http://ad.doubleclick.net/clk;208669438;13503038;i?http://2009.visitmix.com/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


[Dspace-tech] structure import problem with French and German accented characters

2008-12-15 Thread Andrew Marlow
I have created an XML file for a structure import, based on a CSV file I
have of journal titles. I am converting the CSV to XML using a bit of perl.
Everything is fine until I introduce journal titles that contain accented
characters. For example, one title contains the German word 'fur' with u
umlaut. I get a UTF-8 error if I leave it like that. So in my XML file I
change this for ü but it doesn't work. It says 'the entity uuml was
referenced but not declared'. What is going wromg please? How may titles
with accented characters be imported?
-- 
Regards,

Andrew M.
--
SF.Net email is Sponsored by MIX09, March 18-20, 2009 in Las Vegas, Nevada.
The future of the web can't happen without you.  Join us at MIX09 to help
pave the way to the Next Web now. Learn more and register at
http://ad.doubleclick.net/clk;208669438;13503038;i?http://2009.visitmix.com/___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech