Hi All,

I've been trying to import a batch of UTF-8 encoded text file (metadata) where 
there are some Traditional Chinese characters in the metadata, and the importer 
spelt out the following error message:



Error Message Dump screen:

=============================
[Fatal Error] dublin_core.xml:8:65: The element type "???" must be terminated by
 the matching end-tag "</???>".
org.xml.sax.SAXParseException: The element type "???" must be terminated by the
matching end-tag "</???>".
        at org.apache.xerces.parsers.DOMParser.parse(Unknown Source)
        at org.apache.xerces.jaxp.DocumentBuilderImpl.parse(Unknown Source)
        at javax.xml.parsers.DocumentBuilder.parse(Unknown Source)
        at org.dspace.app.itemimport.ItemImport.loadXML(ItemImport.java:1737)
        at org.dspace.app.itemimport.ItemImport.loadDublinCore(ItemImport.java:9
37)
        at org.dspace.app.itemimport.ItemImport.loadMetadata(ItemImport.java:922
)
        at org.dspace.app.itemimport.ItemImport.addItem(ItemImport.java:745)
        at org.dspace.app.itemimport.ItemImport.addItems(ItemImport.java:617)
        at org.dspace.app.itemimport.ItemImport.main(ItemImport.java:498)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at sun.reflect.NativeMethodAccessorImpl.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source)
        at java.lang.reflect.Method.invoke(Unknown Source)
        at org.dspace.app.launcher.ScriptLauncher.main(ScriptLauncher.java:183)
org.xml.sax.SAXParseException: The element type "???" must be terminated by the
matching end-tag "</???>".


I am using DSpace on Win7 32 bit box and the content of one of the metadata I 
am importing is like:

<?xml version="1.0" encoding="UTF-8" standalone="no" ?>
<dublin_core>
<dcvalue element="identifier" qualifier="callnumber">PL2447 .D44 2003</dcvalue>
<dcvalue element="qualification" qualifier="degree">M.Phil.</dcvalue>
<dcvalue element="faculty" qualifier="division">Chinese Language and 
Literature</dcvalue>
<dcvalue element="date" qualifier="imprint">2003</dcvalue>
<dcvalue element="identifier" qualifier="recordnum">b24080731</dcvalue>
<dcvalue element="title" qualifier="none">上博楚竹書<性情論>語法研究 / 鄧佩玲</dcvalue>
<dcvalue element="identifier" qualifier="barcode">004069801</dcvalue>
<dcvalue element="contributor" qualifier="author">Deng, Peiling</dcvalue>
<dcvalue element="location" qualifier="none">ult  </dcvalue>
<dcvalue element="language" qualifier="none">chi</dcvalue>
</dublin_core>



The Importer processed all English text ones (metadata files) and stopped at 
this one with Chinese characters in it! I haven't a clue what went wrong! Could 
anyone shed some light please?

Thank you.

-brian.t
University Library System

CUHK
------------------------------------------------------------------------------
LogMeIn Rescue: Anywhere, Anytime Remote support for IT. Free Trial
Remotely access PCs and mobile devices and provide instant support
Improve your efficiency, and focus on delivering more value-add services
Discover what IT Professionals Know. Rescue delivers
http://p.sf.net/sfu/logmein_12329d2d
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

Reply via email to