Hi,
I have encountered some trouble in many files in DBpedia while trying to
read them using Jena due to errors in the files (as I guess). Here I list two
of the major issues I found out.
1. parsing error QName - in the rdf file for Italy, in line 3812 column 8,
there is the following line mentioning "<p:" but never defines in the file and
the value after : is empty.
<p: xmlns:p="http://dbpedia.org/property/nickname%28s%29" xml:lang="en">The
Boot; The Belpaese</p:>
http://dbpedia.org/resource/Italy(line 3812 column 8): Element or attribute do
not match QName production: QName::=(NCName':')?NCName.
Error in reading model at
URI:http://dbpedia.org/resource/Italycom.hp.hpl.jena.shared.JenaException:
org.xml.sax.SAXParseException; systemId: http://dbpedia.org/resource/Italy;
lineNumber: 3812; columnNumber: 8; Element or attribute do not match QName
production: QName::=(NCName':')?NCName.
at
com.hp.hpl.jena.rdf.model.impl.RDFDefaultErrorHandler.fatalError(RDFDefaultErrorHandler.java:45)
at
com.hp.hpl.jena.rdf.arp.impl.ARPSaxErrorHandler.fatalError(ARPSaxErrorHandler.java:35)
at com.hp.hpl.jena.rdf.arp.impl.XMLHandler.warning(XMLHandler.java:225)
at com.hp.hpl.jena.rdf.arp.impl.XMLHandler.fatalError(XMLHandler.java:255)
I found many of these files and got the same error when trying to process these
rdf files. One other example is the following file,
http://dbpedia.org/resource/California . Is there any solution for this errors
in the files that we can do when we try to process using a tool like Jena? As I
think, this is an error in rdf files in dbpedia.
2. premature end of file. example for the this resource,
http://dbpedia.org/resource/Karr%C3%B6sten. This may be because of closing tag
errors. following is the exact error I get when tried to parse the file.
ERROR [main] (RDFDefaultErrorHandler.java:44) - Premature end of file.
Error in reading model at URI:http://dbpedia.org/resource/Karr%C3%B6sten
com.hp.hpl.jena.shared.JenaException: org.xml.sax.SAXParseException; Premature
end of file.
at
com.hp.hpl.jena.rdf.model.impl.RDFDefaultErrorHandler.fatalError(RDFDefaultErrorHandler.java:45)
at
com.hp.hpl.jena.rdf.arp.impl.ARPSaxErrorHandler.fatalError(ARPSaxErrorHandler.java:35)
at com.hp.hpl.jena.rdf.arp.impl.XMLHandler.warning(XMLHandler.java:225)
at com.hp.hpl.jena.rdf.arp.impl.XMLHandler.fatalError(XMLHandler.java:255)
What can we do for these errors in DBpedia? Is there any solution for these?
The major concern for me is the issue 1 which is file is already in good shape
except for one QName error. Any suggestions?
Thank you.
------------------------------------------------------------------------------
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and
threat landscape has changed and how IT managers can respond. Discussions
will include endpoint security, mobile security and the latest in malware
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion