Hi,
   I have run into trouble reading many DBpedia files with Jena, apparently 
because of errors in the files themselves. The two main issues I found are 
listed below.

1. QName parsing error - in the RDF file for Italy, at line 3812 column 8, 
there is the following element named "<p:". The prefix p is declared only on 
the element itself, and the local name after the colon is empty.

<p: xmlns:p="http://dbpedia.org/property/nickname%28s%29"; xml:lang="en">The 
Boot; The Belpaese</p:>

http://dbpedia.org/resource/Italy(line 3812 column 8): Element or attribute do not match QName production: QName::=(NCName':')?NCName.
Error in reading model at URI:http://dbpedia.org/resource/Italy
com.hp.hpl.jena.shared.JenaException: org.xml.sax.SAXParseException; systemId: http://dbpedia.org/resource/Italy; lineNumber: 3812; columnNumber: 8; Element or attribute do not match QName production: QName::=(NCName':')?NCName.
    at com.hp.hpl.jena.rdf.model.impl.RDFDefaultErrorHandler.fatalError(RDFDefaultErrorHandler.java:45)
    at com.hp.hpl.jena.rdf.arp.impl.ARPSaxErrorHandler.fatalError(ARPSaxErrorHandler.java:35)
    at com.hp.hpl.jena.rdf.arp.impl.XMLHandler.warning(XMLHandler.java:225)
    at com.hp.hpl.jena.rdf.arp.impl.XMLHandler.fatalError(XMLHandler.java:255)

I found many such files and got the same error when trying to process them. 
Another example is http://dbpedia.org/resource/California . Is there any 
workaround for these errors when processing the files with a tool like Jena? 
As far as I can tell, this is an error in the RDF files served by DBpedia.
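
To check that the problem is in the XML itself rather than in Jena, the 
element can be fed to a plain namespace-aware XML parser. The following is a 
minimal sketch, assuming the JDK's built-in JAXP parser; the class name 
QNameRepro and the helper parseError are made up for illustration, and the 
second input (p:nickname) is a hypothetical well-formed variant, not something 
taken from DBpedia.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.xml.sax.InputSource;
import org.xml.sax.SAXParseException;
import org.xml.sax.helpers.DefaultHandler;

public class QNameRepro {

    /** Returns the parser's error message, or null if the XML parses cleanly. */
    static String parseError(String xml) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        // QName checks only apply when namespace processing is switched on.
        factory.setNamespaceAware(true);
        DocumentBuilder builder = factory.newDocumentBuilder();
        // DefaultHandler rethrows fatal errors instead of printing to stderr.
        builder.setErrorHandler(new DefaultHandler());
        try {
            builder.parse(new InputSource(new StringReader(xml)));
            return null;
        } catch (SAXParseException e) {
            return e.getMessage();
        }
    }

    public static void main(String[] args) throws Exception {
        String ns = "http://dbpedia.org/property/nickname%28s%29";

        // Element name as found in the Italy file: prefix "p", empty local name.
        String broken = "<p: xmlns:p=\"" + ns + "\" xml:lang=\"en\">"
                + "The Boot; The Belpaese</p:>";

        // Hypothetical variant with a non-empty local name, for comparison.
        String fixed = "<p:nickname xmlns:p=\"" + ns + "\" xml:lang=\"en\">"
                + "The Boot; The Belpaese</p:nickname>";

        System.out.println("broken: " + parseError(broken));
        System.out.println("fixed:  " + parseError(fixed));
    }
}
```

If the first input is rejected and the second is accepted, the element name 
"p:" alone is enough to trigger the error, independently of Jena.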


2. Premature end of file - for example, for the resource 
http://dbpedia.org/resource/Karr%C3%B6sten. This may be caused by missing 
closing tags. The exact error I get when trying to parse the file is:

ERROR [main] (RDFDefaultErrorHandler.java:44) - Premature end of file.
Error in reading model at URI:http://dbpedia.org/resource/Karr%C3%B6sten
com.hp.hpl.jena.shared.JenaException: org.xml.sax.SAXParseException; Premature end of file.
    at com.hp.hpl.jena.rdf.model.impl.RDFDefaultErrorHandler.fatalError(RDFDefaultErrorHandler.java:45)
    at com.hp.hpl.jena.rdf.arp.impl.ARPSaxErrorHandler.fatalError(ARPSaxErrorHandler.java:35)
    at com.hp.hpl.jena.rdf.arp.impl.XMLHandler.warning(XMLHandler.java:225)
    at com.hp.hpl.jena.rdf.arp.impl.XMLHandler.fatalError(XMLHandler.java:255)



What can we do about these errors in DBpedia? Is there any workaround? My main 
concern is issue 1, where the file is otherwise in good shape except for a 
single QName error. Any suggestions?

Thank you.