Hi Andy,

Basekb dumps come from freebase dump and their data format is N-Triples RDF.
So for each basekb dump, it is uncompressed, attached a extension '.nt' and then loaded into TDB.

But the tdbloader reports the following error:

15:19:22 ERROR riot :: [line: 309035, col: 135] Illegal object: [INTEGER:5281023] org.apache.jena.riot.RiotException: [line: 309035, col: 135] Illegal object: [INTEGER:5281023] at org.apache.jena.riot.system.ErrorHandlerFactory$ErrorHandlerStd.fatal(ErrorHandlerFactory.java:136) at org.apache.jena.riot.lang.LangEngine.raiseException(LangEngine.java:163) at org.apache.jena.riot.lang.LangEngine.exceptionDirect(LangEngine.java:156)
    at org.apache.jena.riot.lang.LangEngine.exception(LangEngine.java:149)
at org.apache.jena.riot.lang.LangNTuple.checkRDFTerm(LangNTuple.java:126) at org.apache.jena.riot.lang.LangNTriples.parseOne(LangNTriples.java:78) at org.apache.jena.riot.lang.LangNTriples.runParser(LangNTriples.java:54)
    at org.apache.jena.riot.lang.LangBase.parse(LangBase.java:42)
at org.apache.jena.riot.RDFParserRegistry$ReaderRIOTLang.read(RDFParserRegistry.java:169)
    at org.apache.jena.riot.RDFDataMgr.process(RDFDataMgr.java:859)
    at org.apache.jena.riot.RDFDataMgr.parse(RDFDataMgr.java:687)
    at org.apache.jena.riot.RDFDataMgr.parse(RDFDataMgr.java:666)
    at org.apache.jena.riot.RDFDataMgr.parse(RDFDataMgr.java:654)
at com.hp.hpl.jena.tdb.store.bulkloader.BulkLoader.loadTriples$(BulkLoader.java:130) at com.hp.hpl.jena.tdb.store.bulkloader.BulkLoader.loadDefaultGraph(BulkLoader.java:87)
    at com.hp.hpl.jena.tdb.TDBLoader.loadDefaultGraph$(TDBLoader.java:260)
    at com.hp.hpl.jena.tdb.TDBLoader.loadGraph$(TDBLoader.java:243)
    at com.hp.hpl.jena.tdb.TDBLoader.loadGraph(TDBLoader.java:176)
    at com.hp.hpl.jena.tdb.TDBLoader.load(TDBLoader.java:113)
    at tdb.tdbloader.loadDefaultGraph(tdbloader.java:138)
    at tdb.tdbloader.exec(tdbloader.java:108)
    at arq.cmdline.CmdMain.mainMethod(CmdMain.java:102)
    at arq.cmdline.CmdMain.mainRun(CmdMain.java:63)
    at arq.cmdline.CmdMain.mainRun(CmdMain.java:50)
    at tdb.tdbloader.main(tdbloader.java:52)

And then, I print the triple:

<http://www.neusoft.com/ontologies/2013/6/medicine#m.07_71> <http://www.neusoft.com/ontologies/2013/6/medicine#medicine.drug.pubchem> 5281023 .

It should be that tdbloader can't decide on the type of the object. There are also many triples like this.

Then I change the extension from '.nt' to '.n3' and then reload these dumps. This time tdbloader load all the dumps into the TDB store without reporting any errors.
And I can query all triples from the TDB store.

But I don't know why tdbloader don't check these errors any more when the extension is '.n3'.

Thank you very much.

Deyan Chen
---------------------------------------------------------------------------------------------------
Confidentiality Notice: The information contained in this e-mail and any accompanying attachment(s) is intended only for the use of the intended recipient and may be confidential and/or privileged of Neusoft Corporation, its subsidiaries and/or its affiliates. If any reader of this communication is not the intended recipient, unauthorized use, forwarding, printing, storing, disclosure or copying is strictly prohibited, and may be unlawful.If you have received this communication in error,please immediately notify the sender by return e-mail, and delete the original message and all copies from your system. Thank you. ---------------------------------------------------------------------------------------------------

Reply via email to