On 31/12/10 01:47, Benson Margulies wrote:
Does tdbloader eat what tdbdump produces?
/Users/benson/x/oap tdbloader2 --loc /users/benson/data/10k.tdb
/users/benson/data/10k.dump
20:45:22 -- TDB Bulk Loader Start
20:45:22 Data phase
Exception in thread "main" java.lang.NullPointerException
at
com.hp.hpl.jena.tdb.store.bulkloader2.CmdNodeTableBuilder.exec(CmdNodeTableBuilder.java:147)
at arq.cmdline.CmdMain.mainMethod(CmdMain.java:85)
at arq.cmdline.CmdMain.mainRun(CmdMain.java:47)
at arq.cmdline.CmdMain.mainRun(CmdMain.java:34)
at
com.hp.hpl.jena.tdb.store.bulkloader2.CmdNodeTableBuilder.main(CmdNodeTableBuilder.java:70)
What's in 10k.dump? If it's the output of tdbdump, then that's N-Quads [*].
The loaders use file extension to determine syntax ".dump" is not a
known extension and the default is RDF/XML (it's the only standard).
So if it is the output of tdbdump, put in in .nq or .nq.gz.
Andy
[*]
There is a bug in the released TDB tdbdump in escaping characters.
Detect it by looking for \ in the dump.
Fixed in SVN (for ARQ).
Workaround convert any \ to \u escapes