Hi,
I have loaded Wikidata in Jena using tdbloader2. I noticed that some queries do not produce the expected result. Query 1: PREFIX wd: <http://www.wikidata.org/entity/> PREFIX wdt: <http://www.wikidata.org/prop/direct/> SELECT DISTINCT * WHERE { <http://www.wikidata.org/entity/Q31> wdt:P1344 ?o . } Query 2: PREFIX wd: <http://www.wikidata.org/entity/> PREFIX wdt: <http://www.wikidata.org/prop/direct/> SELECT * WHERE { ?s wdt:P1344 ?o . } Query 1 returns solutions, but query 2 returns an empty table. This is contradictory because query 1 is more selective than query 2. I guess that this is because tdbloader2 does not finished properly the index phase. However, the loading log showed no errors. My question is whether can I repeat the indexing phase with the data I currently have: -rw-r--r-- 1 ubuntu ubuntu 8388608 Feb 23 17:24 GOSP.dat -rw-r--r-- 1 ubuntu ubuntu 8388608 Feb 23 17:24 GOSP.idn -rw-r--r-- 1 ubuntu ubuntu 8388608 Feb 23 17:24 GPOS.dat -rw-r--r-- 1 ubuntu ubuntu 8388608 Feb 23 17:24 GPOS.idn -rw-r--r-- 1 ubuntu ubuntu 8388608 Feb 23 17:24 GSPO.dat -rw-r--r-- 1 ubuntu ubuntu 8388608 Feb 23 17:24 GSPO.idn -rw-r--r-- 1 ubuntu ubuntu 8388608 Feb 23 17:24 OSP.dat -rw-r--r-- 1 ubuntu ubuntu 8388608 Feb 23 17:24 OSP.idn -rw-r--r-- 1 ubuntu ubuntu 8388608 Feb 23 17:24 OSPG.dat -rw-r--r-- 1 ubuntu ubuntu 8388608 Feb 23 17:24 OSPG.idn -rw-r--r-- 1 ubuntu ubuntu 0 Feb 23 17:24 POS-txt -rw-r--r-- 1 ubuntu ubuntu 8388608 Feb 23 17:24 POS.dat -rw-r--r-- 1 ubuntu ubuntu 8388608 Feb 23 17:24 POS.idn -rw-r--r-- 1 ubuntu ubuntu 8388608 Feb 23 17:24 POSG.dat -rw-r--r-- 1 ubuntu ubuntu 8388608 Feb 23 17:24 POSG.idn -rw-r--r-- 1 ubuntu ubuntu 276379467776 Feb 23 17:47 SPO.dat -rw-r--r-- 1 ubuntu ubuntu 956301312 Feb 23 17:47 SPO.idn -rw-r--r-- 1 ubuntu ubuntu 8388608 Feb 23 17:47 SPOG.dat -rw-r--r-- 1 ubuntu ubuntu 8388608 Feb 23 17:48 SPOG.idn -rw-r--r-- 1 ubuntu ubuntu 0 Feb 23 17:48 data-quads.tmp -rw-r--r-- 1 ubuntu ubuntu 592136511840 Feb 23 18:41 data-triples.tmp -rw-r--r-- 1 ubuntu ubuntu 0 Feb 23 18:41 journal.jrnl -rw-r--r-- 1 ubuntu ubuntu 67679289344 Feb 23 18:48 node2id.dat -rw-r--r-- 1 ubuntu ubuntu 293601280 Feb 23 18:48 node2id.idn -rw-r--r-- 1 ubuntu ubuntu 136298477605 Feb 23 19:00 nodes.dat -rw-r--r-- 1 ubuntu ubuntu 8388608 Feb 23 19:00 prefix2id.dat -rw-r--r-- 1 ubuntu ubuntu 8388608 Feb 23 19:00 prefix2id.idn -rw-r--r-- 1 ubuntu ubuntu 8388608 Feb 23 19:00 prefixIdx.dat -rw-r--r-- 1 ubuntu ubuntu 8388608 Feb 23 19:00 prefixIdx.idn -rw-r--r-- 1 ubuntu ubuntu 0 Feb 23 19:00 prefixes.dat -rw-r--r-- 1 ubuntu ubuntu 1793582 Feb 23 19:00 stats.opt In the best case I can run the indexing phase over this database. Is it possible? Do you recommend me another solution to fix this database without loading the data again? Best, Daniel