I'd like to use a large dbpedia dataset with entityHub in my vm with ~50G
disk space.
I created one (~12G) by following
https://github.com/apache/stanbol/tree/trunk/entityhub/indexing/dbpedia#entity-typesand
put it under the datafiles folder.
After updating the dbpedia bundle from the admin console, I am informed of
no enough disk space by the system.
It seems that another ~26G index is automatically created under my launcher
folder stanbol/indexes/default.

Now I deleted the ~12G zip file from the datafiles folder in hopes that
stanbol won't use it once ~26G index is generated.

However, when I did
curl "
http://localhost:8080/entityhub/site/dbpedia/entity?id=http://dbpedia.org/resource/Paris
",
I got the exception
"org.apache.stanbol.entityhub.servicesapi.yard.YardException: The SolrIndex
'dbpedia' for SolrYard 'dbpedia default data index' is currently not
active!"

So my question is whether I MUST keep the ~12G zip file under datafiles
folder.


Thanks,
Xiaoyan

Reply via email to