Rafa Haro created STANBOL-1125:
----------------------------------
Summary: Create a lightweight EntityHub Indexing Tool for Freebase
Key: STANBOL-1125
URL: https://issues.apache.org/jira/browse/STANBOL-1125
Project: Stanbol
Issue Type: Improvement
Components: Entityhub
Reporter: Rafa Haro
Due to the enormous size of the dumps, current Freebase indexing tool in
Stanbol can't barely work in machines without several gigas of RAM and/or SSD
disks. JenaTDB importer has been identified as the bootle neck of the indexing
process. To use an RDF database is mandatory in order to, for instance, use
LDPath programs at indexing time.
The idea is to develop a lightweight indexing tool that stream data from the
dumps and push it directly to Solr. Despite losing some functionality, it is
possible for any user to generate Freebase EntityHub indexes from any dump.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira