[
https://issues.apache.org/jira/browse/HBASE-2433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12856310#action_12856310
]
Amandeep Khurana commented on HBASE-2433:
-----------------------------------------
We should be able to answer small queries with low latency by using the right
kind of indexing. Here are some papers that do it:
http://people.csail.mit.edu/tdanford/6830papers/weiss-hexastore.pdf
http://portal.acm.org/citation.cfm?id=1114857
However, they do everything in memory. We can store these indexes in HBase and
allow for fast querying. However, we cant guarantee as good performance as
these papers do. It'll still be much better than a MR job though.
Batch processes can also use these indexes for getting results out faster. This
is yet to be explored.
> RDF and SPARQL with HBase - Features and design specs
> -----------------------------------------------------
>
> Key: HBASE-2433
> URL: https://issues.apache.org/jira/browse/HBASE-2433
> Project: Hadoop HBase
> Issue Type: Improvement
> Components: contrib
> Reporter: Amandeep Khurana
> Assignee: Amandeep Khurana
>
> This is for scoping out the feature set and the design specifications for the
> RDF store over HBase and the query capability it will have. I'll be posting
> some initial ideas soon.
> The key goals for this layer are:
> 1. Scalability
> 2. Support for interactive queries (this one seems to be the biggest
> challenge)
> We would need to define the subset of queries we will support. We'll probably
> begin with SELECT queries.
--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
https://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira