Re: A Scale-Out RDF Store for Distributed Processing on Map/Reduce

Colin Evans Mon, 20 Oct 2008 18:24:25 -0700

Hi Edward,

At Metaweb, we're experimenting with storing raw triples in HDFS flat files, and have written a simple query language and planner that executes the queries with chained map-reduce jobs. This approach works well for warehousing triple data, and doesn't require HBase. Queries may take a few minutes to execute, but the system scales for very large datasets and result sets because it doesn't try to resolve queries in memory. We're currently testing with more than 150MM triples and have been happy with the results.
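
As a rough illustration of the chained-job idea (not Metaweb's actual query language or planner, which this message does not show), the sketch below simulates two map-reduce jobs in plain Python. Everything in it is an assumption made for the example: the tab-separated triple layout, the sample data, the predicate names, and the helper `run_job`, which stands in for a real Hadoop job reading from HDFS. The first job performs a reduce-side join of two triple patterns on a shared variable; the second job consumes the first job's output.

```python
from collections import defaultdict


def run_job(records, mapper, reducer):
    """Simulate one map-reduce job: map, group by key (shuffle), reduce."""
    groups = defaultdict(list)
    for record in records:
        for key, value in mapper(record):
            groups[key].append(value)
    output = []
    for key in sorted(groups):  # sorted keys keep the output deterministic
        output.extend(reducer(key, groups[key]))
    return output


# Hypothetical flat-file contents: one tab-separated triple per line.
TRIPLES = [
    "alice\tworks_at\tmetaweb",
    "bob\tworks_at\tmetaweb",
    "carol\tworks_at\tacme",
    "metaweb\tlocated_in\tsan_francisco",
    "acme\tlocated_in\tnew_york",
]

# Assumed query: ?person works_at ?org . ?org located_in ?city


def map_tag(line):
    """Job 1 map: keep triples matching either pattern, keyed by ?org."""
    s, p, o = line.rstrip("\n").split("\t")
    if p == "works_at":
        yield o, ("person", s)
    elif p == "located_in":
        yield s, ("city", o)


def reduce_join(org, values):
    """Job 1 reduce: join the two patterns on the shared ?org key."""
    people = sorted(v for tag, v in values if tag == "person")
    cities = sorted(v for tag, v in values if tag == "city")
    for person in people:
        for city in cities:
            yield (person, org, city)


def map_city(row):
    """Job 2 map: re-key the joined rows by city."""
    person, org, city = row
    yield city, 1


def reduce_count(city, ones):
    """Job 2 reduce: count the people bound to each city."""
    yield (city, sum(ones))


# Chain the jobs: the output of job 1 is the input of job 2.
joined = run_job(TRIPLES, map_tag, reduce_join)
counts = run_job(joined, map_city, reduce_count)
```

In a real deployment each job would write its intermediate result back to the file system rather than hold it in a list, which is what lets result sets grow beyond memory at the cost of per-job latency.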


-Colin


Edward J. Yoon wrote:

Hi all,

This RDF proposal was written a good while ago. Now we'd like to settle
down to the research again. I have attached our proposal, and we'd love
to hear your feedback and stories!

Thanks.
