Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Lucene-hadoop Wiki" for change notification.

The following page has been changed by udanax:
http://wiki.apache.org/lucene-hadoop/Hbase/RDF

The comment on the change is:
The name Altools will no longer be used due to trademark issues

------------------------------------------------------------------------------
  [[TableOfContents(4)]]
  ----
- -- ''Volunteers and any comments on HbaseRDF are welcomed.''
+ ''Volunteers and any comments on HbaseRDF are welcomed.''

  == HbaseRDF, a Planet-Scale RDF Data Store ==
  We have started to think about storing and querying RDF data in Hbase. But we'll jump into its implementation only after careful investigation.
@@ -16, +16 @@
  parallel, distributed query processing.

  === Related projects ===
-  * The issue [https://issues.apache.org/jira/browse/HADOOP-1608 HADOOP-1608 Relational Algebra Operators] is designing and implementing relational algebra operators. See [:Hbase/HbaseShell/Altools: Algebraic Tools] for various algebraic operators we are designing and planning to implement, including relational algebra operators.
+  * The issue [https://issues.apache.org/jira/browse/HADOOP-1608 HADOOP-1608 Relational Algebra Operators] is designing and implementing relational algebra operators.
   * [:Hbase/HbaseShell: HbaseShell] provides a command line tool in which we can manipulate tables in Hbase. We are also planning to use HbaseShell to manipulate and query RDF data to be stored in Hbase.
   * [https://issues.apache.org/jira/browse/HADOOP-1120 contrib/data_join] provides helper classes to help implement data join operations through MapReduce jobs. Thanks to Runping.
   * [http://www.openrdf.org/forum/mvnforum/viewthread?thread=1423 A forum at Aduna/Sesame] would be interested in working with this group.
@@ -119, +119 @@
  ----
  == Papers ==
+  * OSDI 2004, ''MapReduce: Simplified Data Processing on Large Clusters'', proposes a very simple, but powerful, and highly parallelized data processing technique.
-  * OSDI 2004, ''MapReduce: Simplified Data Processing on Large Clusters''
-   * proposes a very simple, but powerful, and highly parallelized data processing technique.
-  * CIDR 2007, ''[http://db.lcs.mit.edu/projects/cstore/abadicidr07.pdf Column-Stores For Wide and Sparse Data]''
+  * CIDR 2007, ''[http://db.lcs.mit.edu/projects/cstore/abadicidr07.pdf Column-Stores For Wide and Sparse Data]'', discusses the benefits of using C-Store to store RDF and XML data.
+  * VLDB 2007, ''[http://db.lcs.mit.edu/projects/cstore/abadirdf.pdf Scalable Semantic Web Data Management Using Vertical Partitioning]'', proposes an efficient method to store RDF data in table projections (i.e., columns) and executes queries on them.
-   * discusses the benefits of using C-Store to store RDF and XML data.
-  * VLDB 2007, ''[http://db.lcs.mit.edu/projects/cstore/abadirdf.pdf Scalable Semantic Web Data Management Using Vertical Partitioning]''
-   * proposes an efficient method to store RDF data in table projections (i.e., columns) and executes queries on them.
-  * SIGMOD 2007, ''Map-Reduce-Merge: Simplified Relational Data Processing on Large Clusters''
+  * SIGMOD 2007, ''Map-Reduce-Merge: Simplified Relational Data Processing on Large Clusters'', MapReduce implementation of several relational operators.
-   * MapReduce implementation of several relational operators.