Dear Wiki user, You have subscribed to a wiki page or wiki category on "Lucene-hadoop Wiki" for change notification.
The following page has been changed by udanax:
http://wiki.apache.org/lucene-hadoop/Hbase/HbaseArchitecture

------------------------------------------------------------------------------
  BigTable is the storage layer of the sparse matrix data.
  [[BR]]The goal is not data selection, even though that is a very useful feature, but matrix computation and aggregation.
+ Referring to the example code in Google's paper, it would look like this:
+ {{{
+ Scanner scanner(T);
+ ScanStream *stream;
+ stream = scanner.FetchColumnFamily("A");
+ stream->SetReturnVersions("t2");
+ scanner.Lookup("2");
+
+ for (; !stream->Done(); stream->Next()) {
+   printf("%s %s %lld %s\n",
+          scanner.RowName(),
+          stream->ColumnName(),
+          stream->MicroTimestamp(),
+          stream->Value());
+ }
+ }}}
+
+ This example code prints the first and second row vectors of the 4*4 sparse matrix.
+ [[BR]]It processes vector calculations in parallel using row-wise partitioning.
+ [[BR]]Therefore, to do distributed computing effectively, the data structure needs to be defined so that it fully supports the preprocessing needed to obtain abstract matrix information.
+ Then, I think the architecture needs to look like this:
   * Data Storage Conceptual