We are populating some HBase tables from daily data streams that are
stored in Hive.  When we see a row key that's already in the table,
the data should be appended to that row's record.  What is the best
way to achieve this?  Should we be using the Java API?  Rely on
HBase cell timestamping?  Or create compound keys (row_id+date) and
periodically run a separate MapReduce job to coalesce all the data
belonging to the same row_id?
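
A minimal sketch of what the coalescing step in the compound-key option
could look like, in plain Python with no HBase or MapReduce calls.  The
function name `coalesce`, the "+" separator, and the sample keys are
hypothetical, and it assumes dates are ISO-formatted (YYYY-MM-DD) so they
sort correctly as strings:

```python
from collections import defaultdict


def coalesce(rows, sep="+"):
    """Group (compound_key, value) pairs by row_id, ordered by date.

    Each compound key is assumed to look like "<row_id><sep><date>".
    Returns a dict mapping row_id to its values in date order.
    """
    grouped = defaultdict(list)
    for compound_key, value in rows:
        # Split on the last separator so a row_id containing sep still works.
        row_id, date = compound_key.rsplit(sep, 1)
        grouped[row_id].append((date, value))
    return {
        row_id: [value for _, value in sorted(items, key=lambda dv: dv[0])]
        for row_id, items in grouped.items()
    }


# Hypothetical sample data: two days for row "u1", one day for row "u2".
rows = [
    ("u1+2012-03-02", "b"),
    ("u2+2012-03-01", "x"),
    ("u1+2012-03-01", "a"),
]
merged = coalesce(rows)
```

In a real job the grouping would be done by the shuffle on row_id and the
merged list would be written back as a single row; this only shows the
key-splitting and ordering logic.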

Any pointers greatly appreciated!

--Leo
