Charles PORROT created HBASE-20748:
--------------------------------------

             Summary: HBaseContext bulkLoad: being able to use custom versions
                 Key: HBASE-20748
                 URL: https://issues.apache.org/jira/browse/HBASE-20748
             Project: HBase
          Issue Type: Improvement
          Components: spark
            Reporter: Charles PORROT
         Attachments: bulkLoadCustomVersions.scala

The _bulkLoad_ methods of _class org.apache.hadoop.hbase.spark_ use the 
system's current time for the version of the cells to bulk-load.

This makes this method, and its twin _bulkLoadThinRows_, useless if you need to 
use your own versionning system.

Thus, I propose a third _bulkLoad_ method, based on the original method. 
Instead of using an _Iterator(KeyFamilyQualifier, Array[Byte])_ as the basis 
for the writes, this new method would use an _Iterator(KeyFamilyQualifier, 
Array[Byte], Long_), with the _Long_ being the version.

In case of illogical version (for instance, a negative version), the method 
would throw back to the current timestamp.

See the attached file for a proposal of this new _bulkLoad_ method.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to