Hi guys,
I've been reading up on bulk load using MapReduce jobs and I wanted to
validate something.

If I the input I wanted to load into HBase produced the same key for
several lines. How will HBase handle that?

I understand the MapReduce job will create StoreFiles which the region
servers just pick up and make available to the users. But is there a
validation to treat the first as insert and the rest as updates?

What about the limit on the number of versions of a key HBase can have? If
I want to have 10 versions, but the bulk load has 20 values for the same
key, will it only keep the last 10?

Thanks,
Juan

Reply via email to