Hi,
I'd like to make Pig load only a subset of an HBase table, based on
either the timestamp of the records or the key of the rows.
As an example, I'd like to load only records that have a timestamp >
N, or a key > "something".
I know that HBase scanners are highly optimized for this kind of
restriction, and using one would greatly reduce the time needed to
load my data.
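To make the request concrete, here is a small plain-Java model of the
selection I have in mind. It is only a sketch: a TreeMap stands in for
the sorted HBase table, and the names SubsetLoadSketch and rowsAfter
are made up for illustration and are not part of Pig or HBase. On the
HBase side I would expect the same restriction to be expressed as a
start row and a time range on the Scan.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.NavigableMap;
import java.util.TreeMap;

public class SubsetLoadSketch {

    /**
     * Returns the row keys that are strictly greater than afterKey and whose
     * timestamp is strictly greater than afterTimestamp.
     * The map (row key -> timestamp) is a hypothetical stand-in for an HBase
     * table, which also keeps its rows sorted by key.
     * Pass "" as afterKey to skip the key restriction (every non-empty key
     * sorts after ""), or Long.MIN_VALUE as afterTimestamp to skip the
     * timestamp restriction.
     */
    static List<String> rowsAfter(NavigableMap<String, Long> table,
                                  String afterKey,
                                  long afterTimestamp) {
        List<String> selected = new ArrayList<>();
        // tailMap(key, false) starts at the first key strictly greater than
        // afterKey, so earlier rows are never visited at all.
        for (Map.Entry<String, Long> row : table.tailMap(afterKey, false).entrySet()) {
            if (row.getValue() > afterTimestamp) {
                selected.add(row.getKey());
            }
        }
        return selected;
    }

    public static void main(String[] args) {
        NavigableMap<String, Long> table = new TreeMap<>();
        table.put("row-a", 100L);
        table.put("row-b", 200L);
        table.put("row-c", 300L);
        table.put("row-d", 150L);

        // Key restriction only: rows with key > "row-b".
        System.out.println(rowsAfter(table, "row-b", Long.MIN_VALUE));
        // Timestamp restriction only: rows with timestamp > 150.
        System.out.println(rowsAfter(table, "", 150L));
    }
}
```

The point of the model is that the key restriction skips rows instead
of reading and then discarding them, which is the saving I am hoping
to get from the loader.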
Is there any way to do this?
If not, is it planned to be added to the HBase loader?
If not, is it technically possible to do it?
If yes, can I contribute and propose a patch for that?
Thanks a lot!