Hi All,

The current HBasePOJOInputOperator has the following limitations:

   1. It cannot restrict a scan to a specified set of "columnFamily:column"
   pairs and fetch data only for those columns.
   2. The output format is always a POJO. We need other output formats that
   support the "columnFamily:column" representation. Map and CSV are some
   of the options.
   3. It does not allow specifying an "end row-key" at which to stop
   scanning a table.
   4. It does not expose any metrics.

I am planning to add the above functionality to the HBase Input operators.
These features may go into the HBaseScanOperator / HBasePOJOInputOperator.
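
To make the proposal concrete, here is a rough sketch in plain Java of how
the column selection, the Map / CSV output and the end row-key check could
behave. It does not call the HBase client API or any existing operator code.
The class name ColumnSpecSketch and all of its methods are hypothetical, a
row is assumed to be already available as a map keyed by
"columnFamily:column", and the end row-key is assumed to be exclusive.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.StringJoiner;

// Hypothetical helper, for illustration only; not part of any existing operator.
public class ColumnSpecSketch {

    // Splits a "columnFamily:column" spec at the first colon.
    static String[] parseSpec(String spec) {
        int idx = spec.indexOf(':');
        if (idx <= 0 || idx == spec.length() - 1) {
            throw new IllegalArgumentException("Expected columnFamily:column, got: " + spec);
        }
        return new String[] {spec.substring(0, idx).trim(), spec.substring(idx + 1).trim()};
    }

    // Map output: keeps only the requested columns, in the requested order.
    static Map<String, String> toMap(Map<String, String> row, List<String> specs) {
        Map<String, String> out = new LinkedHashMap<>();
        for (String spec : specs) {
            String[] fc = parseSpec(spec);
            String key = fc[0] + ":" + fc[1];
            if (row.containsKey(key)) {
                out.put(key, row.get(key));
            }
        }
        return out;
    }

    // CSV output: one value per requested column, empty string when missing.
    // No quoting or escaping is done here; a real implementation would need it.
    static String toCsv(Map<String, String> row, List<String> specs) {
        StringJoiner joiner = new StringJoiner(",");
        for (String spec : specs) {
            String[] fc = parseSpec(spec);
            String value = row.get(fc[0] + ":" + fc[1]);
            joiner.add(value == null ? "" : value);
        }
        return joiner.toString();
    }

    // End row-key check (assumed exclusive): a null end key means "scan to the end".
    // String comparison stands in for the byte-wise comparison of real row keys.
    static boolean beforeEndRow(String rowKey, String endRowKey) {
        return endRowKey == null || rowKey.compareTo(endRowKey) < 0;
    }
}
```

In the operator itself, the column list and the end row-key would most
likely be applied to the scan so that filtering happens on the HBase side
and not after the fetch. The sketch only shows the intended semantics.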

Please let me know your comments.

Thanks.

Bhupesh
