+1 for this Bhupesh. Additionally, I would suggest to add support for; 1. Point query 2. Returning any row version
The above two are key features of HBase and should be supported. Regards, Sandeep On Fri, Dec 18, 2015 at 4:39 PM, Bhupesh Chawda <[email protected]> wrote: > Hi All, > > The current HBasePOJOInputOperator does not allow us to do the following: > > 1. Allow us to specify a set of "column family: column" and fetch data > only for these columns. > 2. Output format is currently a POJO. We need to have other output > formats such that "columnFamily:column" representation is supported. > Map / > CSV are some of the options. > 3. Allow specifying "end row-key" to stop scanning a table. > 4. No metrics. > > I am planning to add the above functionality to the HBase Input operators. > These features may go into the HBaseScanOperator / HBasePOJOInputOperator. > > Please let me know your comments. > > Thanks. > > Bhupesh >
