[ https://issues.apache.org/jira/browse/PIG-1782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12987962#action_12987962 ]
Dmitriy V. Ryaboy commented on PIG-1782: ---------------------------------------- To Eric's point, we should add timestamp controls straight into Storage. Returning tuples of the form ( optionalRowKey, { col1 => val1, col2 => val2 } ) makes sense to me. I don't like the tuple of tuples option because it makes it hard to pull out specific columns in that structure, which is likely what one wants to do. We should give some thought to someone loading using HbaseStorage( 'cf1:, cf2:some_col' , '-loadKey') > Add ability to load data by column family in HBaseStorage > --------------------------------------------------------- > > Key: PIG-1782 > URL: https://issues.apache.org/jira/browse/PIG-1782 > Project: Pig > Issue Type: New Feature > Environment: Java 6, Mac OS X 10.6 > Reporter: Eric Yang > Assignee: Bill Graham > > It would be nice to load all columns in the column family by using short hand > syntax like: > {noformat} > CpuMetrics = load 'hbase://SystemMetrics' USING > org.apache.pig.backend.hadoop.hbase.HBaseStorage('cpu:','-loadKey'); > {noformat} > Assuming there are columns cpu: sys.0, cpu:sys.1, cpu:user.0, cpu:user.1, in > cpu column family. > CpuMetrics would contain something like: > {noformat} > (rowKey, cpu:sys.0, cpu:sys.1, cpu:user.0, cpu:user.1) > {noformat} -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.