Cheng Hao created HIVE-3823:
-------------------------------

             Summary: Performance issue while retrieving the Result objects in 
HiveHBaseTableInputFormat
                 Key: HIVE-3823
                 URL: https://issues.apache.org/jira/browse/HIVE-3823
             Project: Hive
          Issue Type: Improvement
          Components: HBase Handler
    Affects Versions: 0.9.0, 0.10.0, 0.9.1
            Reporter: Cheng Hao
            Priority: Trivial


In HiveHBaseTableInputFormat.java, retrieving the Result objects has a
performance issue: each call to next() copies the current Result with
Writables.copyWritable.
{code:title=HiveHBaseTableInputFormat}
      @Override
      public boolean next(ImmutableBytesWritable rowKey, Result value) throws IOException {

        boolean next = false;

        try {
          next = recordReader.nextKeyValue();

          if (next) {
            rowKey.set(recordReader.getCurrentValue().getRow());
            // Performance issue here: copyWritable serializes the source,
            // copies the bytes, and deserializes them into the target.
            Writables.copyWritable(recordReader.getCurrentValue(), value);
          }
        } catch (InterruptedException e) {
          throw new IOException(e);
        }

        return next;
      }
{code}

HBase 0.94.4 and 0.96.0 add a new method, Result.copyFrom, which would
solve the issue.

See [HBASE-7381|https://issues.apache.org/jira/browse/HBASE-7381]
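To illustrate why the copy path matters, here is a minimal, self-contained sketch that uses only the JDK. FakeResult and CopySketch are hypothetical stand-ins, not HBase or Hive classes. The sketch assumes that copyFrom shares the source's backing data instead of round-tripping it through a byte stream; that assumption should be checked against HBASE-7381.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;

public class CopySketch {

  /** Hypothetical stand-in for a Writable result that wraps one byte[] payload. */
  public static final class FakeResult {
    private byte[] payload = new byte[0];

    public FakeResult() {}

    public FakeResult(byte[] payload) {
      this.payload = payload;
    }

    public byte[] payload() {
      return payload;
    }

    /** Serializes the payload as a length prefix followed by the bytes. */
    public void write(DataOutputStream out) throws IOException {
      out.writeInt(payload.length);
      out.write(payload);
    }

    /** Deserializes into a freshly allocated array. */
    public void readFields(DataInputStream in) throws IOException {
      byte[] buf = new byte[in.readInt()];
      in.readFully(buf);
      payload = buf;
    }

    /** Assumed analogue of copyFrom: share the backing array, copy no bytes. */
    public void copyFrom(FakeResult other) {
      this.payload = other.payload;
    }
  }

  /** Mirrors the serialize, copy bytes, deserialize path described in the issue. */
  public static void copyViaSerialization(FakeResult src, FakeResult dst) {
    try {
      ByteArrayOutputStream bytes = new ByteArrayOutputStream();
      try (DataOutputStream out = new DataOutputStream(bytes)) {
        src.write(out);
      }
      // toByteArray() makes one more copy before the bytes are read back.
      try (DataInputStream in =
          new DataInputStream(new ByteArrayInputStream(bytes.toByteArray()))) {
        dst.readFields(in);
      }
    } catch (IOException e) {
      throw new UncheckedIOException(e);
    }
  }

  public static void main(String[] args) {
    FakeResult src = new FakeResult(new byte[] {1, 2, 3});

    FakeResult viaBytes = new FakeResult();
    copyViaSerialization(src, viaBytes);

    FakeResult viaRef = new FakeResult();
    viaRef.copyFrom(src);

    System.out.println("serialization shares array: " + (viaBytes.payload() == src.payload()));
    System.out.println("copyFrom shares array: " + (viaRef.payload() == src.payload()));
  }
}
```

Under that assumption, the change in next() would presumably replace the Writables.copyWritable call with value.copyFrom(recordReader.getCurrentValue()), on HBase versions that provide the method.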

