[jira] Updated: (HBASE-940) Make the TableOutputFormat batching-aware

Jean-Daniel Cryans (JIRA) Tue, 21 Oct 2008 09:22:07 -0700

     [ 
https://issues.apache.org/jira/browse/HBASE-940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Jean-Daniel Cryans updated HBASE-940:
-------------------------------------

    Attachment: hbase-940-v1.patch

This patch adds the improvement and cleans up some stuff. Also, I had some 
problems with the ITR because the BatchUpdate object seems to be reused, so the 
writeBuffer in HTable was filled 17 000 times with the same object when running 
TestTableMapReduce hence this code :

{code}
     while(values.hasNext()) {
-      output.collect(key, values.next());
+      BatchUpdate copy = new BatchUpdate(key.get());
+      BatchUpdate old = values.next();
+      for(BatchOperation bo : old) {
+        copy.put(bo.getColumn(), bo.getValue());
+      }
+      output.collect(key, copy);

{code}

> Make the TableOutputFormat batching-aware
> -----------------------------------------
>
>                 Key: HBASE-940
>                 URL: https://issues.apache.org/jira/browse/HBASE-940
>             Project: Hadoop HBase
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Jean-Daniel Cryans
>            Assignee: Jean-Daniel Cryans
>             Fix For: 0.19.0
>
>         Attachments: hbase-940-v1.patch
>
>
> With HBASE-748, we can batch updates but the auto-flushing is enabled by 
> default. Make it that it is disabled for MR jobs outputting in a table and 
> make sure that everything is sent in the end.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (HBASE-940) Make the TableOutputFormat batching-aware

Reply via email to