[ 
https://issues.apache.org/jira/browse/HBASE-24791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167581#comment-17167581
 ] 

Yechao Chen edited comment on HBASE-24791 at 7/30/20, 1:11 AM:
---------------------------------------------------------------

{noformat}
How many columns in ur test? If that is large, the impact will be so huge which 
u r seeing. I think this should be marked as a bugfix as it was a perf 
regression in 2.0{noformat}
Test case 10 million columns for each loop, and loop 10 times,

 

consider user usually  use  Bulkload import big data with many rows/columns to 
hbase, I think  this will be helpful. 


was (Author: chenyechao):
{noformat}
How many columns in ur test? If that is large, the impact will be so huge which 
u r seeing. I think this should be marked as a bugfix as it was a perf 
regression in 2.0{noformat}
Test case 10 million columns with for each, and loop 10 times,

 

consider user usually  use  Bulkload import big data with many rows/columns to 
hbase, I think  this will be helpful. 

> Improve HFileOutputFormat2 to avoid always call getTableRelativePath method
> ---------------------------------------------------------------------------
>
>                 Key: HBASE-24791
>                 URL: https://issues.apache.org/jira/browse/HBASE-24791
>             Project: HBase
>          Issue Type: Improvement
>          Components: mapreduce
>    Affects Versions: 3.0.0-alpha-1
>            Reporter: Yechao Chen
>            Assignee: Yechao Chen
>            Priority: Critical
>              Labels: HFileOutputFormat, bulkload
>             Fix For: 3.0.0-alpha-1
>
>
> Bulkload use HFileOutputFormat2 to write HFile 
> In the  HFileOutputFormat2.RecordWriter
> in the write method always called the getTableRelativePath method each time
> This  is unnecessary 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to