[ https://issues.apache.org/jira/browse/HBASE-24791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17167581#comment-17167581 ]
Yechao Chen edited comment on HBASE-24791 at 7/30/20, 1:11 AM:
---------------------------------------------------------------
{noformat}
How many columns in ur test? If that is large, the impact will be so huge which
u r seeing. I think this should be marked as a bugfix as it was a perf
regression in 2.0{noformat}
The test case writes 10 million columns per loop and runs the loop 10 times.
Considering that users usually use bulk load to import large data sets with
many rows/columns into HBase, I think this will be helpful.
was (Author: chenyechao):
{noformat}
How many columns in ur test? If that is large, the impact will be so huge which
u r seeing. I think this should be marked as a bugfix as it was a perf
regression in 2.0{noformat}
Test case 10 million columns with for each, and loop 10 times,
consider user usually use Bulkload import big data with many rows/columns to
hbase, I think this will be helpful.
> Improve HFileOutputFormat2 to avoid always call getTableRelativePath method
> ---------------------------------------------------------------------------
>
> Key: HBASE-24791
> URL: https://issues.apache.org/jira/browse/HBASE-24791
> Project: HBase
> Issue Type: Improvement
> Components: mapreduce
> Affects Versions: 3.0.0-alpha-1
> Reporter: Yechao Chen
> Assignee: Yechao Chen
> Priority: Critical
> Labels: HFileOutputFormat, bulkload
> Fix For: 3.0.0-alpha-1
>
>
> Bulk load uses HFileOutputFormat2 to write HFiles.
> In HFileOutputFormat2.RecordWriter, the write method calls the
> getTableRelativePath method on every invocation.
> This is unnecessary.
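
To illustrate the idea in the description above, here is a minimal sketch of computing the table-relative path only when a new writer is needed, not on every write. This is not the actual HBase code or the committed patch: the class TablePathCacheSketch, the helper computeTableRelativePath (a stand-in for getTableRelativePath), and the "namespace/table" path layout are assumptions made for illustration only.

```java
import java.nio.charset.StandardCharsets;
import java.util.HashMap;
import java.util.Map;

public class TablePathCacheSketch {

    // Counts how often the path computation runs (for demonstration only).
    int computeCalls = 0;

    // Counts how many cells were "written".
    long cellsWritten = 0;

    // Hypothetical per-family writer directories, standing in for real HFile writers.
    private final Map<String, String> writerDirs = new HashMap<>();

    // Stand-in for getTableRelativePath. Assumption: maps "ns:table" to "ns/table"
    // and a bare "table" to "default/table"; the real method may differ.
    String computeTableRelativePath(byte[] tableNameBytes) {
        computeCalls++;
        String name = new String(tableNameBytes, StandardCharsets.UTF_8);
        String[] parts = name.split(":", 2);
        return parts.length == 2 ? parts[0] + "/" + parts[1] : "default/" + name;
    }

    // Simulated write: the path is computed only when no writer exists yet
    // for the given column family, instead of on every call.
    void write(byte[] tableNameBytes, String family) {
        String dir = writerDirs.get(family);
        if (dir == null) {
            dir = computeTableRelativePath(tableNameBytes) + "/" + family;
            writerDirs.put(family, dir);
        }
        cellsWritten++;
    }

    // Returns the directory recorded for a family, or null if none exists.
    String writerDirFor(String family) {
        return writerDirs.get(family);
    }

    public static void main(String[] args) {
        TablePathCacheSketch writer = new TablePathCacheSketch();
        byte[] table = "ns1:t1".getBytes(StandardCharsets.UTF_8);
        for (int i = 0; i < 1_000_000; i++) {
            writer.write(table, "cf");
        }
        System.out.println(writer.writerDirFor("cf") + " computed "
            + writer.computeCalls + " time(s) for " + writer.cellsWritten + " writes");
    }
}
```

In this sketch, one million writes to the same family trigger a single path computation. The per-cell saving is small, but with tens of millions of columns per job, as in the test mentioned in the comment, it can add up.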