[jira] [Updated] (HBASE-21810) bulkload support set hfile compression on client

2019-03-12 Thread Andrew Purtell (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-21810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Purtell updated HBASE-21810:
---
Fix Version/s: (was: 1.5.1)
   1.5.0

> bulkload  support set hfile compression on client 
> --
>
> Key: HBASE-21810
> URL: https://issues.apache.org/jira/browse/HBASE-21810
> Project: HBase
>  Issue Type: Improvement
>  Components: mapreduce
>Affects Versions: 1.3.3, 1.4.9, 2.0.4, 2.1.3, 1.2.11
>Reporter: Yechao Chen
>Assignee: Yechao Chen
>Priority: Major
> Fix For: 3.0.0, 1.5.0, 2.2.0, 2.3.0
>
> Attachments: HBASE-21810.branch-1.001.patch, 
> HBASE-21810.branch-1.002.patch, HBASE-21810.branch-1.003.patch, 
> HBASE-21810.branch-1.2.001.patch, HBASE-21810.branch-1.2.002.patch, 
> HBASE-21810.branch-1.2.002.patch, HBASE-21810.branch-1.2.003.patch, 
> HBASE-21810.branch-2.001.patch, HBASE-21810.branch-2.002.patch, 
> HBASE-21810.master.001.patch, HBASE-21810.master.001.patch, 
> HBASE-21810.master.002.patch, HBASE-21810.master.003.patch, 
> HBASE-21810.master.003.patch
>
>
> HBase bulk load (HFileOutputFormat2) generates HFiles using the compression 
> configured on the table's column family (CF).
> It is sometimes useful to be able to set the compression on the client instead.
> Some cases from our production environment:
> 1. HFile bulk load replication between data centers with limited bandwidth: we 
> can set the compression of the bulk-loaded HFiles without changing the table 
> compression.
> 2. The bulk-loaded HFiles are written without compression while the table 
> compression is gz/zstd/snappy/...: this reduces HFile creation time, and 
> compaction will eventually compress the HFiles.
> 3. Sometimes the YARN nodes (where the reducers create the HFiles) or the 
> bulk load (doBulkLoad) client have no compression library, but the HBase 
> cluster does; a client-side setting is useful in this case.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (HBASE-21810) bulkload support set hfile compression on client

2019-03-12 Thread Yechao Chen (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-21810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yechao Chen updated HBASE-21810:

Fix Version/s: (was: 1.5.0)
   1.5.1
 Release Note: 
Bulk load (HFileOutputFormat2) now supports configuring the compression on the 
client: set the job configuration "hbase.mapreduce.hfileoutputformat.compression" 
to override the auto-detection of the target table's compression.
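
The precedence described in the release note can be pictured with a small
sketch. The Java below models the job configuration and the table's
per-family compression as plain maps, an assumption made so that the example
runs without HBase or Hadoop on the classpath. Only the key name comes from
the release note; resolveCompression, the maps, and the "none" fallback are
hypothetical and are not the HFileOutputFormat2 code.

```java
import java.util.HashMap;
import java.util.Map;

/**
 * Illustrative sketch only: models the override precedence from the release
 * note with plain maps. It is not the HFileOutputFormat2 implementation.
 */
class CompressionOverrideSketch {

    /** Configuration key quoted in the release note. */
    static final String COMPRESSION_OVERRIDE_KEY =
        "hbase.mapreduce.hfileoutputformat.compression";

    /**
     * Picks the compression for one column family.
     *
     * @param jobConf           stand-in for the MapReduce job configuration
     * @param tableCompressions hypothetical map of column family to the
     *                          compression detected from the target table
     * @param family            column family being written
     * @return the client override if set, otherwise the table's setting,
     *         otherwise "none" (an assumed fallback)
     */
    static String resolveCompression(Map<String, String> jobConf,
                                     Map<String, String> tableCompressions,
                                     String family) {
        String override = jobConf.get(COMPRESSION_OVERRIDE_KEY);
        if (override != null && !override.isEmpty()) {
            // Client-side setting wins over the table's configuration.
            return override;
        }
        return tableCompressions.getOrDefault(family, "none");
    }

    public static void main(String[] args) {
        Map<String, String> table = new HashMap<>();
        table.put("cf", "gz");

        // No override: the table's compression is used.
        Map<String, String> jobConf = new HashMap<>();
        System.out.println(resolveCompression(jobConf, table, "cf"));

        // Override set on the client: it replaces the table's compression.
        jobConf.put(COMPRESSION_OVERRIDE_KEY, "snappy");
        System.out.println(resolveCompression(jobConf, table, "cf"));
    }
}
```

In a real job the key would presumably be set on the job's configuration
before submission; the exact values accepted depend on the compression
algorithms the HBase build supports.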



> bulkload  support set hfile compression on client 
> --
>
> Key: HBASE-21810
> URL: https://issues.apache.org/jira/browse/HBASE-21810
> Project: HBase
>  Issue Type: Improvement
>  Components: mapreduce
>Affects Versions: 1.3.3, 1.4.9, 2.0.4, 2.1.3, 1.2.11
>Reporter: Yechao Chen
>Assignee: Yechao Chen
>Priority: Major
> Fix For: 3.0.0, 2.2.0, 2.3.0, 1.5.1
>
> Attachments: HBASE-21810.branch-1.001.patch, 
> HBASE-21810.branch-1.002.patch, HBASE-21810.branch-1.003.patch, 
> HBASE-21810.branch-1.2.001.patch, HBASE-21810.branch-1.2.002.patch, 
> HBASE-21810.branch-1.2.002.patch, HBASE-21810.branch-1.2.003.patch, 
> HBASE-21810.branch-2.001.patch, HBASE-21810.branch-2.002.patch, 
> HBASE-21810.master.001.patch, HBASE-21810.master.001.patch, 
> HBASE-21810.master.002.patch, HBASE-21810.master.003.patch, 
> HBASE-21810.master.003.patch
>
>


[jira] [Updated] (HBASE-21810) bulkload support set hfile compression on client

2019-03-12 Thread Andrew Purtell (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-21810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Purtell updated HBASE-21810:
---
Fix Version/s: (was: 1.5.1)
   1.5.0

> bulkload  support set hfile compression on client 
> --
>
> Key: HBASE-21810
> URL: https://issues.apache.org/jira/browse/HBASE-21810
> Project: HBase
>  Issue Type: Improvement
>  Components: mapreduce
>Affects Versions: 1.3.3, 1.4.9, 2.0.4, 2.1.3, 1.2.11
>Reporter: Yechao Chen
>Assignee: Yechao Chen
>Priority: Major
> Fix For: 3.0.0, 1.5.0, 2.2.0, 2.3.0
>
> Attachments: HBASE-21810.branch-1.001.patch, 
> HBASE-21810.branch-1.002.patch, HBASE-21810.branch-1.003.patch, 
> HBASE-21810.branch-1.2.001.patch, HBASE-21810.branch-1.2.002.patch, 
> HBASE-21810.branch-1.2.002.patch, HBASE-21810.branch-1.2.003.patch, 
> HBASE-21810.branch-2.001.patch, HBASE-21810.branch-2.002.patch, 
> HBASE-21810.master.001.patch, HBASE-21810.master.001.patch, 
> HBASE-21810.master.002.patch, HBASE-21810.master.003.patch, 
> HBASE-21810.master.003.patch
>
>


[jira] [Updated] (HBASE-21810) bulkload support set hfile compression on client

2019-03-12 Thread Guanghao Zhang (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-21810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang updated HBASE-21810:
---
   Resolution: Fixed
Fix Version/s: 1.5.1
   2.3.0
   2.2.0
   3.0.0
   Status: Resolved  (was: Patch Available)

Pushed to branch-2.2+ and branch-1. Thanks [~chenyechao] for contributing.

> bulkload  support set hfile compression on client 
> --
>
> Key: HBASE-21810
> URL: https://issues.apache.org/jira/browse/HBASE-21810
> Project: HBase
>  Issue Type: Improvement
>  Components: mapreduce
>Affects Versions: 1.3.3, 1.4.9, 2.0.4, 2.1.3, 1.2.11
>Reporter: Yechao Chen
>Assignee: Yechao Chen
>Priority: Major
> Fix For: 3.0.0, 2.2.0, 2.3.0, 1.5.1
>
> Attachments: HBASE-21810.branch-1.001.patch, 
> HBASE-21810.branch-1.002.patch, HBASE-21810.branch-1.003.patch, 
> HBASE-21810.branch-1.2.001.patch, HBASE-21810.branch-1.2.002.patch, 
> HBASE-21810.branch-1.2.002.patch, HBASE-21810.branch-1.2.003.patch, 
> HBASE-21810.branch-2.001.patch, HBASE-21810.branch-2.002.patch, 
> HBASE-21810.master.001.patch, HBASE-21810.master.001.patch, 
> HBASE-21810.master.002.patch, HBASE-21810.master.003.patch, 
> HBASE-21810.master.003.patch
>
>


[jira] [Updated] (HBASE-21810) bulkload support set hfile compression on client

2019-03-11 Thread Yechao Chen (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-21810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yechao Chen updated HBASE-21810:

Attachment: HBASE-21810.branch-1.2.003.patch

> bulkload  support set hfile compression on client 
> --
>
> Key: HBASE-21810
> URL: https://issues.apache.org/jira/browse/HBASE-21810
> Project: HBase
>  Issue Type: Improvement
>  Components: mapreduce
>Affects Versions: 1.3.3, 1.4.9, 2.0.4, 2.1.3, 1.2.11
>Reporter: Yechao Chen
>Assignee: Yechao Chen
>Priority: Major
> Attachments: HBASE-21810.branch-1.001.patch, 
> HBASE-21810.branch-1.002.patch, HBASE-21810.branch-1.003.patch, 
> HBASE-21810.branch-1.2.001.patch, HBASE-21810.branch-1.2.002.patch, 
> HBASE-21810.branch-1.2.002.patch, HBASE-21810.branch-1.2.003.patch, 
> HBASE-21810.branch-2.001.patch, HBASE-21810.branch-2.002.patch, 
> HBASE-21810.master.001.patch, HBASE-21810.master.001.patch, 
> HBASE-21810.master.002.patch, HBASE-21810.master.003.patch, 
> HBASE-21810.master.003.patch
>
>


[jira] [Updated] (HBASE-21810) bulkload support set hfile compression on client

2019-03-11 Thread Yechao Chen (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-21810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yechao Chen updated HBASE-21810:

Attachment: HBASE-21810.branch-1.003.patch

> bulkload  support set hfile compression on client 
> --
>
> Key: HBASE-21810
> URL: https://issues.apache.org/jira/browse/HBASE-21810
> Project: HBase
>  Issue Type: Improvement
>  Components: mapreduce
>Affects Versions: 1.3.3, 1.4.9, 2.0.4, 2.1.3, 1.2.11
>Reporter: Yechao Chen
>Assignee: Yechao Chen
>Priority: Major
> Attachments: HBASE-21810.branch-1.001.patch, 
> HBASE-21810.branch-1.002.patch, HBASE-21810.branch-1.003.patch, 
> HBASE-21810.branch-1.2.001.patch, HBASE-21810.branch-1.2.002.patch, 
> HBASE-21810.branch-1.2.002.patch, HBASE-21810.branch-2.001.patch, 
> HBASE-21810.branch-2.002.patch, HBASE-21810.master.001.patch, 
> HBASE-21810.master.001.patch, HBASE-21810.master.002.patch, 
> HBASE-21810.master.003.patch, HBASE-21810.master.003.patch
>
>


[jira] [Updated] (HBASE-21810) bulkload support set hfile compression on client

2019-03-11 Thread Yechao Chen (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-21810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yechao Chen updated HBASE-21810:

Attachment: HBASE-21810.branch-1.2.002.patch

> bulkload  support set hfile compression on client 
> --
>
> Key: HBASE-21810
> URL: https://issues.apache.org/jira/browse/HBASE-21810
> Project: HBase
>  Issue Type: Improvement
>  Components: mapreduce
>Affects Versions: 1.3.3, 1.4.9, 2.0.4, 2.1.3, 1.2.11
>Reporter: Yechao Chen
>Assignee: Yechao Chen
>Priority: Major
> Attachments: HBASE-21810.branch-1.001.patch, 
> HBASE-21810.branch-1.002.patch, HBASE-21810.branch-1.2.001.patch, 
> HBASE-21810.branch-1.2.002.patch, HBASE-21810.branch-1.2.002.patch, 
> HBASE-21810.branch-2.001.patch, HBASE-21810.branch-2.002.patch, 
> HBASE-21810.master.001.patch, HBASE-21810.master.001.patch, 
> HBASE-21810.master.002.patch, HBASE-21810.master.003.patch, 
> HBASE-21810.master.003.patch
>
>


[jira] [Updated] (HBASE-21810) bulkload support set hfile compression on client

2019-03-11 Thread Yechao Chen (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-21810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yechao Chen updated HBASE-21810:

Affects Version/s: (was: 1.2.10)
   1.2.11

> bulkload  support set hfile compression on client 
> --
>
> Key: HBASE-21810
> URL: https://issues.apache.org/jira/browse/HBASE-21810
> Project: HBase
>  Issue Type: Improvement
>  Components: mapreduce
>Affects Versions: 1.3.3, 1.4.9, 2.0.4, 2.1.3, 1.2.11
>Reporter: Yechao Chen
>Assignee: Yechao Chen
>Priority: Major
> Attachments: HBASE-21810.branch-1.001.patch, 
> HBASE-21810.branch-1.002.patch, HBASE-21810.branch-1.2.001.patch, 
> HBASE-21810.branch-1.2.002.patch, HBASE-21810.branch-2.001.patch, 
> HBASE-21810.branch-2.002.patch, HBASE-21810.master.001.patch, 
> HBASE-21810.master.001.patch, HBASE-21810.master.002.patch, 
> HBASE-21810.master.003.patch, HBASE-21810.master.003.patch
>
>


[jira] [Updated] (HBASE-21810) bulkload support set hfile compression on client

2019-03-11 Thread Yechao Chen (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-21810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yechao Chen updated HBASE-21810:

Attachment: HBASE-21810.branch-1.2.002.patch

> bulkload  support set hfile compression on client 
> --
>
> Key: HBASE-21810
> URL: https://issues.apache.org/jira/browse/HBASE-21810
> Project: HBase
>  Issue Type: Improvement
>  Components: mapreduce
>Affects Versions: 1.3.3, 1.4.9, 1.2.10, 2.0.4, 2.1.3
>Reporter: Yechao Chen
>Assignee: Yechao Chen
>Priority: Major
> Attachments: HBASE-21810.branch-1.001.patch, 
> HBASE-21810.branch-1.002.patch, HBASE-21810.branch-1.2.001.patch, 
> HBASE-21810.branch-1.2.002.patch, HBASE-21810.branch-2.001.patch, 
> HBASE-21810.branch-2.002.patch, HBASE-21810.master.001.patch, 
> HBASE-21810.master.001.patch, HBASE-21810.master.002.patch, 
> HBASE-21810.master.003.patch, HBASE-21810.master.003.patch
>
>


[jira] [Updated] (HBASE-21810) bulkload support set hfile compression on client

2019-03-11 Thread Yechao Chen (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-21810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yechao Chen updated HBASE-21810:

Attachment: HBASE-21810.branch-1.002.patch

> bulkload  support set hfile compression on client 
> --
>
> Key: HBASE-21810
> URL: https://issues.apache.org/jira/browse/HBASE-21810
> Project: HBase
>  Issue Type: Improvement
>  Components: mapreduce
>Affects Versions: 1.3.3, 1.4.9, 1.2.10, 2.0.4, 2.1.3
>Reporter: Yechao Chen
>Assignee: Yechao Chen
>Priority: Major
> Attachments: HBASE-21810.branch-1.001.patch, 
> HBASE-21810.branch-1.002.patch, HBASE-21810.branch-1.2.001.patch, 
> HBASE-21810.branch-2.001.patch, HBASE-21810.branch-2.002.patch, 
> HBASE-21810.master.001.patch, HBASE-21810.master.001.patch, 
> HBASE-21810.master.002.patch, HBASE-21810.master.003.patch, 
> HBASE-21810.master.003.patch
>
>


[jira] [Updated] (HBASE-21810) bulkload support set hfile compression on client

2019-03-11 Thread Yechao Chen (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-21810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yechao Chen updated HBASE-21810:

Attachment: HBASE-21810.master.003.patch

> bulkload  support set hfile compression on client 
> --
>
> Key: HBASE-21810
> URL: https://issues.apache.org/jira/browse/HBASE-21810
> Project: HBase
>  Issue Type: Improvement
>  Components: mapreduce
>Affects Versions: 1.3.3, 1.4.9, 1.2.10, 2.0.4, 2.1.3
>Reporter: Yechao Chen
>Assignee: Yechao Chen
>Priority: Major
> Attachments: HBASE-21810.branch-1.001.patch, 
> HBASE-21810.branch-1.2.001.patch, HBASE-21810.branch-2.001.patch, 
> HBASE-21810.branch-2.002.patch, HBASE-21810.master.001.patch, 
> HBASE-21810.master.001.patch, HBASE-21810.master.002.patch, 
> HBASE-21810.master.003.patch, HBASE-21810.master.003.patch
>
>


[jira] [Updated] (HBASE-21810) bulkload support set hfile compression on client

2019-03-11 Thread Yechao Chen (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-21810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yechao Chen updated HBASE-21810:

Attachment: HBASE-21810.branch-2.002.patch

> bulkload  support set hfile compression on client 
> --
>
> Key: HBASE-21810
> URL: https://issues.apache.org/jira/browse/HBASE-21810
> Project: HBase
>  Issue Type: Improvement
>  Components: mapreduce
>Affects Versions: 1.3.3, 1.4.9, 1.2.10, 2.0.4, 2.1.3
>Reporter: Yechao Chen
>Assignee: Yechao Chen
>Priority: Major
> Attachments: HBASE-21810.branch-1.001.patch, 
> HBASE-21810.branch-1.2.001.patch, HBASE-21810.branch-2.001.patch, 
> HBASE-21810.branch-2.002.patch, HBASE-21810.master.001.patch, 
> HBASE-21810.master.001.patch, HBASE-21810.master.002.patch, 
> HBASE-21810.master.003.patch
>
>


[jira] [Updated] (HBASE-21810) bulkload support set hfile compression on client

2019-03-11 Thread Yechao Chen (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-21810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yechao Chen updated HBASE-21810:

Attachment: HBASE-21810.master.003.patch

> bulkload  support set hfile compression on client 
> --
>
> Key: HBASE-21810
> URL: https://issues.apache.org/jira/browse/HBASE-21810
> Project: HBase
>  Issue Type: Improvement
>  Components: mapreduce
>Affects Versions: 1.3.3, 1.4.9, 1.2.10, 2.0.4, 2.1.3
>Reporter: Yechao Chen
>Assignee: Yechao Chen
>Priority: Major
> Attachments: HBASE-21810.branch-1.001.patch, 
> HBASE-21810.branch-1.2.001.patch, HBASE-21810.branch-2.001.patch, 
> HBASE-21810.master.001.patch, HBASE-21810.master.001.patch, 
> HBASE-21810.master.002.patch, HBASE-21810.master.003.patch
>
>


[jira] [Updated] (HBASE-21810) bulkload support set hfile compression on client

2019-03-10 Thread Yechao Chen (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-21810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yechao Chen updated HBASE-21810:

Attachment: HBASE-21810.master.002.patch

> bulkload  support set hfile compression on client 
> --
>
> Key: HBASE-21810
> URL: https://issues.apache.org/jira/browse/HBASE-21810
> Project: HBase
>  Issue Type: Improvement
>  Components: mapreduce
>Affects Versions: 1.3.3, 1.4.9, 1.2.10, 2.0.4, 2.1.3
>Reporter: Yechao Chen
>Assignee: Yechao Chen
>Priority: Major
> Attachments: HBASE-21810.branch-1.001.patch, 
> HBASE-21810.branch-1.2.001.patch, HBASE-21810.branch-2.001.patch, 
> HBASE-21810.master.001.patch, HBASE-21810.master.001.patch, 
> HBASE-21810.master.002.patch
>
>


[jira] [Updated] (HBASE-21810) bulkload support set hfile compression on client

2019-02-27 Thread Guanghao Zhang (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-21810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang updated HBASE-21810:
---
Attachment: HBASE-21810.master.001.patch

> bulkload  support set hfile compression on client 
> --
>
> Key: HBASE-21810
> URL: https://issues.apache.org/jira/browse/HBASE-21810
> Project: HBase
>  Issue Type: Improvement
>  Components: mapreduce
>Affects Versions: 1.3.3, 1.4.9, 1.2.10, 2.0.4, 2.1.3
>Reporter: Yechao Chen
>Assignee: Yechao Chen
>Priority: Major
> Attachments: HBASE-21810.branch-1.001.patch, 
> HBASE-21810.branch-1.2.001.patch, HBASE-21810.branch-2.001.patch, 
> HBASE-21810.master.001.patch, HBASE-21810.master.001.patch
>
>


[jira] [Updated] (HBASE-21810) bulkload support set hfile compression on client

2019-02-21 Thread Yechao Chen (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-21810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yechao Chen updated HBASE-21810:

Affects Version/s: (was: 2.1.2)
   2.1.3

> bulkload  support set hfile compression on client 
> --
>
> Key: HBASE-21810
> URL: https://issues.apache.org/jira/browse/HBASE-21810
> Project: HBase
>  Issue Type: Improvement
>  Components: mapreduce
>Affects Versions: 1.3.3, 1.4.9, 1.2.10, 2.0.4, 2.1.3
>Reporter: Yechao Chen
>Assignee: Yechao Chen
>Priority: Major
> Attachments: HBASE-21810.branch-1.001.patch, 
> HBASE-21810.branch-1.2.001.patch, HBASE-21810.branch-2.001.patch, 
> HBASE-21810.master.001.patch
>
>


[jira] [Updated] (HBASE-21810) bulkload support set hfile compression on client

2019-02-21 Thread Yechao Chen (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-21810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yechao Chen updated HBASE-21810:

Description: 
HBase bulk load (HFileOutputFormat2) generates HFiles using the compression 
configured on the table's column family (CF).

It is sometimes useful to be able to set the compression on the client instead.

Some cases from our production environment:

1. HFile bulk load replication between data centers with limited bandwidth: we 
can set the compression of the bulk-loaded HFiles without changing the table 
compression.

2. The bulk-loaded HFiles are written without compression while the table 
compression is gz/zstd/snappy/...: this reduces HFile creation time, and 
compaction will eventually compress the HFiles.

3. Sometimes the YARN nodes (where the reducers create the HFiles) or the 
bulk load (doBulkLoad) client have no compression library, but the HBase 
cluster does; a client-side setting is useful in this case.

  was:
hbase bulkload (HFileOutputFormat2) generate hfile ,the compression from the 
table(cf) compression,

if the compression can be set on client ,somethings it's useful,

some case in our production:

1、hfile bulkload replication between the data center with bandwidth limit, we 
can set the compression of the bulkload hfile not changing the table compression

2、bulkload hfile not set  compression ,but the table compression is 
gz/zstd/snappy... ,can reduce the hfile created time and compaction will make 
the hfile to compression finally

3、somethings the yarn nodes (hfile created by reduce) /dobulkload client has no 
compression lib,but the hbase cluster has,it's useful for this case


> bulkload  support set hfile compression on client 
> --
>
> Key: HBASE-21810
> URL: https://issues.apache.org/jira/browse/HBASE-21810
> Project: HBase
>  Issue Type: Improvement
>  Components: mapreduce
>Affects Versions: 1.3.3, 1.4.9, 2.1.2, 1.2.10, 2.0.4
>Reporter: Yechao Chen
>Assignee: Yechao Chen
>Priority: Major
> Attachments: HBASE-21810.branch-1.001.patch, 
> HBASE-21810.branch-1.2.001.patch, HBASE-21810.branch-2.001.patch, 
> HBASE-21810.master.001.patch
>
>


[jira] [Updated] (HBASE-21810) bulkload support set hfile compression on client

2019-01-30 Thread Yechao Chen (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-21810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yechao Chen updated HBASE-21810:

Attachment: HBASE-21810.branch-1.2.001.patch

> bulkload  support set hfile compression on client 
> --
>
> Key: HBASE-21810
> URL: https://issues.apache.org/jira/browse/HBASE-21810
> Project: HBase
>  Issue Type: Improvement
>  Components: mapreduce
>Affects Versions: 1.3.3, 1.4.9, 2.1.2, 1.2.10, 2.0.4
>Reporter: Yechao Chen
>Assignee: Yechao Chen
>Priority: Major
> Attachments: HBASE-21810.branch-1.001.patch, 
> HBASE-21810.branch-1.2.001.patch, HBASE-21810.branch-2.001.patch, 
> HBASE-21810.master.001.patch
>
>





[jira] [Updated] (HBASE-21810) bulkload support set hfile compression on client

2019-01-30 Thread Yechao Chen (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-21810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yechao Chen updated HBASE-21810:

Attachment: HBASE-21810.branch-1.001.patch

> bulkload  support set hfile compression on client 
> --
>
> Key: HBASE-21810
> URL: https://issues.apache.org/jira/browse/HBASE-21810
> Project: HBase
>  Issue Type: Improvement
>  Components: mapreduce
>Affects Versions: 2.1.2, 2.0.4
>Reporter: Yechao Chen
>Assignee: Yechao Chen
>Priority: Major
> Attachments: HBASE-21810.branch-1.001.patch, 
> HBASE-21810.branch-2.001.patch, HBASE-21810.master.001.patch
>
>





[jira] [Updated] (HBASE-21810) bulkload support set hfile compression on client

2019-01-30 Thread Yechao Chen (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-21810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yechao Chen updated HBASE-21810:

Affects Version/s: 1.3.3
   1.4.9
   1.2.10

> bulkload  support set hfile compression on client 
> --
>
> Key: HBASE-21810
> URL: https://issues.apache.org/jira/browse/HBASE-21810
> Project: HBase
>  Issue Type: Improvement
>  Components: mapreduce
>Affects Versions: 1.3.3, 1.4.9, 2.1.2, 1.2.10, 2.0.4
>Reporter: Yechao Chen
>Assignee: Yechao Chen
>Priority: Major
> Attachments: HBASE-21810.branch-1.001.patch, 
> HBASE-21810.branch-2.001.patch, HBASE-21810.master.001.patch
>
>





[jira] [Updated] (HBASE-21810) bulkload support set hfile compression on client

2019-01-30 Thread Yechao Chen (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-21810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yechao Chen updated HBASE-21810:

Attachment: HBASE-21810.branch-2.001.patch

> bulkload  support set hfile compression on client 
> --
>
> Key: HBASE-21810
> URL: https://issues.apache.org/jira/browse/HBASE-21810
> Project: HBase
>  Issue Type: Improvement
>  Components: mapreduce
>Affects Versions: 2.1.2, 2.0.4
>Reporter: Yechao Chen
>Assignee: Yechao Chen
>Priority: Major
> Attachments: HBASE-21810.branch-2.001.patch, 
> HBASE-21810.master.001.patch
>
>





[jira] [Updated] (HBASE-21810) bulkload support set hfile compression on client

2019-01-30 Thread Yechao Chen (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-21810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yechao Chen updated HBASE-21810:

Description: 
HBase bulkload (HFileOutputFormat2) generates HFiles using the compression
configured on the table's column family (CF).

It would sometimes be useful to be able to set the compression on the client
instead.

Some cases from our production environment:

1. HFile bulkload replication between data centers with limited bandwidth: we
can set the compression of the bulkloaded HFiles without changing the table
compression.

2. The bulkloaded HFiles are written without compression while the table
compression is gz/zstd/snappy...: this reduces the time needed to create the
HFiles, and compaction will eventually rewrite them with the table compression.

3. Sometimes the YARN nodes (where the reducers create the HFiles) or the
dobulkload client have no compression library while the HBase cluster does; a
client-side setting is useful in this case.

> bulkload  support set hfile compression on client 
> --
>
> Key: HBASE-21810
> URL: https://issues.apache.org/jira/browse/HBASE-21810
> Project: HBase
>  Issue Type: Improvement
>  Components: mapreduce
>Affects Versions: 2.1.2, 2.0.4
>Reporter: Yechao Chen
>Assignee: Yechao Chen
>Priority: Major
>





[jira] [Updated] (HBASE-21810) bulkload support set hfile compression on client

2019-01-30 Thread Yechao Chen (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-21810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yechao Chen updated HBASE-21810:

Attachment: HBASE-21810.master.001.patch
Status: Patch Available  (was: Open)

> bulkload  support set hfile compression on client 
> --
>
> Key: HBASE-21810
> URL: https://issues.apache.org/jira/browse/HBASE-21810
> Project: HBase
>  Issue Type: Improvement
>  Components: mapreduce
>Affects Versions: 2.0.4, 2.1.2
>Reporter: Yechao Chen
>Assignee: Yechao Chen
>Priority: Major
> Attachments: HBASE-21810.master.001.patch
>
>





[jira] [Updated] (HBASE-21810) bulkload support set hfile compression on client

2019-01-30 Thread Yechao Chen (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-21810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yechao Chen updated HBASE-21810:

Affects Version/s: 2.1.2
   2.0.4

> bulkload  support set hfile compression on client 
> --
>
> Key: HBASE-21810
> URL: https://issues.apache.org/jira/browse/HBASE-21810
> Project: HBase
>  Issue Type: Improvement
>Affects Versions: 2.1.2, 2.0.4
>Reporter: Yechao Chen
>Assignee: Yechao Chen
>Priority: Major
>






[jira] [Updated] (HBASE-21810) bulkload support set hfile compression on client

2019-01-30 Thread Yechao Chen (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-21810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yechao Chen updated HBASE-21810:

Component/s: mapreduce

> bulkload  support set hfile compression on client 
> --
>
> Key: HBASE-21810
> URL: https://issues.apache.org/jira/browse/HBASE-21810
> Project: HBase
>  Issue Type: Improvement
>  Components: mapreduce
>Affects Versions: 2.1.2, 2.0.4
>Reporter: Yechao Chen
>Assignee: Yechao Chen
>Priority: Major
>



