[ https://issues.apache.org/jira/browse/HBASE-16117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jonathan Hsieh updated HBASE-16117:
-----------------------------------
Attachment: hbase-16117.v2.branch-1.patch
Patch for branch-1 attached.
[~mantonov], [~busbey], [~ndimiduk]: given the minor incompatibility / semantics
change, would you like the patch in your respective branches?
The change is to a private API, and its ramifications can be seen in
TestClientTimeouts. On the other hand, regular large Spark jobs that just use
mapred.TableOutputFormat will die of ZK connection exhaustion.
> Fix Connection leak in mapred.TableOutputFormat
> ------------------------------------------------
>
> Key: HBASE-16117
> URL: https://issues.apache.org/jira/browse/HBASE-16117
> Project: HBase
> Issue Type: Bug
> Components: mapreduce
> Affects Versions: 2.0.0, 1.3.0, 1.2.2, 1.1.6
> Reporter: Jonathan Hsieh
> Assignee: Jonathan Hsieh
> Fix For: 2.0.0, 1.3.0, 1.2.2, 1.1.6
>
> Attachments: hbase-16117.branch-1.patch, hbase-16117.patch,
> hbase-16117.v2.branch-1.patch, hbase-16117.v2.patch, hbase-16117.v3.patch,
> hbase-16117.v4.patch
>
>
> Spark seems to instantiate multiple instances of output formats within a
> single process. When mapred.TableOutputFormat (not
> mapreduce.TableOutputFormat) is used, this may cause connection leaks that
> slowly exhaust the cluster's zk connections.
> This patch fixes that.
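
For readers unfamiliar with this kind of leak, here is a minimal sketch of the general pattern described above. It is not the attached patch and does not use HBase classes: FakeConnection, LeakyWriter, and OwningWriter are hypothetical stand-ins, and the open-connection counter only imitates a cluster-side connection limit. The assumption illustrated is that each writer instance opens its own connection, so many writers in one process leak many connections unless close() releases them.

```java
import java.util.concurrent.atomic.AtomicInteger;

public class ConnectionLeakSketch {
    // Counts connections currently open; stands in for a server-side limit.
    static final AtomicInteger OPEN = new AtomicInteger();

    // Hypothetical stand-in for a cluster connection (not an HBase class).
    static class FakeConnection implements AutoCloseable {
        private boolean closed;

        FakeConnection() {
            OPEN.incrementAndGet();
        }

        @Override
        public void close() {
            if (!closed) {
                closed = true;
                OPEN.decrementAndGet();
            }
        }
    }

    // Leaky pattern: the writer opens a connection but never releases it.
    static class LeakyWriter implements AutoCloseable {
        private final FakeConnection conn = new FakeConnection();

        @Override
        public void close() {
            // Forgets to close conn, so it stays open for the process lifetime.
        }
    }

    // Fixed pattern: the writer owns its connection and closes it with itself.
    static class OwningWriter implements AutoCloseable {
        private final FakeConnection conn = new FakeConnection();

        @Override
        public void close() {
            conn.close();
        }
    }

    // Creates and closes n writers, returning how many connections stay open.
    static int leakedBy(int n, boolean leaky) {
        int before = OPEN.get();
        for (int i = 0; i < n; i++) {
            try (AutoCloseable w = leaky ? new LeakyWriter() : new OwningWriter()) {
                // A real writer would buffer and flush mutations here.
            } catch (Exception e) {
                throw new IllegalStateException(e);
            }
        }
        return OPEN.get() - before;
    }

    public static void main(String[] args) {
        System.out.println("leaky writers left open:  " + leakedBy(100, true));
        System.out.println("owning writers left open: " + leakedBy(100, false));
    }
}
```

In this sketch, 100 leaky writers leave 100 connections open, while 100 owning writers leave none. Tying the connection's lifetime to the object that created it is one common way to avoid this class of leak; whether the attached patches take exactly this shape should be checked against the patches themselves.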