GitHub user jackylk opened a pull request:
https://github.com/apache/spark/pull/3133
[SPARK-4269][SQL] make wait time configurable in BroadcastHashJoin
In BroadcastHashJoin, currently it is using a hard coded value (5 minutes)
to wait for the execution and broadcast of the small table.
In my opinion, it should be a configurable value since broadcast may exceed
5 minutes in some case, like in a busy/congested network environment.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/jackylk/spark timeout-config
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/3133.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #3133
----
commit 81a5e20e2f2b8dbf4ee27f246d3f5396b4bf442b
Author: Jacky Li <[email protected]>
Date: 2014-11-06T09:33:20Z
make wait time configurable in BroadcastHashJoin
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]