[jira] Commented: (MAPREDUCE-1266) Allow heartbeat interval smaller than 3 seconds for tiny clusters

Todd Lipcon (JIRA) Fri, 04 Dec 2009 14:55:47 -0800

    [ https://issues.apache.org/jira/browse/MAPREDUCE-1266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786219#action_12786219 ]


Todd Lipcon commented on MAPREDUCE-1266:
----------------------------------------

Well, actually, in trunk there's mapreduce.jobtracker.heartbeats.in.second, 
which sets each tracker's interval so that, across the whole cluster, that 
many heartbeats arrive every second. The default is 100, which would be a 
10ms interval for a pseudo-distributed cluster, which is silly. So there's 
also a hardcoded minimum. Here's the relevant code:
{code}
    int heartbeatInterval = Math.max(
        (int) (1000 * HEARTBEATS_SCALING_FACTOR *
               Math.ceil((double) clusterSize / NUM_HEARTBEATS_IN_SECOND)),
        HEARTBEAT_INTERVAL_MIN);
{code}

HEARTBEAT_INTERVAL_MIN is hardcoded to 3 seconds in MRConstants.java.
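
To make the effect of the floor concrete, here is a minimal standalone sketch 
of the same calculation, not the actual JobTracker code. The class and method 
names are hypothetical, and the scaling factor of 0.01 is an assumption picked 
so that the unclamped single-node result matches the 10ms figure above. The 
100 heartbeats per second and the 3000ms floor come from this comment, and 
500ms is the value proposed in the issue.

```java
// Standalone sketch of the heartbeat interval calculation quoted above.
// All names here are hypothetical; this is not the JobTracker source.
final class HeartbeatIntervalSketch {

    /**
     * Computes the heartbeat interval in milliseconds for a cluster.
     *
     * @param clusterSize         number of task trackers
     * @param heartbeatsPerSecond target heartbeats per second, cluster-wide
     * @param scalingFactor       multiplier applied to the raw interval
     *                            (value used below is an assumption)
     * @param minIntervalMs       floor for the interval, in milliseconds
     */
    static int heartbeatIntervalMs(int clusterSize,
                                   int heartbeatsPerSecond,
                                   double scalingFactor,
                                   int minIntervalMs) {
        // Same shape as the quoted expression: scale by cluster size,
        // then clamp to the minimum.
        int scaled = (int) (1000 * scalingFactor
                * Math.ceil((double) clusterSize / heartbeatsPerSecond));
        return Math.max(scaled, minIntervalMs);
    }

    public static void main(String[] args) {
        int perSecond = 100;   // default mentioned in the comment
        double factor = 0.01;  // assumed value, see lead-in
        int[] sizes = {1, 5, 1000, 50000};
        for (int size : sizes) {
            System.out.printf("size=%d current=%dms proposed=%dms%n",
                    size,
                    heartbeatIntervalMs(size, perSecond, factor, 3000),
                    heartbeatIntervalMs(size, perSecond, factor, 500));
        }
    }
}
```

Under these assumptions the floor decides the result for tiny clusters, so 
lowering it only changes behavior where the size-based term is below the 
current minimum.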

Maybe I'm misunderstanding your question: are you in favor of lowering the 
minimum, and just asking why I'd make it configurable through an undocumented 
option rather than leave it hardcoded? I offered the undocumented option only 
in case someone had an argument against this change. If everyone's for it, 
I'm happy to just change the constant.

> Allow heartbeat interval smaller than 3 seconds for tiny clusters
> -----------------------------------------------------------------
>
>                 Key: MAPREDUCE-1266
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1266
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>          Components: jobtracker, task, tasktracker
>    Affects Versions: 0.22.0
>            Reporter: Todd Lipcon
>            Priority: Minor
>
> For small clusters, the heartbeat interval has a large effect on job latency. 
> This is especially true on pseudo-distributed or other "tiny" (<5 nodes) 
> clusters. It's not a big deal for production, but new users would have a 
> happier first experience if Hadoop seemed snappier.
> I'd like to change the minimum heartbeat interval from 3.0 seconds to perhaps 
> 0.5 seconds (but have it governed by an undocumented config parameter in case 
> people don't like this change). The cluster size-based ramp up of interval 
> will maintain the current scalable behavior for large clusters with no 
> negative effect.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
