[ https://issues.apache.org/jira/browse/HADOOP-491?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Doug Cutting updated HADOOP-491:
--------------------------------

    Resolution: Fixed
        Status: Resolved  (was: Patch Available)

I just committed this.  Thanks Arun.

> streaming jobs should allow programs that don't do any IO for a long time
> -------------------------------------------------------------------------
>
>                 Key: HADOOP-491
>                 URL: https://issues.apache.org/jira/browse/HADOOP-491
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: contrib/streaming
>            Reporter: arkady borkovsky
>         Assigned To: Arun C Murthy
>             Fix For: 0.12.0
>
>         Attachments: HADOOP-491_20070205_1.patch,
> HADOOP-491_20070206_2.patch, HADOOP-491_20070212_3.patch
>
>
> The jobtracker relies on tasks to send heartbeats to know that they are still
> alive.
> There is a preset timeout of 600 seconds.
> hadoop streaming also uses input to or output from the program it spawns to
> indicate progress, sending appropriate heartbeats.
> Some spawned programs spend longer than 600 seconds without any output while
> being perfectly healthy.
> It would be good to enhance the interface between hadoop streaming and the
> programs it spawns to track a healthy program in the absence of output.
> There are certain dangers with this protocol: e.g. a task can run a separate
> thread that does nothing but send "i'm alive" messages. Abusing the API in
> such a way would be a user bug.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
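
For readers unfamiliar with the timeout described in the issue, the following is a minimal sketch of the general idea only. It is not Hadoop's implementation and does not reflect the committed patch: the names ProgressWatchdog, report_progress, is_alive and FakeClock are hypothetical, and the only value taken from the issue is the 600-second preset.

```python
import time

TASK_TIMEOUT_SECONDS = 600  # the preset timeout quoted in the issue


class ProgressWatchdog:
    """Hypothetical liveness tracker; an illustration, not Hadoop code."""

    def __init__(self, timeout=TASK_TIMEOUT_SECONDS, clock=time.monotonic):
        self.timeout = timeout
        self.clock = clock
        self.last_progress = clock()

    def report_progress(self):
        # Any sign of life resets the timer: input consumed, output
        # produced, or an explicit "still alive" message from the program.
        self.last_progress = self.clock()

    def is_alive(self):
        # The task is presumed dead once it has been silent for longer
        # than the timeout.
        return self.clock() - self.last_progress <= self.timeout


class FakeClock:
    """Manually advanced clock so the example runs without waiting."""

    def __init__(self):
        self.now = 0.0

    def __call__(self):
        return self.now

    def advance(self, seconds):
        self.now += seconds


clock = FakeClock()
task = ProgressWatchdog(clock=clock)

clock.advance(599)
healthy_but_quiet = task.is_alive()  # still within the 600 s window

clock.advance(2)
timed_out = not task.is_alive()      # 601 s of silence exceeds the timeout

task.report_progress()
revived = task.is_alive()            # an explicit report resets the timer
```

The sketch also shows the danger the reporter mentions: nothing stops a program from calling report_progress from a thread that does no real work, so a wedged task could stay "alive" indefinitely.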