[ 
https://issues.apache.org/jira/browse/HIVE-410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashish Thusoo updated HIVE-410:
-------------------------------

    Attachment: patch-410.txt

The fix is quite simple. I have not been able to add a test case yet because 
that requires miniMR. If this goes in before the miniMR work, I will add a test 
case in a separate JIRA.


> Heartbeating for streaming jobs should not depend on stdout
> -----------------------------------------------------------
>
>                 Key: HIVE-410
>                 URL: https://issues.apache.org/jira/browse/HIVE-410
>             Project: Hadoop Hive
>          Issue Type: Bug
>            Reporter: Venky Iyer
>            Assignee: Ashish Thusoo
>            Priority: Blocker
>         Attachments: patch-410.txt
>
>
> Jobs that require iterative processing may take longer than 10 minutes to 
> produce rows. This should not be cause to kill the job. Writing dummy 
> keepalive rows to stdout is bad if the data has to go into a Hive table or 
> other Hive steps.
> If we adopt the solution of using stderr to indicate heartbeats, can that be 
> combined with streaming counters 
> (http://hadoop.apache.org/core/docs/current/streaming.html#How+do+I+update+counters+in+streaming+applications%3F
>  )? Also, will limitations on size of stderr break this?
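
For illustration only, here is a minimal Python sketch of what a stderr-based
heartbeat could look like on the script side. It is not the content of
patch-410.txt. It assumes the Hadoop version in use honors the
"reporter:status:<message>" and "reporter:counter:<group>,<counter>,<amount>"
lines described in the streaming documentation linked above, and that such
lines are treated as task progress. The Heartbeat class, the "HypotheticalScript"
counter group and the 60 second interval are made-up names and values.

```python
import sys
import time


def status_line(message):
    """Format a Hadoop streaming status update (to be written to stderr)."""
    return "reporter:status:%s\n" % message


def counter_line(group, counter, amount):
    """Format a Hadoop streaming counter update (to be written to stderr)."""
    return "reporter:counter:%s,%s,%d\n" % (group, counter, amount)


class Heartbeat(object):
    """Write a keepalive to `stream` at most once every `interval` seconds.

    Hypothetical helper: the stream and clock are injectable so the class
    can be exercised without a real Hadoop task or real waiting.
    """

    def __init__(self, interval=60.0, stream=None, clock=time.time):
        self.interval = interval
        self.stream = stream if stream is not None else sys.stderr
        self.clock = clock
        self.last = clock()

    def tick(self, message="still working"):
        """Call inside the long-running loop; returns True if lines were written."""
        now = self.clock()
        if now - self.last < self.interval:
            return False
        # Heartbeat goes to stderr only, so stdout carries nothing but real rows.
        self.stream.write(status_line(message))
        self.stream.write(counter_line("HypotheticalScript", "heartbeats", 1))
        self.stream.flush()
        self.last = now
        return True
```

A script would create one Heartbeat before its iterative step and call
tick() on every iteration. The sketch says nothing about whether limits on
the size of stderr affect this approach.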

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
