[
https://issues.apache.org/jira/browse/HADOOP-3464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Vinod Kumar Vavilapalli updated HADOOP-3464:
--------------------------------------------
Attachment: HADOOP-3464
Attaching first patch.
- This solves the problem of reporting errors on ringmaster side back to the
hod client, HodRing problems are still NOT addressed.
- Changes to hodlib/Common/setup.py are borrowed from the patch to
HADOOP-2961. Need merging of these two while committing.
- Also fixed another issue - earlier, any validation errors in ringmaster were
not getting logged due to late log initialization, changed this now so that
these errors can also be reported back to the hod client.
- Tested with 1) an invalid tar file e.g. a junk file 2) a non-existent path
value for hodring.java-home and 3) a non-existent path value for
gridservice-hdfs.pkgs and verified that errors are properly propagated back to
the hod client.
> [HOD] HOD can improve error messages by reporting failures on compute nodes
> back to hod client
> ----------------------------------------------------------------------------------------------
>
> Key: HADOOP-3464
> URL: https://issues.apache.org/jira/browse/HADOOP-3464
> Project: Hadoop Core
> Issue Type: Improvement
> Components: contrib/hod
> Reporter: Vinod Kumar Vavilapalli
> Assignee: Vinod Kumar Vavilapalli
> Fix For: 0.18.0
>
> Attachments: HADOOP-3464
>
>
> This issue addresses error messages w.r.t failures on compute nodes, while
> HADOOP-3151 addresses error messages in hod client.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.