[
https://issues.apache.org/jira/browse/MAPREDUCE-6978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
rangjiaheng updated MAPREDUCE-6978:
-----------------------------------
Description:
Environment:
NM1 TaskCounter.class old version;
NM2 TaskCounter.class new version (new Enumeration values appended);
Result:
When an MR app's AM running on NM1, and it's containers on NM2; the containers
on NM2 will all failed, AM cause OutOfBoundsException;
Reason:
When app running, containers will report their counters to AM through RPC,
while the Container with new version TaskCounter.class will write more Counter
values to RPC; however, the AM with old version TaskCounter.class which can not
read them correctly from RPC.
was:
Environment:
NM1 TaskCounter.class old version;
NM2 TaskCounter.class new version (new Enumeration values appended);
Result:
When an MR app's AM running on NM1, and it's containers on NM2; the containers
on NM2 will all failed, AM cause OutOfBoundsException;
Reason:
When app running, containers will report their counters to AM through RPC,
while the Container with new version TaskCounter.class will white more Counter
values to RPC; however, the AM with old version TaskCounter.class which can not
read them correctly from RPC.
> MR task counters deserialized through RPC throws OutOfBoundsException if
> Counter enum class version not match
> -------------------------------------------------------------------------------------------------------------
>
> Key: MAPREDUCE-6978
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-6978
> Project: Hadoop Map/Reduce
> Issue Type: Improvement
> Components: mr-am, task
> Affects Versions: 3.0.0-alpha4
> Environment: NM1 TaskCounter.class old version;
> NM2 TaskCounter.class new version (new Enumeration values appended);
> Reporter: rangjiaheng
>
> Environment:
> NM1 TaskCounter.class old version;
> NM2 TaskCounter.class new version (new Enumeration values appended);
> Result:
> When an MR app's AM running on NM1, and it's containers on NM2; the
> containers on NM2 will all failed, AM cause OutOfBoundsException;
> Reason:
> When app running, containers will report their counters to AM through RPC,
> while the Container with new version TaskCounter.class will write more
> Counter values to RPC; however, the AM with old version TaskCounter.class
> which can not read them correctly from RPC.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]