[
https://issues.apache.org/jira/browse/HDFS-11096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16103867#comment-16103867
]
Sean Mackrory commented on HDFS-11096:
--------------------------------------
Updating: it was a recent change that broke this, I've posted a patch to fix it
that's being reviewed / iterated on, and I've updated my rolling upgrade test
scripts to actually confirm via the Job History Server that the jobs themselves
were FINISHED and SUCCESSFUL.
I re-ran the test with an early patch and I was able to get a successful
rolling upgrade with 5-10 minute delays between each step. So the entire
rolling upgrade of a 9-node (6 worker-node) cluster was spread out over 4 hours
and I didn't encounter any other issues, EXCEPT: in my test workload, I had to
increase Terasort's output replication, because some job failures were
occasionally happening when a job wrote to a node that was about to be taken
down for upgrades. I fixed that and no other actual compatibility issues in
Hadoop were found. I'll push the fixes out to Github soon...
> Support rolling upgrade between 2.x and 3.x
> -------------------------------------------
>
> Key: HDFS-11096
> URL: https://issues.apache.org/jira/browse/HDFS-11096
> Project: Hadoop HDFS
> Issue Type: Improvement
> Components: rolling upgrades
> Affects Versions: 3.0.0-alpha1
> Reporter: Andrew Wang
> Assignee: Lei (Eddy) Xu
> Priority: Blocker
>
> trunk has a minimum software version of 3.0.0-alpha1. This means we can't
> rolling upgrade between branch-2 and trunk.
> This is a showstopper for large deployments. Unless there are very compelling
> reasons to break compatibility, let's restore the ability to rolling upgrade
> to 3.x releases.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]