Re: [VOTE] Should we release 0.20.204.0rc2?
On Aug 9, 2011, at 8:55 AM, Owen O'Malley wrote:

> All,
> Matt rolled a 0.20.204.0rc1, but I think it got lost in the previous vote thread. Unfortunately, it had the version as 0.20.204 and didn't update the release notes. I've updated it, run the regression tests and I think we should release it. I've put the tarball up at:
>
> http://people.apache.org/~omalley/hadoop-0.20.204.0-rc2

This vote is still running with no votes other than mine.

I've tested with and without security on a 60 node cluster and I'm seeing some failures, but not that many. On a terasort with 15,000 maps and 200 reduces, I ran the following cases:

security + linux task controller : 2 failures (both mr-2651)
no security + default task controller : 6-7 failures (seems to be a race condition in clean up)

Even in the no security case, it is only losing 0.05% of the time.

It isn't perfect, but this is the code that Yahoo is currently running. I think we should release it.

-- Owen
Re: [VOTE] Should we release 0.20.204.0rc2?
On Aug 18, 2011, at 12:28 AM, Owen O'Malley wrote:

> This vote is still running with no votes other than mine.
>
> I've tested with and without security on a 60 node cluster and I'm seeing some failures, but not that many. On a terasort with 15,000 maps and 200 reduces, I ran the following cases:
>
> security + linux task controller : 2 failures (both mr-2651)
> no security + default task controller : 6-7 failures (seems to be a race condition in clean up)
>
> Even in the no security case, it is only losing 0.05% of the time.

We're seeing much much higher failure rates. In the 5-10% area. It might very well be because we have more cores/faster boxes.

> It isn't perfect, but this is the code that Yahoo is currently running. I think we should release it.

Y! can afford the task failures. The rest of us can't. So -1.
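To put the two failure figures in this exchange side by side, here is a rough arithmetic sketch. It assumes the percentages are measured against the total task count (15,000 maps plus 200 reduces), which the thread does not state explicitly; the function names are made up for illustration.

```python
# Task counts from the terasort run described in the thread.
MAPS = 15_000
REDUCES = 200
# Assumption: failure rates are taken over all tasks (maps + reduces).
TOTAL_TASKS = MAPS + REDUCES


def failure_rate_percent(failures: int, total: int = TOTAL_TASKS) -> float:
    """Return the given number of failures as a percentage of all tasks."""
    return 100.0 * failures / total


def expected_failures(rate_percent: float, total: int = TOTAL_TASKS) -> int:
    """Return how many failed tasks a given percentage rate would imply."""
    return round(total * rate_percent / 100.0)


if __name__ == "__main__":
    # Failure counts reported for the 60 node cluster.
    for failures in (2, 6, 7):
        print(f"{failures} failures -> {failure_rate_percent(failures):.3f}%")
    # Rates reported in the reply, applied to a job of the same size.
    for rate in (5, 10):
        print(f"{rate}% -> about {expected_failures(rate)} failed tasks")
```

Under that assumption, 6-7 failures comes out at roughly 0.04-0.05%, in line with the quoted figure, while a 5-10% rate on a job of the same size would mean hundreds of failed tasks.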
Re: [VOTE] Should we release 0.20.204.0rc2?
> This vote is still running with no votes other than mine.

The vote was started 9 days ago and, if it kept running after Allen's vote, it would have ended three days ago with the result of 1-0 to release. As such, let's call this release 2Owen3.
Re: [VOTE] Should we release 0.20.204.0rc2?
Jokes aside, I intend to try it out on a set of 10 nodes or so and see if it works or not. If it works fine (barring a few task failures), I don't see any reason why this cannot be released.

thanks
mahadev

On Aug 18, 2011, at 11:35 AM, Jakob Homan wrote:

>> This vote is still running with no votes other than mine.
>
> The vote was started 9 days ago and, if it kept running after Allen's vote, it would have ended three days ago with the result of 1-0 to release. As such, let's call this release 2Owen3.
Re: [VOTE] Should we release 0.20.204.0rc2?
On Aug 9, 2011, at 8:55 AM, Owen O'Malley wrote:

> All,
> Matt rolled a 0.20.204.0rc1, but I think it got lost in the previous vote thread. Unfortunately, it had the version as 0.20.204 and didn't update the release notes. I've updated it, run the regression tests and I think we should release it. I've put the tarball up at:
>
> http://people.apache.org/~omalley/hadoop-0.20.204.0-rc2
>
> Please check it out and vote on it.

-1 (non-binding)

https://issues.apache.org/jira/browse/HDFS-2244
Re: [VOTE] Should we release 0.20.204.0rc2?
On Aug 9, 2011, at 8:55 AM, Owen O'Malley wrote:

> All,
> Matt rolled a 0.20.204.0rc1, but I think it got lost in the previous vote thread. Unfortunately, it had the version as 0.20.204 and didn't update the release notes. I've updated it, run the regression tests and I think we should release it. I've put the tarball up at:
>
> http://people.apache.org/~omalley/hadoop-0.20.204.0-rc2
>
> Please check it out and vote on it.

After playing with it this afternoon, it would be good if other people did too. I've already talked to Owen about some of the weirdness I'm seeing, but it would be useful to see if other people can independently verify.

I'm not going to tell you what they are... but run a sizable job and see if you see anything odd. :)