Re: [VOTE] Should we release 0.20.204.0rc2?

2011-08-18 Thread Owen O'Malley

On Aug 9, 2011, at 8:55 AM, Owen O'Malley wrote:

 All,
  Matt rolled a 0.20.204.0rc1, but I think it got lost in the previous vote 
 thread. Unfortunately, it had the version as 0.20.204 and didn't update the 
 release notes. I've updated it, run the regression tests and I think we 
 should release it. I've put the tarball up at:
 
 http://people.apache.org/~omalley/hadoop-0.20.204.0-rc2

This vote is still running with no votes other than mine. 

I've tested with and without security on a 60 node cluster and I'm seeing some 
failures, but not that many. On a terasort with 15,000 maps and 200 reduces, I 
ran the following cases:

security + linux task controller : 2 failures (both mr-2651)

no security + default task controller : 6-7 failures (seems to be a race 
condition in clean up)

Even in the no security case, it is only losing 0.05% of the time.

It isn't perfect, but this is the code that Yahoo is currently running. I think 
we should release it.

-- Owen

Re: [VOTE] Should we release 0.20.204.0rc2?

2011-08-18 Thread Allen Wittenauer

On Aug 18, 2011, at 12:28 AM, Owen O'Malley wrote:
 
 This vote is still running with no votes other than mine. 
 
 I've tested with and without security on a 60 node cluster and I'm seeing 
 some failures, but not that many. On a terasort with 15,000 maps and 200 
 reduces, I ran the following cases:
 
 security + linux task controller : 2 failures (both mr-2651)
 
 no security + default task controller : 6-7 failures (seems to be a race 
 condition in clean up)
 
 Even in the no security case, it is only losing 0.05% of the time.

We're seeing much much higher failure rates.  In the 5-10% area.  It 
might very well be because we have more cores/faster boxes.

 It isn't perfect, but this is the code that Yahoo is currently running. I 
 think we should release it.

Y! can afford the task failures.  The rest of us can't.  So -1.

Re: [VOTE] Should we release 0.20.204.0rc2?

2011-08-18 Thread Jakob Homan
 This vote is still running with no votes other than mine.

The vote was started 9 days ago and, if it kept running after Allen's
vote, it would have ended three days ago with the result of 1-0 to
release.  As such, let's call this release 2Owen3.


Re: [VOTE] Should we release 0.20.204.0rc2?

2011-08-18 Thread Mahadev Konar
Jokes aside,
  I intend to try it out on a set of 10 nodes or so and see if it works or not. 
If it works fine (barring a few task failures), I dont see any reason why this 
cannot be released.
 
thanks
mahadev

On Aug 18, 2011, at 11:35 AM, Jakob Homan wrote:

 This vote is still running with no votes other than mine.
 
 The vote was started 9 days ago and, if it kept running after Allen's
 vote, it would have ended three days ago with the result of 1-0 to
 release.  As such, let's call this release 2Owen3.



Re: [VOTE] Should we release 0.20.204.0rc2?

2011-08-10 Thread Allen Wittenauer

On Aug 9, 2011, at 8:55 AM, Owen O'Malley wrote:

 All,
  Matt rolled a 0.20.204.0rc1, but I think it got lost in the previous vote 
 thread. Unfortunately, it had the version as 0.20.204 and didn't update the 
 release notes. I've updated it, run the regression tests and I think we 
 should release it. I've put the tarball up at:
 
 http://people.apache.org/~omalley/hadoop-0.20.204.0-rc2
 
 Please check it out and vote on it.

-1 (non-binding)

https://issues.apache.org/jira/browse/HDFS-2244




Re: [VOTE] Should we release 0.20.204.0rc2?

2011-08-10 Thread Allen Wittenauer

On Aug 9, 2011, at 8:55 AM, Owen O'Malley wrote:

 All,
  Matt rolled a 0.20.204.0rc1, but I think it got lost in the previous vote 
 thread. Unfortunately, it had the version as 0.20.204 and didn't update the 
 release notes. I've updated it, run the regression tests and I think we 
 should release it. I've put the tarball up at:
 
 http://people.apache.org/~omalley/hadoop-0.20.204.0-rc2
 
 Please check it out and vote on it.


After playing with it this afternoon, it would be good if other people 
did too.  I've already talked to Owen about some of the weirdness I'm seeing, 
but it would be useful to see if other people can independently verify.  I'm 
not going to tell you what they are... but run a sizable job and see if you see 
anything odd. :)