Re: NPE in removing container

2016-07-12 Thread Darin Johnson
Hey Stephen, I was on vacation last week, I'm looking over the logs this week. I've got a few ideas for a first but may take me a while as I get back into work. Darin On Fri, Jul 1, 2016 at 2:43 AM, Stephen Gran wrote: > Hi, > > It's not a problem at all. Anything I

Re: NPE in removing container

2016-07-01 Thread Stephen Gran
Hi, It's not a problem at all. Anything I can do to help. I've attached the log file for the relevant time period. This is hadoop 2.7.2 - you have a good memory :) Cheers, On 30/06/16 22:56, Darin Johnson wrote: > Hey Steven, > > Looks like this might be slightly different

Re: NPE in removing container

2016-06-30 Thread Stephen Gran
Hi, Yes - the imaginatively named slave2 was a zero-sized nm at that point - I am looking at how small a pool of reserved resource I can get away with, and use FGS for burst activity. Here are all the logs related to that host:port combination around that time: 2016-06-30 19:47:43,756 INFO

Re: NPE in removing container

2016-06-30 Thread Darin Johnson
Steven, thanks. I thought I had fixed that but perhaps a regression was made in another merge. I'll look into it, can you answer a few questions? Was the node (slave2) a zero sided nodemanager (for fgs)? In the node manager logs had it recently become unhealthy? I'm pretty concerned about this

NPE in removing container

2016-06-30 Thread Stephen Gran
Hi, Just playing with the 0.2.0 release (congratulations, by the way!) I have seen this twice now, although it is by no means consistent - I will have a dozen successful runs, and then one of these. This exits the RM, which makes it rather noticable. 2016-06-30 19:47:43,952 INFO