Re: Tablet server thrift issue

2014-09-01 Thread Corey Nolet
As an update, I raised the tablet server memory and I have not seen this error thrown since. I'd like to say raising the memory alone was the solution, but it appears that I may also be having some performance issues with the switches connecting the racks together. I'll update more as I dive in

Re: Tablet server thrift issue

2014-08-22 Thread Josh Elser
If you get an error from a BatchWriter, you pretty much have to throw away that instance of the BatchWriter and make a new one. See ACCUMULO-2990. If you want, you should be able to catch/recover from this without having to restart the ingester. If the session ID is invalid, my guess is that
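The catch-and-recreate approach described above can be sketched roughly as follows. This is a minimal, self-contained illustration rather than Accumulo code: `Writer`, `WriteFailedException`, and `ResilientWriter` are hypothetical stand-ins (in a real client the roles would be played by the BatchWriter and the exception it throws), and the sketch assumes the caller still holds the batch so it can be resent through the fresh writer.

```java
import java.util.List;
import java.util.function.Supplier;

class RecreateOnFailure {
    /** Hypothetical failure type; stands in for whatever the real writer throws. */
    static class WriteFailedException extends RuntimeException {
        WriteFailedException(String message) { super(message); }
    }

    /** Hypothetical stand-in for a writer that stays broken after its first failure. */
    interface Writer {
        void write(List<String> batch);
        void close();
    }

    /** Wraps a writer factory and swaps in a fresh writer after any failure. */
    static final class ResilientWriter {
        private final Supplier<Writer> factory;
        private final int maxAttempts;
        private Writer current;   // null means "create one on next use"
        int writersCreated = 0;   // exposed only so the behaviour can be observed

        ResilientWriter(Supplier<Writer> factory, int maxAttempts) {
            if (maxAttempts < 1) {
                throw new IllegalArgumentException("maxAttempts must be >= 1");
            }
            this.factory = factory;
            this.maxAttempts = maxAttempts;
        }

        /** Tries the batch up to maxAttempts times, using a new writer after each failure. */
        void write(List<String> batch) {
            WriteFailedException last = null;
            for (int attempt = 1; attempt <= maxAttempts; attempt++) {
                if (current == null) {
                    current = factory.get();
                    writersCreated++;
                }
                try {
                    current.write(batch);
                    return; // success: keep this writer for later batches
                } catch (WriteFailedException e) {
                    last = e;
                    // The failed instance is discarded rather than reused.
                    try {
                        current.close();
                    } catch (RuntimeException ignored) {
                        // closing a broken writer may itself fail; nothing more to do
                    }
                    current = null;
                }
            }
            throw last; // every attempt failed; surface the last error
        }
    }
}
```

Bounding the number of attempts keeps a persistent server-side problem from turning into an endless retry loop; whether resending a whole batch is safe depends on the data being written.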

Re: Tablet server thrift issue

2014-08-22 Thread Corey Nolet
Thanks Josh, I understand about the session ID completely but the problem I have is that the exact same client code worked, line for line, just fine in 1.4.4 and it's acting up in 1.6.0. I also seem to remember the BatchWriter automatically creating a new session when one expired without an

Re: Tablet server thrift issue

2014-08-22 Thread Josh Elser
I don't mean to say there isn't a bug or some other problem here; that's pretty much just the limit of what I know about the server-side sessions :) If you have a concrete case of "this worked in 1.4.4 and this happens instead with 1.6.0", that'd make a great ticket :D The BatchWriter failure

Re: Tablet server thrift issue

2014-08-22 Thread Corey Nolet
Josh, your advice is definitely useful. I also thought about catching the exception and retrying with a fresh batch writer, but the fact that the batch writer failure doesn't go away without being re-instantiated is really only a nuisance. The TabletServerBatchWriter could be designed much better,