2018-03-08 11:57 GMT+01:00 Ivan Kelly <iv...@apache.org>:

> Hmm, yes, looks messed up. Do you have a reliable repro of this?
>
> "at io.netty.channel.DefaultChannelPipeline.onUnhandledInboundMessage"
> <- it seems the message itself wasn't handled. Do you have any more
> bookie side logs leading up to this event?
>


The only suspicious logs are for logger named
"org.apache.bookkeeper.bookie.EntryLogger"


18-03-08-09-22-40       Created new entry log file
/data/qasql/bookie/./bookie_data/current/4b0.log for logId 1200.
18-03-08-09-32-41       Failed to get ledgers map index from: 53.log : No
ledgers map index found on entryLogId53
18-03-08-09-33-10       Failed to get ledgers map index from: 54.log : No
ledgers map index found on entryLogId54
18-03-08-09-33-10       Failed to get ledgers map index from: 375.log : No
ledgers map index found on entryLogId375
18-03-08-09-33-13       Failed to get ledgers map index from: 878.log : No
ledgers map index found on entryLogId878
18-03-08-09-33-22       Failed to get ledgers map index from: 1197.log : No
ledgers map index found on entryLogId1197
18-03-08-09-33-57       Failed to get ledgers map index from: 1198.log : No
ledgers map index found on entryLogId1198
18-03-08-09-34-54       Failed to get ledgers map index from: 1199.log : No
ledgers map index found on entryLogId1199

I will try to run Bookie with level debug, but it produces lots of spam

Thanks
Enrico


>
> -Ivan
>
> On Thu, Mar 8, 2018 at 9:18 AM, Enrico Olivelli <eolive...@gmail.com>
> wrote:
> > Hi all,
> > I am seeing this bad errors in some test environments with 4.6.1
> > The errors appear during rolling restart of the application, the test
> env is
> > made of 6 machines:
> > - 3 bookies
> > - 3 client machines (with several BK clients, of different sub-systems)
> > - running with 4.6.1 both client and servers
> >
> > *I do not have* reports of this errors from production, already running
> > 4.6.1 for the last month
> >
> > But the problem is quite scary
> >
> > This is a sample of relevant errors on clients (in this case Majordodo
> > brokers, with log level = INFO)
> >
> > logs/org.apache.bookkeeper.client.PendingAddOp:
> > 18-03-07-09-51-43       Write of ledger entry to quorum failed: L366634
> > E2557
> > 18-03-07-09-51-43       Write of ledger entry to quorum failed: L366634
> > E2558
> > 18-03-07-09-51-43       Write of ledger entry to quorum failed: L366634
> > E2559
> > 18-03-07-15-59-55       Failed to write entry (366680, 1865): Bookie
> > operation timeout
> > 18-03-07-15-59-55       Failed to write entry (366680, 1865): Bookie
> > operation timeout
> > 18-03-07-16-00-00       Failed to write entry (366680, 1865): Bookie
> > operation timeout
> > 18-03-07-16-00-00       Failed to write entry (366680, 1865): Bookie
> > operation timeout
> > 18-03-07-16-00-05       Failed to write entry (366680, 1865): Bookie
> > operation timeout
> >
> >
> > org.apache.bookkeeper.proto.PerChannelBookieClient:
> >
> > 18-03-07-10-06-44       Unexpected exception caught by bookie client
> channel
> > handler
> > 18-03-07-10-06-44       io.netty.handler.codec.DecoderException:
> > java.lang.IllegalStateException: Received unknown response : op code = 6
> > io.netty.handler.codec.DecoderException: java.lang.
> IllegalStateException:
> > Received unknown response : op code = 6
> >         at
> > io.netty.handler.codec.MessageToMessageDecoder.channelRead(
> MessageToMessageDecoder.java:98)
> >         at
> > io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(
> AbstractChannelHandlerContext.java:362)
> >         at
> > io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(
> AbstractChannelHandlerContext.java:348)
> >         at
> > io.netty.channel.AbstractChannelHandlerContext.fireChannelRead(
> AbstractChannelHandlerContext.java:340)
> >         at
> > io.netty.handler.codec.ByteToMessageDecoder.fireChannelRead(
> ByteToMessageDecoder.java:310)
> >         at
> > io.netty.handler.codec.ByteToMessageDecoder.fireChannelRead(
> ByteToMessageDecoder.java:297)
> >         at
> > io.netty.handler.codec.ByteToMessageDecoder.callDecode(
> ByteToMessageDecoder.java:413)
> >         at
> > io.netty.handler.codec.ByteToMessageDecoder.channelRead(
> ByteToMessageDecoder.java:265)
> >         at
> > io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(
> AbstractChannelHandlerContext.java:362)
> >         at
> > io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(
> AbstractChannelHandlerContext.java:348)
> >         at
> > io.netty.channel.AbstractChannelHandlerContext.fireChannelRead(
> AbstractChannelHandlerContext.java:340)
> >         at
> > io.netty.channel.DefaultChannelPipeline$HeadContext.channelRead(
> DefaultChannelPipeline.java:1414)
> >         at
> > io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(
> AbstractChannelHandlerContext.java:362)
> >         at
> > io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(
> AbstractChannelHandlerContext.java:348)
> >         at
> > io.netty.channel.DefaultChannelPipeline.fireChannelRead(
> DefaultChannelPipeline.java:945)
> >         at
> > io.netty.channel.epoll.AbstractEpollStreamChannel$EpollStreamUnsafe.
> epollInReady(AbstractEpollStreamChannel.java:800)
> >         at
> > io.netty.channel.epoll.EpollEventLoop.processReady(
> EpollEventLoop.java:404)
> >         at
> > io.netty.channel.epoll.EpollEventLoop.run(EpollEventLoop.java:304)
> >         at
> > io.netty.util.concurrent.SingleThreadEventExecutor$5.
> run(SingleThreadEventExecutor.java:886)
> >         at java.base/java.lang.Thread.run(Thread.java:844)
> > Caused by: java.lang.IllegalStateException: Received unknown response :
> op
> > code = 6
> >         at
> > org.apache.bookkeeper.proto.BookieProtoEncoding$
> ResponseEnDeCoderPreV3.decode(BookieProtoEncoding.java:294)
> >         at
> > org.apache.bookkeeper.proto.BookieProtoEncoding$ResponseDecoder.decode(
> BookieProtoEncoding.java:478)
> >         at
> > io.netty.handler.codec.MessageToMessageDecoder.channelRead(
> MessageToMessageDecoder.java:88)
> >
> >
> >         ... 19 more
> >
> >
> > Errors on Bookie side, with log level = INFO)
> >
> >
> > io.netty.channel.DefaultChannelPipeline:
> > 8-03-07-09-51-38       An exceptionCaught() event was fired, and it
> reached
> > at the tail of the pipeline. It usually means the last handler in the
> > pipeline did not handle the exception.
> > 18-03-07-09-51-38       io.netty.util.IllegalReferenceCountException:
> > refCnt: 0, increment: 1
> > io.netty.util.IllegalReferenceCountException: refCnt: 0, increment: 1
> >         at
> > io.netty.buffer.AbstractReferenceCountedByteBuf.release0(
> AbstractReferenceCountedByteBuf.java:100)
> >         at
> > io.netty.buffer.AbstractReferenceCountedByteBuf.release(
> AbstractReferenceCountedByteBuf.java:84)
> >         at
> > io.netty.util.ReferenceCountUtil.release(ReferenceCountUtil.java:88)
> >         at
> > io.netty.channel.DefaultChannelPipeline.onUnhandledInboundMessage(
> DefaultChannelPipeline.java:1202)
> >         at
> > io.netty.channel.DefaultChannelPipeline$TailContext.channelRead(
> DefaultChannelPipeline.java:1299)
> >         at
> > io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(
> AbstractChannelHandlerContext.java:362)
> >         at
> > io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(
> AbstractChannelHandlerContext.java:348)
> >         at
> > io.netty.channel.AbstractChannelHandlerContext.fireChannelRead(
> AbstractChannelHandlerContext.java:340)
> >         at
> > io.netty.channel.ChannelInboundHandlerAdapter.channelRead(
> ChannelInboundHandlerAdapter.java:86)
> >         at
> > io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(
> AbstractChannelHandlerContext.java:362)
> >         at
> > io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(
> AbstractChannelHandlerContext.java:348)
> >         at
> > io.netty.channel.AbstractChannelHandlerContext.fireChannelRead(
> AbstractChannelHandlerContext.java:340)
> >         at
> > org.apache.bookkeeper.proto.BookieRequestHandler.channelRead(
> BookieRequestHandler.java:78)
> >         at
> > io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(
> AbstractChannelHandlerContext.java:362)
> >         at
> > io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(
> AbstractChannelHandlerContext.java:348)
> >         at
> > io.netty.channel.AbstractChannelHandlerContext.fireChannelRead(
> AbstractChannelHandlerContext.java:340)
> >         at
> > io.netty.channel.ChannelInboundHandlerAdapter.channelRead(
> ChannelInboundHandlerAdapter.java:86)
> >         at
> > org.apache.bookkeeper.proto.AuthHandler$ServerSideHandler.
> channelRead(AuthHandler.java:88)
> >         at
> > io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(
> AbstractChannelHandlerContext.java:362)
> >         at
> > io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(
> AbstractChannelHandlerContext.java:348)
> >         at
> > io.netty.channel.AbstractChannelHandlerContext.fireChannelRead(
> AbstractChannelHandlerContext.java:340)
> >         at
> > io.netty.handler.codec.MessageToMessageDecoder.channelRead(
> MessageToMessageDecoder.java:102)
> >         at
> > io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(
> AbstractChannelHandlerContext.java:362)
> >         at
> > io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(
> AbstractChannelHandlerContext.java:348)
> >         at
> > io.netty.channel.AbstractChannelHandlerContext.fireChannelRead(
> AbstractChannelHandlerContext.java:340)
> >         at
> > io.netty.handler.codec.ByteToMessageDecoder.fireChannelRead(
> ByteToMessageDecoder.java:310)
> >         at
> > io.netty.handler.codec.ByteToMessageDecoder.fireChannelRead(
> ByteToMessageDecoder.java:297)
> >         at
> > io.netty.handler.codec.ByteToMessageDecoder.callDecode(
> ByteToMessageDecoder.java:413)
> >         at
> > io.netty.handler.codec.ByteToMessageDecoder.channelRead(
> ByteToMessageDecoder.java:265)
> >         at
> > io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(
> AbstractChannelHandlerContext.java:362)
> >         at
> > io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(
> AbstractChannelHandlerContext.java:348)
> >         at
> > io.netty.channel.AbstractChannelHandlerContext.fireChannelRead(
> AbstractChannelHandlerContext.java:340)
> >         at
> > io.netty.channel.DefaultChannelPipeline$HeadContext.channelRead(
> DefaultChannelPipeline.java:1414)
> >         at
> > io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(
> AbstractChannelHandlerContext.java:362)
> >         at
> > io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(
> AbstractChannelHandlerContext.java:348)
> >         at
> > io.netty.channel.DefaultChannelPipeline.fireChannelRead(
> DefaultChannelPipeline.java:945)
> >         at
> > io.netty.channel.epoll.AbstractEpollStreamChannel$EpollStreamUnsafe.
> epollInReady(AbstractEpollStreamChannel.java:800)
> >         at
> > io.netty.channel.epoll.EpollEventLoop.processReady(
> EpollEventLoop.java:404)
> >         at
> > io.netty.channel.epoll.EpollEventLoop.run(EpollEventLoop.java:304)
> >         at
> > io.netty.util.concurrent.SingleThreadEventExecutor$5.
> run(SingleThreadEventExecutor.java:886)
> >         at java.base/java.lang.Thread.run(Thread.java:844)
> >
> >
>

Reply via email to