I had the same issue, servers were on aws same lane, means very remote
chance of disconnection but finally we found there were packet drops.
On Feb 10, 2016 3:39 AM, "Paul Friedman" <[email protected]>
wrote:

> Thanks for the reply.  Since the 2 machines are on the same LAN (no
> firewall
> in between), does the Drill JDBC driver (or drill-embedded server) have any
> timeouts which can be increased?
>
> Interestingly, the client side (JDBC) doesn't notice that the server side
> (Drill-embedded) has disconnected.
>
> ---Paul
>
>
> -----Original Message-----
> From: Nirav Shah [mailto:[email protected]]
> Sent: Tuesday, February 09, 2016 11:38 AM
> To: [email protected]
> Subject: Re: Help with error message...
>
> From the logs it looks like network drop between nodes.
> If it fails on exact time say 10 min than check with firewall settings.
> On Feb 10, 2016 12:27 AM, "Paul Friedman"
> <[email protected]>
> wrote:
>
> > Hello...
> >
> > I'm executing a long-running Drill (1.4) query (4-10mins) called via
> > JDBC from Talend and sometimes I'm seeing an error stack like this
> > (see below)
> >
> > The query is a select statement with an order by against a directory
> > of Parquet files which were produced by Spark.  Probably half the time
> > it succeeds and returns the expected results, but often it's erroring
> > out as below.
> >
> > Can you help with any insights?
> >
> > Thanks in advance.
> >
> > ---Paul
> >
> > ...
> > 2016-02-08 16:47:47,275
> > [2946cbe3-e73d-2ed4-da60-76c1bd799372:frag:1:0]
> > INFO
> > o.a.d.e.w.fragment.FragmentExecutor -
> > 2946cbe3-e73d-2ed4-da60-76c1bd799372:1:0: State change requested
> > RUNNING
> > -->
> > FINISHED
> > 2016-02-08 16:47:47,276
> > [2946cbe3-e73d-2ed4-da60-76c1bd799372:frag:1:0]
> > INFO
> > o.a.d.e.w.f.FragmentStatusReporter -
> > 2946cbe3-e73d-2ed4-da60-76c1bd799372:1:0: State to report: FINISHED
> > 2016-02-08 16:48:25,496 [UserServer-1] INFO
> > o.a.d.e.w.fragment.FragmentExecutor -
> > 2946cbe3-e73d-2ed4-da60-76c1bd799372:0:0: State change requested
> > RUNNING
> > -->
> > FAILED
> > 2016-02-08 16:48:25,778
> > [2946cbe3-e73d-2ed4-da60-76c1bd799372:frag:0:0]
> > INFO
> > o.a.d.e.w.fragment.FragmentExecutor -
> > 2946cbe3-e73d-2ed4-da60-76c1bd799372:0:0: State change requested
> > FAILED --> FAILED
> > 2016-02-08 16:48:25,779 [UserServer-1] INFO
> > o.a.d.e.w.fragment.FragmentExecutor -
> > 2946cbe3-e73d-2ed4-da60-76c1bd799372:0:0: State change requested
> > FAILED --> FAILED
> > 2016-02-08 16:48:25,779 [CONTROL-rpc-event-queue] INFO
> > o.a.d.e.w.fragment.FragmentExecutor -
> > 2946cbe3-e73d-2ed4-da60-76c1bd799372:0:0: State change requested
> > FAILED --> CANCELLATION_REQUESTED
> > 2016-02-08 16:48:25,779 [CONTROL-rpc-event-queue] WARN
> > o.a.d.e.w.fragment.FragmentExecutor -
> > 2946cbe3-e73d-2ed4-da60-76c1bd799372:0:0: Ignoring unexpected state
> > transition FAILED --> CANCELLATION_REQUESTED
> > 2016-02-08 16:48:25,779
> > [2946cbe3-e73d-2ed4-da60-76c1bd799372:frag:0:0]
> > INFO
> > o.a.d.e.w.fragment.FragmentExecutor -
> > 2946cbe3-e73d-2ed4-da60-76c1bd799372:0:0: State change requested
> > FAILED --> FAILED
> > 2016-02-08 16:48:25,780
> > [2946cbe3-e73d-2ed4-da60-76c1bd799372:frag:0:0]
> > INFO
> > o.a.d.e.w.fragment.FragmentExecutor -
> > 2946cbe3-e73d-2ed4-da60-76c1bd799372:0:0: State change requested
> > FAILED --> FINISHED
> > 2016-02-08 16:48:25,781 [UserServer-1] WARN
> > o.a.d.exec.rpc.RpcExceptionHandler - Exception occurred with closed
> > channel.
> > Connection: /172.20.20.154:31010 <--> /172.20.20.157:64101 (user
> > client)
> > java.nio.channels.ClosedChannelException: null
> > 2016-02-08 16:48:25,783
> > [2946cbe3-e73d-2ed4-da60-76c1bd799372:frag:0:0]
> > ERROR o.a.d.e.w.fragment.FragmentExecutor - SYSTEM ERROR:
> > ChannelClosedException: Channel closed /172.20.20.154:31010 <-->
> > /172.20.20.157:64101.
> >
> > Fragment 0:0
> >
> > [Error Id: 2f075631-fb49-4feb-b39d-cbe89083a2ee on
> > chai.dev.streetlightdata.com:31010]
> > org.apache.drill.common.exceptions.UserException: SYSTEM ERROR:
> > ChannelClosedException: Channel closed /172.20.20.154:31010 <-->
> > /172.20.20.157:64101.
> >
> > Fragment 0:0
> >
> > [Error Id: 2f075631-fb49-4feb-b39d-cbe89083a2ee on
> > chai.dev.streetlightdata.com:31010]
> >         at
> >
> > org.apache.drill.common.exceptions.UserException$Builder.build(UserExc
> > eption.java:534)
> > ~[drill-common-1.4.0.jar:1.4.0]
> >         at
> >
> > org.apache.drill.exec.work.fragment.FragmentExecutor.sendFinalState(Fr
> > agmentExecutor.java:321)
> > [drill-java-exec-1.4.0.jar:1.4.0]
> >         at
> >
> > org.apache.drill.exec.work.fragment.FragmentExecutor.cleanup(FragmentE
> > xecutor.java:184)
> > [drill-java-exec-1.4.0.jar:1.4.0]
> >         at
> >
> > org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecu
> > tor.java:290)
> > [drill-java-exec-1.4.0.jar:1.4.0]
> >         at
> >
> > org.apache.drill.common.SelfCleaningRunnable.run(SelfCleaningRunnable.
> > java:38)
> > [drill-common-1.4.0.jar:1.4.0]
> >         at
> >
> > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.j
> > ava:1142)
> > [na:1.8.0_66]
> >         at
> >
> > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.
> > java:617)
> > [na:1.8.0_66]
> >         at java.lang.Thread.run(Thread.java:745) [na:1.8.0_66] Caused
> > by: org.apache.drill.exec.rpc.ChannelClosedException: Channel closed
> > /172.20.20.154:31010 <--> /172.20.20.157:64101.
> >         at
> >
> > org.apache.drill.exec.rpc.RpcBus$ChannelClosedHandler.operationComplet
> > e(RpcBus.java:175)
> > ~[drill-rpc-1.4.0.jar:1.4.0]
> >         at
> >
> > org.apache.drill.exec.rpc.RpcBus$ChannelClosedHandler.operationComplet
> > e(RpcBus.java:151)
> > ~[drill-rpc-1.4.0.jar:1.4.0]
> >         at
> >
> > io.netty.util.concurrent.DefaultPromise.notifyListener0(DefaultPromise
> > .java:680) ~[netty-common-4.0.27.Final.jar:4.0.27.Final]
> >         at
> >
> > io.netty.util.concurrent.DefaultPromise.notifyListeners0(DefaultPromis
> > e.java:603) ~[netty-common-4.0.27.Final.jar:4.0.27.Final]
> >         at
> >
> > io.netty.util.concurrent.DefaultPromise.notifyListeners(DefaultPromise
> > .java:563) ~[netty-common-4.0.27.Final.jar:4.0.27.Final]
> >         at
> > io.netty.util.concurrent.DefaultPromise.trySuccess(DefaultPromise.java
> > :406) ~[netty-common-4.0.27.Final.jar:4.0.27.Final]
> >         at
> >
> > io.netty.channel.DefaultChannelPromise.trySuccess(DefaultChannelPromis
> > e.java:82) ~[netty-transport-4.0.27.Final.jar:4.0.27.Final]
> >         at
> >
> > io.netty.channel.AbstractChannel$CloseFuture.setClosed(AbstractChannel
> > .java:943) ~[netty-transport-4.0.27.Final.jar:4.0.27.Final]
> >         at
> >
> > io.netty.channel.AbstractChannel$AbstractUnsafe.doClose0(AbstractChann
> > el.java:592) ~[netty-transport-4.0.27.Final.jar:4.0.27.Final]
> >         at
> >
> > io.netty.channel.AbstractChannel$AbstractUnsafe.close(AbstractChannel.
> > java:584) ~[netty-transport-4.0.27.Final.jar:4.0.27.Final]
> >         at
> >
> > io.netty.channel.epoll.AbstractEpollStreamChannel$EpollStreamUnsafe.cl
> > oseOnRead(AbstractEpollStreamChannel.java:409)
> > ~[netty-transport-native-epoll-4.0.27.Final-linux-x86_64.jar:na]
> >         at
> >
> > io.netty.channel.epoll.AbstractEpollStreamChannel$EpollStreamUnsafe.ep
> > ollInReady(AbstractEpollStreamChannel.java:647)
> > ~[netty-transport-native-epoll-4.0.27.Final-linux-x86_64.jar:na]
> >         at
> >
> > io.netty.channel.epoll.AbstractEpollStreamChannel$EpollStreamUnsafe.ep
> > ollRdHupReady(AbstractEpollStreamChannel.java:573)
> > ~[netty-transport-native-epoll-4.0.27.Final-linux-x86_64.jar:na]
> >         at
> > io.netty.channel.epoll.EpollEventLoop.processReady(EpollEventLoop.java
> > :315) ~[netty-transport-native-epoll-4.0.27.Final-linux-x86_64.jar:na]
> >         at
> > io.netty.channel.epoll.EpollEventLoop.run(EpollEventLoop.java:250)
> > ~[netty-transport-native-epoll-4.0.27.Final-linux-x86_64.jar:na]
> >         at
> >
> > io.netty.util.concurrent.SingleThreadEventExecutor$2.run(SingleThreadE
> > ventExecutor.java:111) ~[netty-common-4.0.27.Final.jar:4.0.27.Final]
> >         ... 1 common frames omitted
> > 2016-02-08 16:48:25,785 [drill-executor-42] WARN
> > o.a.d.exec.rpc.control.WorkEventBus - Fragment
> > 2946cbe3-e73d-2ed4-da60-76c1bd799372:0:0 not found in the work bus.
> > 2016-02-08 16:48:25,810 [CONTROL-rpc-event-queue] WARN
> > o.a.drill.exec.work.foreman.Foreman - Dropping request to move to
> > COMPLETED state as query is already at CANCELED state (which is
> terminal).
> > 2016-02-08 16:48:25,811 [UserServer-1] INFO
> > o.a.drill.exec.work.foreman.Foreman - Failure while trying communicate
> > query result to initiating client. This would happen if a client is
> > disconnected before response notice can be sent.
> > org.apache.drill.exec.rpc.ChannelClosedException: null
> >         at
> >
> > org.apache.drill.exec.rpc.CoordinationQueue$RpcListener.operationCompl
> > ete(CoordinationQueue.java:89)
> > [drill-rpc-1.4.0.jar:1.4.0]
> >         at
> >
> > org.apache.drill.exec.rpc.CoordinationQueue$RpcListener.operationCompl
> > ete(CoordinationQueue.java:67)
> > [drill-rpc-1.4.0.jar:1.4.0]
> >         at
> >
> > io.netty.util.concurrent.DefaultPromise.notifyListener0(DefaultPromise
> > .java:680) [netty-common-4.0.27.Final.jar:4.0.27.Final]
> >         at
> >
> > io.netty.util.concurrent.DefaultPromise.notifyListeners0(DefaultPromis
> > e.java:603) [netty-common-4.0.27.Final.jar:4.0.27.Final]
> >         at
> >
> > io.netty.util.concurrent.DefaultPromise.notifyListeners(DefaultPromise
> > .java:563) [netty-common-4.0.27.Final.jar:4.0.27.Final]
> >         at
> > io.netty.util.concurrent.DefaultPromise.tryFailure(DefaultPromise.java
> > :424) [netty-common-4.0.27.Final.jar:4.0.27.Final]
> >         at
> >
> > io.netty.channel.AbstractChannel$AbstractUnsafe.safeSetFailure(Abstrac
> > tChannel.java:788) [netty-transport-4.0.27.Final.jar:4.0.27.Final]
> >         at
> >
> > io.netty.channel.AbstractChannel$AbstractUnsafe.write(AbstractChannel.
> > java:689) [netty-transport-4.0.27.Final.jar:4.0.27.Final]
> >         at
> >
> > io.netty.channel.DefaultChannelPipeline$HeadContext.write(DefaultChann
> > elPipeline.java:1114) [netty-transport-4.0.27.Final.jar:4.0.27.Final]
> >         at
> >
> > io.netty.channel.AbstractChannelHandlerContext.invokeWrite(AbstractCha
> > nnelHandlerContext.java:705)
> > [netty-transport-4.0.27.Final.jar:4.0.27.Final]
> >         at
> >
> > io.netty.channel.AbstractChannelHandlerContext.access$1900(AbstractCha
> > nnelHandlerContext.java:32)
> > [netty-transport-4.0.27.Final.jar:4.0.27.Final]
> >         at
> >
> > io.netty.channel.AbstractChannelHandlerContext$AbstractWriteTask.write
> > (AbstractChannelHandlerContext.java:980)
> > [netty-transport-4.0.27.Final.jar:4.0.27.Final]
> >         at
> >
> > io.netty.channel.AbstractChannelHandlerContext$WriteAndFlushTask.write
> > (AbstractChannelHandlerContext.java:1032)
> > [netty-transport-4.0.27.Final.jar:4.0.27.Final]
> >         at
> >
> > io.netty.channel.AbstractChannelHandlerContext$AbstractWriteTask.run(A
> > bstractChannelHandlerContext.java:965)
> > [netty-transport-4.0.27.Final.jar:4.0.27.Final]
> >         at
> >
> > io.netty.util.concurrent.SingleThreadEventExecutor.runAllTasks(SingleT
> > hreadEventExecutor.java:357)
> > [netty-common-4.0.27.Final.jar:4.0.27.Final]
> >         at
> > io.netty.channel.epoll.EpollEventLoop.run(EpollEventLoop.java:254)
> > [netty-transport-native-epoll-4.0.27.Final-linux-x86_64.jar:na]
> >         at
> >
> > io.netty.util.concurrent.SingleThreadEventExecutor$2.run(SingleThreadE
> > ventExecutor.java:111) [netty-common-4.0.27.Final.jar:4.0.27.Final]
> >         at java.lang.Thread.run(Thread.java:745) [na:1.8.0_66]
> > 2016-02-08 16:48:25,812 [UserServer-1] WARN
> > o.a.drill.exec.work.foreman.Foreman - Dropping request to move to
> > FAILED state as query is already at CANCELED state (which is terminal).
> >
>

Reply via email to