[
https://issues.apache.org/jira/browse/DRILL-8059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
wtf updated DRILL-8059:
-----------------------
Description:
One of the minor fragment starts late sometimes. All fragments transterred
between the same node pair take muck longer time during that period.
Three profiles attached. The slow communication happens around
11:45:18~11:45:38 between node explorersha-49-7146.ET15.alipay.com and node
explorersha-49-6019.ET15.alipay.com.
On node explorersha-49-7146.ET15.alipay.com:
drillbit.log.1:2021-11-30 11:45:25,258
[1e5a61ab-1b3e-8956-ad52-6786ccece9e2:foreman] INFO
o.a.d.e.work.foreman.FragmentsRunner - Sending remote fragments to node:
explorersha-49-6019.ET15.alipay.com, Fragments:
[1e5a61ab-1b3e-8956-ad52-6786ccece9e2:1:17]
On node explorersha-49-6019.ET15.alipay.com:
drillbit.log.3:2021-11-30 11:45:47,269 [BitServer-10] INFO
o.a.d.e.w.b.ControlMessageHandler - 1e5a61ab-1b3e-8956-ad52-6786ccece9e2:1:17
remote fragment received
We can see that it takes more than 22 seconds to send the fragment.
was:
One of the minor fragment starts late sometimes. All fragments transterred
between the same node pair take muck longer time during that period.
Three profiles attached. The slow communication happens around
11:45:18~11:45:38 between node explorersha-49-7146.ET15.alipay.com and node
explorersha-49-6019.ET15.alipay.com.
On node explorersha-49-7146.ET15.alipay.com:
drillbit.log.1:2021-11-30 11:45:25,258
[1e5a61ab-1b3e-8956-ad52-6786ccece9e2:foreman] INFO
o.a.d.e.work.foreman.FragmentsRunner - Sending remote fragments to node:
explorersha-49-6019.ET15.alipay.com, Fragments:
[1e5a61ab-1b3e-8956-ad52-6786ccece9e2:1:17]
On node explorersha-49-6019.ET15.alipay.com:
drillbit.log.3:2021-11-30 11:45:47,269 [BitServer-10] INFO
o.a.d.e.w.b.ControlMessageHandler - 1e5a61ab-1b3e-8956-ad52-6786ccece9e2:1:17
remote fragment received
> Fragment starts late
> --------------------
>
> Key: DRILL-8059
> URL: https://issues.apache.org/jira/browse/DRILL-8059
> Project: Apache Drill
> Issue Type: Bug
> Components: Execution - RPC
> Affects Versions: 1.13.0
> Reporter: wtf
> Priority: Major
> Attachments: 1e5a6196-f2c1-46d5-7126-1d9f1aada180.json,
> 1e5a61ab-1b3e-8956-ad52-6786ccece9e2.json,
> 1e5a61b1-256b-3b57-f76a-8b2b5f8a72c9.json
>
>
> One of the minor fragment starts late sometimes. All fragments transterred
> between the same node pair take muck longer time during that period.
> Three profiles attached. The slow communication happens around
> 11:45:18~11:45:38 between node explorersha-49-7146.ET15.alipay.com and node
> explorersha-49-6019.ET15.alipay.com.
> On node explorersha-49-7146.ET15.alipay.com:
> drillbit.log.1:2021-11-30 11:45:25,258
> [1e5a61ab-1b3e-8956-ad52-6786ccece9e2:foreman] INFO
> o.a.d.e.work.foreman.FragmentsRunner - Sending remote fragments to node:
> explorersha-49-6019.ET15.alipay.com, Fragments:
> [1e5a61ab-1b3e-8956-ad52-6786ccece9e2:1:17]
> On node explorersha-49-6019.ET15.alipay.com:
> drillbit.log.3:2021-11-30 11:45:47,269 [BitServer-10] INFO
> o.a.d.e.w.b.ControlMessageHandler - 1e5a61ab-1b3e-8956-ad52-6786ccece9e2:1:17
> remote fragment received
> We can see that it takes more than 22 seconds to send the fragment.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)