[
https://issues.apache.org/jira/browse/KYLIN-4348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17172237#comment-17172237
]
Gabor Arki commented on KYLIN-4348:
-----------------------------------
Altogether we have 10 running jobs in the cluster which show no progress:
* 169f75fa-a02f-221b-fc48-037bc7a842d0
* 0b5dae1b-6faf-66c5-71dc-86f5b820f1c4
* 00924699-8b51-8091-6e71-34ccfeba3a98
* 4620192a-71e1-16dd-3b05-44d7f9144ad4
* 416355c2-a3d7-57eb-55c6-c042aa256510
* 12750aea-3b96-c817-64e8-bf893d8c120f
* 42819dde-5857-fd6b-b075-439952f47140
* 00128937-bd4a-d6c1-7a4e-744dee946f67
* 46a0233f-217e-9155-725b-c815ad77ba2c
* 062150ba-bacd-6644-4801-3a51b260d1c5
However, the ones possessing the locks are all pending:
* f888380e-9ff4-98f5-2df4-1ae71e045f93
* fc186bd9-1186-6ed4-e58c-bbbf6dd8ef74
* d1a6475a-9ab2-5ee4-6714-f395e20cfc01
So, essentially the jobs that are running cannot actually run because they are
unable to acquire a lock. However, the ones that possess the lock cannot
continue because there are already 10 running jobs. This seems to be a deadlock
to me.
> Fix distributed concurrency lock bug
> ------------------------------------
>
> Key: KYLIN-4348
> URL: https://issues.apache.org/jira/browse/KYLIN-4348
> Project: Kylin
> Issue Type: Sub-task
> Reporter: wangxiaojing
> Assignee: wangxiaojing
> Priority: Major
> Fix For: v3.1.0
>
> Attachments: image-2020-02-03-10-54-21-976.png,
> image-2020-02-03-10-54-53-468.png
>
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)