[
https://issues.apache.org/jira/browse/KYLIN-4348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17172060#comment-17172060
]
Gabor Arki commented on KYLIN-4348:
-----------------------------------
We just upgraded to 3.1.0 release but we are still seeing this issue. After a
short time, we get all of the already started "STREAM CUBE" jobs stuck in
whichever state they were (no progress in the last 8 hours), while all new jobs
are stuck at pending state, not making any progress. Yet according to yarn, no
jobs are running or have been submitted for the past 8 hours. In the logs we
are still seeing:
{code:java}
2020-08-06 06:16:04 INFO [Scheduler 116797841 Job
416355c2-a3d7-57eb-55c6-c042aa256510-250] MapReduceExecutable:409 -
416355c2-a3d7-57eb-55c6-c042aa256510-00, parent lock
path(/cube_job_lock/cube_vm) is locked by other job result is true ,ephemeral
lock path :/cube_job_ephemeral_lock/cube_vm is locked by other job result is
true,will try after one minute
2020-08-06 06:16:22 INFO [Scheduler 116797841 Job
4620192a-71e1-16dd-3b05-44d7f9144ad4-342] MapReduceExecutable:409 -
4620192a-71e1-16dd-3b05-44d7f9144ad4-00, parent lock
path(/cube_job_lock/cube_cm) is locked by other job result is true ,ephemeral
lock path :/cube_job_ephemeral_lock/cube_cm is locked by other job result is
true,will try after one minute
2020-08-06 06:16:22 INFO [Scheduler 116797841 Job
12750aea-3b96-c817-64e8-bf893d8c120f-254] MapReduceExecutable:409 -
12750aea-3b96-c817-64e8-bf893d8c120f-00, parent lock
path(/cube_job_lock/cube_vm) is locked by other job result is true ,ephemeral
lock path :/cube_job_ephemeral_lock/cube_vm is locked by other job result is
true,will try after one minute
2020-08-06 06:16:33 WARN [FetcherRunner 787667774-43] FetcherRunner:56 - There
are too many jobs running, Job Fetch will wait until next schedule time{code}
> Fix distributed concurrency lock bug
> ------------------------------------
>
> Key: KYLIN-4348
> URL: https://issues.apache.org/jira/browse/KYLIN-4348
> Project: Kylin
> Issue Type: Sub-task
> Reporter: wangxiaojing
> Assignee: wangxiaojing
> Priority: Major
> Fix For: v3.1.0
>
> Attachments: image-2020-02-03-10-54-21-976.png,
> image-2020-02-03-10-54-53-468.png
>
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)