[ 
https://issues.apache.org/jira/browse/KYLIN-4348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17172060#comment-17172060
 ] 

Gabor Arki commented on KYLIN-4348:
-----------------------------------

We just upgraded to 3.1.0 release but we are still seeing this issue. After a 
short time, we get all of the already started "STREAM CUBE" jobs stuck in 
whichever state they were (no progress in the last 8 hours), while all new jobs 
are stuck at pending state, not making any progress. Yet according to yarn, no 
jobs are running or have been submitted for the past 8 hours. In the logs we 
are still seeing:
{code:java}
2020-08-06 06:16:04 INFO  [Scheduler 116797841 Job 
416355c2-a3d7-57eb-55c6-c042aa256510-250] MapReduceExecutable:409 - 
416355c2-a3d7-57eb-55c6-c042aa256510-00, parent lock 
path(/cube_job_lock/cube_vm) is locked by other job result is true ,ephemeral 
lock path :/cube_job_ephemeral_lock/cube_vm is locked by other job result is 
true,will try after one minute
2020-08-06 06:16:22 INFO  [Scheduler 116797841 Job 
4620192a-71e1-16dd-3b05-44d7f9144ad4-342] MapReduceExecutable:409 - 
4620192a-71e1-16dd-3b05-44d7f9144ad4-00, parent lock 
path(/cube_job_lock/cube_cm) is locked by other job result is true ,ephemeral 
lock path :/cube_job_ephemeral_lock/cube_cm is locked by other job result is 
true,will try after one minute
2020-08-06 06:16:22 INFO  [Scheduler 116797841 Job 
12750aea-3b96-c817-64e8-bf893d8c120f-254] MapReduceExecutable:409 - 
12750aea-3b96-c817-64e8-bf893d8c120f-00, parent lock 
path(/cube_job_lock/cube_vm) is locked by other job result is true ,ephemeral 
lock path :/cube_job_ephemeral_lock/cube_vm is locked by other job result is 
true,will try after one minute
2020-08-06 06:16:33 WARN  [FetcherRunner 787667774-43] FetcherRunner:56 - There 
are too many jobs running, Job Fetch will wait until next schedule time{code}
 

> Fix distributed concurrency lock bug
> ------------------------------------
>
>                 Key: KYLIN-4348
>                 URL: https://issues.apache.org/jira/browse/KYLIN-4348
>             Project: Kylin
>          Issue Type: Sub-task
>            Reporter: wangxiaojing
>            Assignee: wangxiaojing
>            Priority: Major
>             Fix For: v3.1.0
>
>         Attachments: image-2020-02-03-10-54-21-976.png, 
> image-2020-02-03-10-54-53-468.png
>
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to