[ https://issues.apache.org/jira/browse/KYLIN-4657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17163311#comment-17163311 ]
QiangZhang commented on KYLIN-4657:
-----------------------------------

h2. Cause Analysis
 # The job runs in org.apache.kylin.engine.mr.common.MapReduceExecutable#doWork.
 # HadoopJobStatusChecker.checkStatus is called at MapReduceExecutable.java:153. For some reason checkStatus fails, and the error is caught inside the checkStatus method.
 # HadoopCmdOutput.getInfo is called at MapReduceExecutable.java:163. For some reason getInfo fails, so an exception is thrown and the doWork method returns.
 # If the user has enabled Job Retry (_+kylin.job.retry+_), the job is retried.
 # On the retry, job = new Cluster(conf).getJob(JobID.forName(extra.get(ExecutableConstants.MR_JOB_ID))) runs at MapReduceExecutable.java:120. If the status of the previous run is SUCCEEDED, the getJob call is redirected to the job history server. For some reason getJob fails and job is null.
 # When job is null, the while loop becomes a dead loop.

> dead-loop in org.apache.kylin.engine.mr.common.MapReduceExecutable.doWork
> -------------------------------------------------------------------------
>
>                 Key: KYLIN-4657
>                 URL: https://issues.apache.org/jira/browse/KYLIN-4657
>             Project: Kylin
>          Issue Type: Bug
>          Components: Job Engine
>    Affects Versions: all
>            Reporter: QiangZhang
>            Priority: Major
>         Attachments: image-2020-07-23-14-54-02-079.png,
>                      image-2020-07-23-14-58-49-140.png,
>                      image-2020-07-23-15-08-45-493.png,
>                      image-2020-07-23-15-18-48-058.png,
>                      image-2020-07-23-15-20-05-764.png
>
>
> ||issue||
> |Kylin job in step3("Extract Fact Table Distinct Columns") is always running|
>
> ----
> 1. When this job is run for the first time:
> !image-2020-07-23-14-54-02-079.png!
> 2. If the user has enabled Job Retry (_+kylin.job.retry+_), the job is retried:
> !image-2020-07-23-14-58-49-140.png!
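
The failure mode in the cause analysis (a status-polling loop that never exits once the job handle is null) can be illustrated with a small standalone sketch. This is not the Kylin code and not a proposed patch: JobPollSketch, JobHandle, Outcome, pollUntilDone, and the maxNullLookups bound are all hypothetical names chosen for illustration, and the lookup supplier only stands in for a getJob-style call that may return null. The idea shown is simply to count null lookups and give up after a bound instead of looping forever.

```java
import java.util.function.Supplier;

/** Illustrative only: these names are hypothetical and are not Kylin or Hadoop APIs. */
final class JobPollSketch {

    /** Minimal stand-in for a MapReduce job handle. */
    interface JobHandle {
        boolean isComplete();
    }

    enum Outcome { COMPLETED, JOB_NOT_FOUND, INTERRUPTED }

    /**
     * Polls until the job completes, but gives up after maxNullLookups
     * consecutive lookups that return null instead of spinning forever.
     */
    static Outcome pollUntilDone(Supplier<JobHandle> lookup, int maxNullLookups, long sleepMillis) {
        int nullLookups = 0;
        while (true) {
            // Stands in for a getJob-style lookup that may return null.
            JobHandle job = lookup.get();
            if (job == null) {
                nullLookups++;
                if (nullLookups >= maxNullLookups) {
                    // Surface the failure to the caller rather than looping.
                    return Outcome.JOB_NOT_FOUND;
                }
            } else {
                nullLookups = 0;
                if (job.isComplete()) {
                    return Outcome.COMPLETED;
                }
            }
            try {
                Thread.sleep(sleepMillis); // wait before the next status check
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt(); // preserve the interrupt flag
                return Outcome.INTERRUPTED;
            }
        }
    }

    public static void main(String[] args) {
        // A lookup that never finds the job, as in the retry scenario.
        Outcome outcome = pollUntilDone(() -> null, 3, 10L);
        System.out.println(outcome);
    }
}
```

In this sketch the caller decides what JOB_NOT_FOUND means (for example, failing the step); whether that is the right behaviour for MapReduceExecutable is a separate design question.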