Vsevolod Ostapenko created KYLIN-4107:
-----------------------------------------

             Summary: StorageCleanupJob fails to delete Hive tables with 
"Argument list too long" error
                 Key: KYLIN-4107
                 URL: https://issues.apache.org/jira/browse/KYLIN-4107
             Project: Kylin
          Issue Type: Bug
          Components: Storage - HBase
    Affects Versions: v2.6.2
         Environment: CentOS 7.6, HDP 2.6.5, Kylin 2.6.3
            Reporter: Vsevolod Ostapenko


On a system with multiple Kylin developers that experiment with cube design and 
(re)build/drop cube segments often intermediate Hive tables and HBase left over 
tables accumulate very quickly.

After a certain point storage cleanup cannot be executed using suggested method:
{{${KYLIN_HOME}/bin/kylin.sh org.apache.kylin.tool.StorageCleanupJob --delete 
true}}

Apparently, storage cleanup job creates a single shell command to drop all Hive 
tables, which fails to execute because command line is just too long. For 
example:
{quote}
2019-07-23 17:47:31,611 ERROR [main] job.StorageCleanupJob:377 : Error during 
deleting Hive tables
java.io.IOException: Cannot run program "/bin/bash": error=7, Argument list too 
long
 at java.lang.ProcessBuilder.start(ProcessBuilder.java:1048)
 at 
org.apache.kylin.common.util.CliCommandExecutor.runNativeCommand(CliCommandExecutor.java:133)
 at 
org.apache.kylin.common.util.CliCommandExecutor.execute(CliCommandExecutor.java:89)
 at 
org.apache.kylin.common.util.CliCommandExecutor.execute(CliCommandExecutor.java:83)
 at 
org.apache.kylin.rest.job.StorageCleanupJob.deleteHiveTables(StorageCleanupJob.java:409)
 at 
org.apache.kylin.rest.job.StorageCleanupJob.cleanUnusedIntermediateHiveTableInternal(StorageCleanupJob.java:375)
 at 
org.apache.kylin.rest.job.StorageCleanupJob.cleanUnusedIntermediateHiveTable(StorageCleanupJob.java:278)
 at 
org.apache.kylin.rest.job.StorageCleanupJob.cleanup(StorageCleanupJob.java:151)
 at 
org.apache.kylin.rest.job.StorageCleanupJob.execute(StorageCleanupJob.java:145)
 at 
org.apache.kylin.common.util.AbstractApplication.execute(AbstractApplication.java:37)
 at org.apache.kylin.tool.StorageCleanupJob.main(StorageCleanupJob.java:27)
Caused by: java.io.IOException: error=7, Argument list too long
 at java.lang.UNIXProcess.forkAndExec(Native Method)
 at java.lang.UNIXProcess.<init>(UNIXProcess.java:247)
 at java.lang.ProcessImpl.start(ProcessImpl.java:134)
 at java.lang.ProcessBuilder.start(ProcessBuilder.java:1029)
 ... 10 moreĀ 
{quote}
Instead of composing one long command, storage cleanup need to generate a 
script and feed that into beeline or hive CLI.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

Reply via email to