[ 
https://issues.apache.org/jira/browse/HIVE-24774?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17284958#comment-17284958
 ] 

Rajesh Balamohan commented on HIVE-24774:
-----------------------------------------

Thanks [~pvargacl], I will go through both PRs you mentioned.

> Reduce FS listing during dynamic partition loading
> --------------------------------------------------
>
>                 Key: HIVE-24774
>                 URL: https://issues.apache.org/jira/browse/HIVE-24774
>             Project: Hive
>          Issue Type: Improvement
>          Components: HiveServer2
>            Reporter: Rajesh Balamohan
>            Priority: Major
>
> When loading a large number of partitions in cloud storage, writing the 
> notification log takes much longer because it has to list the newly added files.
> It would be good to explore whether the FileStatus objects can be reused from 
> Hive::listFilesCreatedByQuery or from copyFiles.
> {noformat}
>       at org.apache.hadoop.fs.s3a.S3AFileSystem.innerGetFileStatus(S3AFileSystem.java:3031)
>       at org.apache.hadoop.fs.s3a.S3AFileSystem.isDirectory(S3AFileSystem.java:4171)
>       at org.apache.hadoop.hive.ql.metadata.Hive.addInsertFileInformation(Hive.java:3566)
>       at org.apache.hadoop.hive.ql.metadata.Hive.addWriteNotificationLog(Hive.java:3519)
>       at org.apache.hadoop.hive.ql.metadata.Hive.addWriteNotificationLog(Hive.java:3504)
>       at org.apache.hadoop.hive.ql.metadata.Hive.loadDynamicPartitions(Hive.java:2984)
>       at org.apache.hadoop.hive.ql.exec.MoveTask.handleDynParts(MoveTask.java:562)
>       at org.apache.hadoop.hive.ql.exec.MoveTask.execute(MoveTask.java:440)
>       at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:213)
>       at org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:105)
>       at org.apache.hadoop.hive.ql.Executor.launchTask(Executor.java:357)
>       at org.apache.hadoop.hive.ql.Executor.launchTasks(Executor.java:330)
>       at org.apache.hadoop.hive.ql.Executor.runTasks(Executor.java:246)
>       at org.apache.hadoop.hive.ql.Executor.execute(Executor.java:109)
>       at org.apache.hadoop.hive.ql.Driver.runInternal(Driver.java:730)
>       at org.apache.hadoop.hive.ql.Driver.run(Driver.java:490)
>       at org.apache.hadoop.hive.ql.Driver.run(Driver.java:484)
> {noformat}
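
To make the cost difference concrete, here is a minimal, self-contained sketch of the idea in the description. It does not use Hive or Hadoop classes: Status, CountingFs, filesByProbing and filesFromStatuses are hypothetical stand-ins invented for illustration, and the call counter only simulates remote round trips. It compares probing each path with an isDirectory-style call against reusing the statuses that a single listing already returned.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; none of these types exist in Hive or Hadoop.
class ListingReuseSketch {

    /** Stand-in for a file status: a path plus a directory flag. */
    static final class Status {
        final String path;
        final boolean directory;
        Status(String path, boolean directory) {
            this.path = path;
            this.directory = directory;
        }
    }

    /** In-memory "file system" that counts simulated remote calls. */
    static final class CountingFs {
        private final Map<String, Boolean> entries = new LinkedHashMap<>();
        int remoteCalls = 0;

        void add(String path, boolean directory) {
            entries.put(path, directory);
        }

        /** One simulated remote call that returns every entry under the prefix. */
        List<Status> listStatus(String prefix) {
            remoteCalls++;
            List<Status> out = new ArrayList<>();
            for (Map.Entry<String, Boolean> e : entries.entrySet()) {
                if (e.getKey().startsWith(prefix)) {
                    out.add(new Status(e.getKey(), e.getValue()));
                }
            }
            return out;
        }

        /** One simulated remote call per path, like the isDirectory frame in the trace. */
        boolean isDirectory(String path) {
            remoteCalls++;
            return Boolean.TRUE.equals(entries.get(path));
        }
    }

    /** Probing approach: asks the file system about every path again. */
    static List<String> filesByProbing(CountingFs fs, List<String> paths) {
        List<String> files = new ArrayList<>();
        for (String p : paths) {
            if (!fs.isDirectory(p)) {
                files.add(p);
            }
        }
        return files;
    }

    /** Reuse approach: the statuses already carry the flag, so no further calls. */
    static List<String> filesFromStatuses(List<Status> statuses) {
        List<String> files = new ArrayList<>();
        for (Status s : statuses) {
            if (!s.directory) {
                files.add(s.path);
            }
        }
        return files;
    }

    public static void main(String[] args) {
        CountingFs fs = new CountingFs();
        fs.add("/t/p=1", true);
        fs.add("/t/p=1/000000_0", false);
        fs.add("/t/p=2", true);
        fs.add("/t/p=2/000000_0", false);

        List<Status> listed = fs.listStatus("/t/");
        int before = fs.remoteCalls;
        List<String> reused = filesFromStatuses(listed);
        System.out.println("reuse: " + reused + ", extra calls: " + (fs.remoteCalls - before));

        List<String> paths = new ArrayList<>();
        for (Status s : listed) {
            paths.add(s.path);
        }
        before = fs.remoteCalls;
        List<String> probed = filesByProbing(fs, paths);
        System.out.println("probe: " + probed + ", extra calls: " + (fs.remoteCalls - before));
    }
}
```

In this toy model the probing approach costs one extra call per path, while the reuse approach costs none beyond the initial listing. Whether the statuses from Hive::listFilesCreatedByQuery or copyFiles can actually be threaded through to the notification log code is exactly what the issue proposes to explore.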



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
