[ https://issues.apache.org/jira/browse/HIVE-1403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Siying Dong updated HIVE-1403: ------------------------------ Attachment: HIVE-1403.1.patch The previous patch doesn't seem to be complete. This patch will cover the timeout case under multiple directories. > Reporting progress to JT during closing files in FileSinkOperator > ----------------------------------------------------------------- > > Key: HIVE-1403 > URL: https://issues.apache.org/jira/browse/HIVE-1403 > Project: Hadoop Hive > Issue Type: Bug > Reporter: Ning Zhang > Assignee: Ning Zhang > Fix For: 0.6.0 > > Attachments: HIVE-1403.1.patch, HIVE-1403.patch > > > If there are too many files need to be closed in FileSinkOperator (e.g., if > DynamicPartition/FileSpray is turned on), there could be many files generated > by each task and they need to be closed at the FileSinkOperator.closeOp(). If > the NN is overloaded each file close could take more than 1 sec. This > sometimes make JT think the task is dead since it takes too long to close all > the files and without any progress report. We need to report progress after a > while during file closing. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.