[
https://issues.apache.org/jira/browse/HIVE-13223?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15233127#comment-15233127
]
Prasanth Jayachandran commented on HIVE-13223:
----------------------------------------------
ORC reader handles only non-orc files correctly by throwing
FileFormatException. The reader itself does not handle 0 length files. The way
it's handled currently is OrcInputFormat just ignores 0 length files from split
computation as it knows that it cannot be valid orc file. Also there are
filters to prune hidden and _* files which are also not valid orc files. So ORC
reader expects only valid ORC files. I think it should be handled at both
places (split generation and reader) as both can be used together or
independently. I can add a check to ORC reader to throw exception when 0 length
files are encountered.
> HoS may hang for queries that run on 0 splits
> -----------------------------------------------
>
> Key: HIVE-13223
> URL: https://issues.apache.org/jira/browse/HIVE-13223
> Project: Hive
> Issue Type: Bug
> Components: Spark
> Affects Versions: 2.1.0
> Reporter: Ashutosh Chauhan
> Assignee: Ashutosh Chauhan
> Attachments: HIVE-13223.1.patch, HIVE-13223.2.patch, HIVE-13223.patch
>
>
> Can be seen on all timed out tests after HIVE-13040 went in
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)