Hive can use CombineFileInputFormat when the input is many small files
----------------------------------------------------------------------

                 Key: HIVE-74
                 URL: https://issues.apache.org/jira/browse/HIVE-74
             Project: Hadoop Hive
          Issue Type: Improvement
            Reporter: dhruba borthakur
            Assignee: dhruba borthakur
             Fix For: 0.20.0


There are cases when the input to a Hive job is thousands of small files. In
this case, there is a mapper for each file. Most of the overhead of spawning
all these mappers can be avoided if Hive used CombineFileInputFormat, introduced
via HADOOP-4565.
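
To make the expected saving concrete, here is a rough sketch of the idea only.
It is not Hadoop's CombineFileInputFormat code and does not call any Hadoop
API; the class name CombineSketch, the method pack, and the 128 MiB split
limit are hypothetical names and values chosen for illustration. It greedily
groups small files into combined splits by size alone. As far as I know, the
real input format also takes node and rack locality into account, which this
sketch ignores.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Illustrative only: a size-based packing of small files into combined splits.
class CombineSketch {

    // Greedily packs files (given by size in bytes) into splits whose total
    // size stays at or below maxSplitBytes. A file larger than the limit
    // ends up alone in its own split.
    static List<List<Long>> pack(long[] fileSizes, long maxSplitBytes) {
        List<List<Long>> splits = new ArrayList<>();
        List<Long> current = new ArrayList<>();
        long currentBytes = 0;
        for (long size : fileSizes) {
            // Close the current split if adding this file would exceed the limit.
            if (!current.isEmpty() && currentBytes + size > maxSplitBytes) {
                splits.add(current);
                current = new ArrayList<>();
                currentBytes = 0;
            }
            current.add(size);
            currentBytes += size;
        }
        if (!current.isEmpty()) {
            splits.add(current);
        }
        return splits;
    }

    public static void main(String[] args) {
        long[] sizes = new long[1000];
        Arrays.fill(sizes, 1L << 20); // 1000 files of 1 MiB each (made-up input)
        System.out.println("mappers with one split per file: " + sizes.length);
        System.out.println("mappers with combined splits:    "
                + pack(sizes, 128L << 20).size());
    }
}
```

With this made-up input, one split per file means 1000 mappers, while packing
up to 128 MiB per split leaves 8.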
