Bing Jiang created PARQUET-1051:
-----------------------------------

             Summary: Parquet Combine Input format for MapReduce job
                 Key: PARQUET-1051
                 URL: https://issues.apache.org/jira/browse/PARQUET-1051
             Project: Parquet
          Issue Type: New Feature
          Components: parquet-mr
            Reporter: Bing Jiang


ParquetInputFormat can only process one parquet file, if there are small files, 
it will spawn many small map task processing the small file. It is not 
efficient.
We need to provide CombineInputFormat to combine small files together in one 
map task.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to