Bing Jiang created PARQUET-1051:
-----------------------------------
Summary: Parquet Combine Input format for MapReduce job
Key: PARQUET-1051
URL: https://issues.apache.org/jira/browse/PARQUET-1051
Project: Parquet
Issue Type: New Feature
Components: parquet-mr
Reporter: Bing Jiang
ParquetInputFormat can only process one parquet file, if there are small files,
it will spawn many small map task processing the small file. It is not
efficient.
We need to provide CombineInputFormat to combine small files together in one
map task.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)