[
https://issues.apache.org/jira/browse/PARQUET-4?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Le Dem updated PARQUET-4:
--------------------------------
Assignee: Matt Martin
> Use LRU caching for footers in ParquetInputFormat.
> --------------------------------------------------
>
> Key: PARQUET-4
> URL: https://issues.apache.org/jira/browse/PARQUET-4
> Project: Parquet
> Issue Type: Bug
> Components: parquet-mr
> Reporter: Matt Martin
> Assignee: Matt Martin
> Fix For: 1.6.0
>
>
> The caching approach needs to change because of issues that occur when the
> same ParquetInputFormat instance is reused to generate splits for different
> input directories. For example, it causes problems in Hive's FetchOperator
> when the FetchOperator is attempting to operate over more than one partition
> (sidenote: as far as I could tell, Hive has been reusing inputformat
> instances in this way for quite some time). The details of how this issue
> manifests itself with respect to Hive are described in more detail here:
> https://groups.google.com/d/msg/parquet-dev/0aXql-3z7vE/Gn5m094V7PMJ
> The proposed patch can be found here:
> https://github.com/apache/incubator-parquet-mr/pull/2
--
This message was sent by Atlassian JIRA
(v6.2#6252)