[
https://issues.apache.org/jira/browse/PARQUET-86?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14122210#comment-14122210
] Julien Le Dem commented on PARQUET-86: -------------------------------------- Here is a fix: https://github.com/julienledem/incubator-parquet-mr/commit/ccdd08c7030d9d4579b6beef2b93981e327637c5 > parquet-hive (and therefore Hive) depends on ParquetInputSplit constructor > -------------------------------------------------------------------------- > > Key: PARQUET-86 > URL: https://issues.apache.org/jira/browse/PARQUET-86 > Project: Parquet > Issue Type: Bug > Components: parquet-mr > Reporter: Julien Le Dem > > The issue is not really parquet-hive which we can modify in sync but rather > Hive itself. As we want to be able to change the split implementation without > breaking Hive. (Users might want to use the latest Parquet with their version > of Hive) > [ParquetRecordReaderWrapper in > parquet-hive|https://github.com/apache/incubator-parquet-mr/blob/647b8a70f9b7c94cabf9a7ec7bce2e7cbbb4c05b/parquet-hive/parquet-hive-storage-handler/src/main/java/org/apache/hadoop/hive/ql/io/parquet/read/ParquetRecordReaderWrapper.java#L223] > and in > [Hive|https://github.com/apache/hive/blob/e58b9d273cb78bda2947148bc54f4befb2514241/ql/src/java/org/apache/hadoop/hive/ql/io/parquet/read/ParquetRecordReaderWrapper.java#L221] > It should use > https://github.com/apache/incubator-parquet-mr/blob/master/parquet-hadoop/src/main/java/parquet/hadoop/InternalParquetRecordReader.java > directly instead. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
