[
https://issues.apache.org/jira/browse/PARQUET-86?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Le Dem updated PARQUET-86:
---------------------------------
Description:
The issue is not really parquet-hive, which we can modify in sync, but rather
Hive itself: we want to be able to change the split implementation without
breaking Hive. (Users might want to use the latest Parquet with their version
of Hive.)
The dependency is in
[ParquetRecordReaderWrapper in
parquet-hive|https://github.com/apache/incubator-parquet-mr/blob/647b8a70f9b7c94cabf9a7ec7bce2e7cbbb4c05b/parquet-hive/parquet-hive-storage-handler/src/main/java/org/apache/hadoop/hive/ql/io/parquet/read/ParquetRecordReaderWrapper.java#L223]
and in
[Hive|https://github.com/apache/hive/blob/e58b9d273cb78bda2947148bc54f4befb2514241/ql/src/java/org/apache/hadoop/hive/ql/io/parquet/read/ParquetRecordReaderWrapper.java#L221]
was:
The issue is not really parquet-hive, which we can modify in sync, but rather
Hive itself: we want to be able to change the split implementation without
breaking Hive. (Users might want to use the latest Parquet with their version
of Hive.)
The dependency is in
[ParquetRecordReaderWrapper in
parquet-hive|https://github.com/apache/incubator-parquet-mr/blob/647b8a70f9b7c94cabf9a7ec7bce2e7cbbb4c05b/parquet-hive/parquet-hive-storage-handler/src/main/java/org/apache/hadoop/hive/ql/io/parquet/read/ParquetRecordReaderWrapper.java#L223]
and in
[Hive|https://github.com/apache/hive/blob/e58b9d273cb78bda2947148bc54f4befb2514241/ql/src/java/org/apache/hadoop/hive/ql/io/parquet/read/ParquetRecordReaderWrapper.java#L221]
It should use
https://github.com/apache/incubator-parquet-mr/blob/master/parquet-hadoop/src/main/java/parquet/hadoop/InternalParquetRecordReader.java
directly instead.
> parquet-hive (and therefore Hive) depends on ParquetInputSplit constructor
> --------------------------------------------------------------------------
>
> Key: PARQUET-86
> URL: https://issues.apache.org/jira/browse/PARQUET-86
> Project: Parquet
> Issue Type: Bug
> Components: parquet-mr
> Reporter: Julien Le Dem
>
> The issue is not really parquet-hive, which we can modify in sync, but rather
> Hive itself: we want to be able to change the split implementation without
> breaking Hive. (Users might want to use the latest Parquet with their version
> of Hive.)
> The dependency is in
> [ParquetRecordReaderWrapper in
> parquet-hive|https://github.com/apache/incubator-parquet-mr/blob/647b8a70f9b7c94cabf9a7ec7bce2e7cbbb4c05b/parquet-hive/parquet-hive-storage-handler/src/main/java/org/apache/hadoop/hive/ql/io/parquet/read/ParquetRecordReaderWrapper.java#L223]
> and in
> [Hive|https://github.com/apache/hive/blob/e58b9d273cb78bda2947148bc54f4befb2514241/ql/src/java/org/apache/hadoop/hive/ql/io/parquet/read/ParquetRecordReaderWrapper.java#L221]
