[ 
https://issues.apache.org/jira/browse/PARQUET-86?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Julien Le Dem updated PARQUET-86:
---------------------------------
    Description: 
The issue is not really parquet-hive, which we can modify in sync, but rather 
Hive itself: we want to be able to change the split implementation without 
breaking Hive. (Users might want to use the latest Parquet with their version 
of Hive.)
See [ParquetRecordReaderWrapper in parquet-hive|https://github.com/apache/incubator-parquet-mr/blob/647b8a70f9b7c94cabf9a7ec7bce2e7cbbb4c05b/parquet-hive/parquet-hive-storage-handler/src/main/java/org/apache/hadoop/hive/ql/io/parquet/read/ParquetRecordReaderWrapper.java#L223]
and in 
[Hive|https://github.com/apache/hive/blob/e58b9d273cb78bda2947148bc54f4befb2514241/ql/src/java/org/apache/hadoop/hive/ql/io/parquet/read/ParquetRecordReaderWrapper.java#L221].


  was:
The issue is not really parquet-hive, which we can modify in sync, but rather 
Hive itself: we want to be able to change the split implementation without 
breaking Hive. (Users might want to use the latest Parquet with their version 
of Hive.)
See [ParquetRecordReaderWrapper in parquet-hive|https://github.com/apache/incubator-parquet-mr/blob/647b8a70f9b7c94cabf9a7ec7bce2e7cbbb4c05b/parquet-hive/parquet-hive-storage-handler/src/main/java/org/apache/hadoop/hive/ql/io/parquet/read/ParquetRecordReaderWrapper.java#L223]
and in 
[Hive|https://github.com/apache/hive/blob/e58b9d273cb78bda2947148bc54f4befb2514241/ql/src/java/org/apache/hadoop/hive/ql/io/parquet/read/ParquetRecordReaderWrapper.java#L221].

It should use 
https://github.com/apache/incubator-parquet-mr/blob/master/parquet-hadoop/src/main/java/parquet/hadoop/InternalParquetRecordReader.java
 directly instead.



> parquet-hive (and therefore Hive) depends on ParquetInputSplit constructor
> --------------------------------------------------------------------------
>
>                 Key: PARQUET-86
>                 URL: https://issues.apache.org/jira/browse/PARQUET-86
>             Project: Parquet
>          Issue Type: Bug
>          Components: parquet-mr
>            Reporter: Julien Le Dem
>
> The issue is not really parquet-hive, which we can modify in sync, but rather 
> Hive itself: we want to be able to change the split implementation without 
> breaking Hive. (Users might want to use the latest Parquet with their version 
> of Hive.)
> See [ParquetRecordReaderWrapper in parquet-hive|https://github.com/apache/incubator-parquet-mr/blob/647b8a70f9b7c94cabf9a7ec7bce2e7cbbb4c05b/parquet-hive/parquet-hive-storage-handler/src/main/java/org/apache/hadoop/hive/ql/io/parquet/read/ParquetRecordReaderWrapper.java#L223]
> and in 
> [Hive|https://github.com/apache/hive/blob/e58b9d273cb78bda2947148bc54f4befb2514241/ql/src/java/org/apache/hadoop/hive/ql/io/parquet/read/ParquetRecordReaderWrapper.java#L221].



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)