[ 
https://issues.apache.org/jira/browse/PARQUET-86?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14122210#comment-14122210
 ] 

Julien Le Dem commented on PARQUET-86:
--------------------------------------

Here is a fix:
https://github.com/julienledem/incubator-parquet-mr/commit/ccdd08c7030d9d4579b6beef2b93981e327637c5

> parquet-hive (and therefore Hive) depends on ParquetInputSplit constructor
> --------------------------------------------------------------------------
>
>                 Key: PARQUET-86
>                 URL: https://issues.apache.org/jira/browse/PARQUET-86
>             Project: Parquet
>          Issue Type: Bug
>          Components: parquet-mr
>            Reporter: Julien Le Dem
>
> The issue is not really parquet-hive which we can modify in sync but rather 
> Hive itself. As we want to be able to change the split implementation without 
> breaking Hive. (Users might want to use the latest Parquet with their version 
> of Hive)
> [ParquetRecordReaderWrapper in 
> parquet-hive|https://github.com/apache/incubator-parquet-mr/blob/647b8a70f9b7c94cabf9a7ec7bce2e7cbbb4c05b/parquet-hive/parquet-hive-storage-handler/src/main/java/org/apache/hadoop/hive/ql/io/parquet/read/ParquetRecordReaderWrapper.java#L223]
> and in 
> [Hive|https://github.com/apache/hive/blob/e58b9d273cb78bda2947148bc54f4befb2514241/ql/src/java/org/apache/hadoop/hive/ql/io/parquet/read/ParquetRecordReaderWrapper.java#L221]
> It should use 
> https://github.com/apache/incubator-parquet-mr/blob/master/parquet-hadoop/src/main/java/parquet/hadoop/InternalParquetRecordReader.java
>  directly instead.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to