[jira] [Updated] (PARQUET-2151) Drop Hadoop 2 input stream reflection from parquet-hadoop

2022-06-07 Thread Steve Loughran (Jira)


 [ 
https://issues.apache.org/jira/browse/PARQUET-2151?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Steve Loughran updated PARQUET-2151:

Description: 
Parquet uses reflection to load a hadoop2 input stream, falling back to a 
hadoop-1 compatible client if not found.

All hadoop 2.0.2+ releases work with H2SeekableInputStream, so the binding to 
H2SeekableInputStream reworked to avoid needing reflection. This would make it 
a lot easier to probe for/use the bytebuffer input, and line the code up for 
more recent hadoop releases.

H1SeekableInputStream is still needed to handle streams without 
ByteBufferReadable.

At some poiint support for ByteBufferPositionedReadable is needed, because that 
is really what parquet wants. that's where reflection will be needed




  was:
Parquet uses reflection to load a hadoop2 input stream, falling back to a 
hadoop-1 compatible client if not found.

All hadoop 2.0.2+ releases work with H2SeekableInputStream, so 
H1SeekableInputStream can be cut and the binding to H2SeekableInputStream 
reworked to avoid needing reflection. This would make it a lot easier to probe 
for/use the bytebuffer input, and line the code up for more recent hadoop 
releases.





> Drop Hadoop 2 input stream reflection from parquet-hadoop 
> --
>
> Key: PARQUET-2151
> URL: https://issues.apache.org/jira/browse/PARQUET-2151
> Project: Parquet
>  Issue Type: Improvement
>  Components: parquet-mr
>Affects Versions: 1.13.0
>Reporter: Steve Loughran
>Priority: Minor
>
> Parquet uses reflection to load a hadoop2 input stream, falling back to a 
> hadoop-1 compatible client if not found.
> All hadoop 2.0.2+ releases work with H2SeekableInputStream, so the binding to 
> H2SeekableInputStream reworked to avoid needing reflection. This would make 
> it a lot easier to probe for/use the bytebuffer input, and line the code up 
> for more recent hadoop releases.
> H1SeekableInputStream is still needed to handle streams without 
> ByteBufferReadable.
> At some poiint support for ByteBufferPositionedReadable is needed, because 
> that is really what parquet wants. that's where reflection will be needed



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Updated] (PARQUET-2151) Drop Hadoop 2 input stream reflection from parquet-hadoop

2022-06-07 Thread Steve Loughran (Jira)


 [ 
https://issues.apache.org/jira/browse/PARQUET-2151?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Steve Loughran updated PARQUET-2151:

Summary: Drop Hadoop 2 input stream reflection from parquet-hadoop   (was: 
Drop Hadoop 1 input stream support from parquet-hadoop )

> Drop Hadoop 2 input stream reflection from parquet-hadoop 
> --
>
> Key: PARQUET-2151
> URL: https://issues.apache.org/jira/browse/PARQUET-2151
> Project: Parquet
>  Issue Type: Improvement
>  Components: parquet-mr
>Affects Versions: 1.13.0
>Reporter: Steve Loughran
>Priority: Minor
>
> Parquet uses reflection to load a hadoop2 input stream, falling back to a 
> hadoop-1 compatible client if not found.
> All hadoop 2.0.2+ releases work with H2SeekableInputStream, so 
> H1SeekableInputStream can be cut and the binding to H2SeekableInputStream 
> reworked to avoid needing reflection. This would make it a lot easier to 
> probe for/use the bytebuffer input, and line the code up for more recent 
> hadoop releases.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)