[
https://issues.apache.org/jira/browse/FLINK-5944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ilya Ganelin updated FLINK-5944:
--------------------------------
Description:
Snappy is a widely used compression format that offers fast compression and
decompression.
Support can be added by creating a SnappyInflaterInputStreamFactory and
registering it in initDefaultInflaterInputStreamFactories in FileInputFormat.
Flink already includes the Snappy dependency in the project.
There is one gotcha. To read Snappy files written by Hadoop, we must provide
two separate implementations, because Hadoop wraps Snappy data in a different
framing than Snappy Java (xerial/snappy-java, the library included in Flink).
was:
Snappy is a widely used compression format that offers fast compression and
decompression.
Support can be added by creating a SnappyInflaterInputStreamFactory and
registering it in initDefaultInflaterInputStreamFactories in FileInputFormat.
Flink already includes the Snappy dependency in the project.
> Flink should support reading Snappy Files
> -----------------------------------------
>
> Key: FLINK-5944
> URL: https://issues.apache.org/jira/browse/FLINK-5944
> Project: Flink
> Issue Type: New Feature
> Reporter: Ilya Ganelin
> Labels: features
>
> Snappy is a widely used compression format that offers fast compression and
> decompression.
> Support can be added by creating a SnappyInflaterInputStreamFactory and
> registering it in initDefaultInflaterInputStreamFactories in FileInputFormat.
> Flink already includes the Snappy dependency in the project.
> There is one gotcha. To read Snappy files written by Hadoop, we must provide
> two separate implementations, because Hadoop wraps Snappy data in a different
> framing than Snappy Java (xerial/snappy-java, the library included in Flink).
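
For illustration only, here is a rough sketch of the extension-keyed factory
registration the description refers to. It is not Flink code: StreamFactory is a
local stand-in modelled on Flink's InflaterInputStreamFactory, and the register,
open and demo helpers are hypothetical names. Because the JDK ships no Snappy
codec, the runnable part uses GZIP as a stand-in. A Snappy factory would
presumably have the same shape, returning snappy-java's SnappyInputStream from
create() and listing "snappy" as its extension.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;
import java.util.Collection;
import java.util.HashMap;
import java.util.Map;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

final class DecompressionRegistrySketch {

    /** Local stand-in modelled on Flink's InflaterInputStreamFactory; not the real interface. */
    interface StreamFactory {
        InputStream create(InputStream in) throws IOException;
        Collection<String> getCommonFileExtensions();
    }

    /** Extension -> factory map, playing the role of the defaults set up in FileInputFormat. */
    private static final Map<String, StreamFactory> FACTORIES = new HashMap<>();

    /** Registers a factory under every file extension it declares. */
    static void register(StreamFactory factory) {
        for (String ext : factory.getCommonFileExtensions()) {
            FACTORIES.put(ext, factory);
        }
    }

    /** Wraps raw in a decompressing stream if the file extension has a registered factory. */
    static InputStream open(String fileName, InputStream raw) throws IOException {
        int dot = fileName.lastIndexOf('.');
        StreamFactory factory = dot < 0 ? null : FACTORIES.get(fileName.substring(dot + 1));
        return factory == null ? raw : factory.create(raw);
    }

    /**
     * JDK-only stand-in codec. Assumption: a Snappy factory would instead return
     * new org.xerial.snappy.SnappyInputStream(in) and list "snappy" as its extension.
     */
    static final class GzipFactory implements StreamFactory {
        @Override
        public InputStream create(InputStream in) throws IOException {
            return new GZIPInputStream(in);
        }

        @Override
        public Collection<String> getCommonFileExtensions() {
            return Arrays.asList("gz", "gzip");
        }
    }

    /** Compresses a string, reads it back through the registry, and returns the result. */
    static String demo() {
        try {
            register(new GzipFactory());
            ByteArrayOutputStream compressed = new ByteArrayOutputStream();
            try (GZIPOutputStream gz = new GZIPOutputStream(compressed)) {
                gz.write("hello snappy".getBytes(StandardCharsets.UTF_8));
            }
            ByteArrayOutputStream plain = new ByteArrayOutputStream();
            try (InputStream in = open("part-0.gz", new ByteArrayInputStream(compressed.toByteArray()))) {
                byte[] buf = new byte[4096];
                for (int n; (n = in.read(buf)) != -1; ) {
                    plain.write(buf, 0, n);
                }
            }
            return new String(plain.toByteArray(), StandardCharsets.UTF_8);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(demo());
    }
}
```

Under this shape, the Hadoop case from the description would be a second factory
with its own create() body, since the two Snappy framings cannot share one
decoder.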