[ https://issues.apache.org/jira/browse/NIFI-3420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15847219#comment-15847219 ]
ASF GitHub Bot commented on NIFI-3420: -------------------------------------- Github user ilganeli commented on the issue: https://github.com/apache/nifi/pull/1457 I've added unit tests but am closing this issue for now. There is a substantial blocker to this approach since it leverages the classes from Hadoop which themselves depend on natively compiled and loaded C code. Unless NiFi explicitly adds the C-code for the Lz4 codec and manually builds and loads that library, we won't be able to use the Codec in Hadoop. I've also evaluated using the lz4-java library instead but this does not generate data in a Hadoop readable format. > NIFI Should support generating Hadoop-readable Lz4 outside of HDFS Write > ------------------------------------------------------------------------ > > Key: NIFI-3420 > URL: https://issues.apache.org/jira/browse/NIFI-3420 > Project: Apache NiFi > Issue Type: New Feature > Reporter: Ilya Ganelin > > Per https://issues.apache.org/jira/browse/HADOOP-12990 data stored in Lz4 > format on Hadoop is in a different format from the data generated by the Lz4 > CLI. The Lz4 CLI can also not be used to generate the Hadoop-compatible > format. > At the moment, NiFi does not support compression to Lz4 for streaming data. > Although PutHdfs in the Hadoop processors supports writing out Lz4 to HDFS > (assuming the appropriate codec exists), if data is instead being saved to > something like S3 or simply streamed, there's no way to generate Lz4 > compressed data. > If the Lz4 command line tool is used within a custom processor to perform Lz4 > conversion, this data will then not be readable on Hadoop if it's > subsequently loaded to HDFS. > A processor can be added that performs the conversion streaming data into the > Lz4 format that IS readable on Hadoop by using the Hadoop Lz4 Codec to do the > compression. -- This message was sent by Atlassian JIRA (v6.3.15#6346)