Matt Burgess created NIFI-5455:
----------------------------------

             Summary: Add a ParquetRecordReader controller service
                 Key: NIFI-5455
                 URL: https://issues.apache.org/jira/browse/NIFI-5455
             Project: Apache NiFi
          Issue Type: New Feature
          Components: Extensions
            Reporter: Matt Burgess


Currently we have Fetch and PutParquet, the former of which allows a Parquet 
file to be ingested into NiFi, and the latter of which allows conversion of a 
record set into Parquet to be placed on a filesystem such as HDFS. However 
there is no way to convert from Parquet into another format on which additional 
operations (transformation, conversion to another format, e.g.) may be 
performed.

This Jira proposes to add a ParquetRecordReader controller service that can be 
used by the record-aware processors for reading in records in Parquet format. 
I'm not including a ParquetRecordSetWriter in this Jira since there is a 
PutParquet and any additional operations on the records are probably (at least 
at present) better handled via a "faster" format w.r.t. NiFi, rather than 
serializing back to Parquet at each step.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to