Matt Burgess created NIFI-5455:
----------------------------------
Summary: Add a ParquetRecordReader controller service
Key: NIFI-5455
URL: https://issues.apache.org/jira/browse/NIFI-5455
Project: Apache NiFi
Issue Type: New Feature
Components: Extensions
Reporter: Matt Burgess
Currently we have FetchParquet and PutParquet, the former of which allows a Parquet
file to be ingested into NiFi, and the latter of which allows conversion of a
record set into Parquet to be placed on a filesystem such as HDFS. However,
there is no way to convert from Parquet into another format on which additional
operations (e.g. transformation or conversion to yet another format) may be
performed.
This Jira proposes to add a ParquetRecordReader controller service that can be
used by the record-aware processors for reading in records in Parquet format.
I'm not including a ParquetRecordSetWriter in this Jira since PutParquet
already exists, and any additional operations on the records are probably (at
least at present) better handled via a format that is "faster" w.r.t. NiFi,
rather than serializing back to Parquet at each step.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)