[ 
https://issues.apache.org/jira/browse/NIFI-3612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Daniel Chaffelson updated NIFI-3612:
------------------------------------
    Description: 
This bundle could potentially be extended to include a Parquet transform by 
leveraging the Apache 2.0 licensed parquet-mr/avro libraries:
https://github.com/apache/parquet-mr/tree/master/parquet-avro

This would provide coverage of this popular format to complement the ORC 
support in the Hive Bundle and the other schema-dependent formats already in 
this bundle.
Existing NiFi Parquet support in the kite bundle can only write to a 
non-kerberised Kite Dataset, which prevents usage on secured environments or 
writing to a FlowFile.

As the main competitor to ORC, providing more generic Parquet Transform support 
will greatly widen the pool of potential NiFi adopters, particularly in the 
Spark community.

  was:
This bundle could potentially be extended to include a Parquet transform by 
leveraging the Apache 2.0 licenses parquet-mr/avro libraries:
https://github.com/apache/parquet-mr/tree/master/parquet-avro

This would provide coverage of this popular format to complement the ORC 
support in the Hive Bundle and the other schema-dependent formats already in 
this bundle.


> Add support for Parquet to Nifi-Registry-Bundle
> -----------------------------------------------
>
>                 Key: NIFI-3612
>                 URL: https://issues.apache.org/jira/browse/NIFI-3612
>             Project: Apache NiFi
>          Issue Type: Improvement
>            Reporter: Daniel Chaffelson
>
> This bundle could potentially be extended to include a Parquet transform by 
> leveraging the Apache 2.0 licensed parquet-mr/avro libraries:
> https://github.com/apache/parquet-mr/tree/master/parquet-avro
> This would provide coverage of this popular format to complement the ORC 
> support in the Hive Bundle and the other schema-dependent formats already in 
> this bundle.
> Existing NiFi Parquet support in the kite bundle can only write to a 
> non-kerberised Kite Dataset, which prevents usage on secured environments or 
> writing to a FlowFile.
> As the main competitor to ORC, providing more generic Parquet Transform 
> support will greatly widen the pool of potential NiFi adopters, particularly 
> in the Spark community.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to