[
https://issues.apache.org/jira/browse/NIFI-4565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16271161#comment-16271161
]
ASF GitHub Bot commented on NIFI-4565:
--------------------------------------
Github user jfrazee commented on the issue:
https://github.com/apache/nifi/pull/2293
@baank Definitely thanks for the contrib, but unfortunately I don't think
this is the right solution/it supports S3 without any code changes.
PutParquet provides an "Additional Classpath Resources" property that you
can point at a directory and provide all the S3 dependencies. Here's what I
used:
```
aws-java-sdk-1.7.4.jar
hadoop-aws-2.7.3.jar
hadoop-common-2.7.3.jar
httpclient-4.5.3.jar
httpcore-4.4.4.jar
jackson-annotations-2.6.0.jar
jackson-core-2.6.1.jar
jackson-databind-2.6.1.jar
```
We take the same approach with PutHDFS for filesystems that aren't included
in the core Hadoop libs, so it seems to make sense to keep doing the same here.
> PutParquet doesn't support S3
> -----------------------------
>
> Key: NIFI-4565
> URL: https://issues.apache.org/jira/browse/NIFI-4565
> Project: Apache NiFi
> Issue Type: Bug
> Components: Core Framework
> Affects Versions: 1.4.0
> Reporter: Franco
> Assignee: Joey Frazee
>
> PutParquet doesn't support S3 targets due to the lack of the hadoop-aws
> dependency.
> To recreate it:
> 1) Modify core-site.xml to change fs.defaultName to one starting with s3://
> 2) Add a PutParquet
> 3) It will fail to run. The logs will show the missing hadoop-aws dependency.
> Simple fix is just to add hadoop-aws.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)