Vik created NIFI-2562:
-------------------------
Summary: PutHDFS writes corrupted data in the transparent disk
encryption zone
Key: NIFI-2562
URL: https://issues.apache.org/jira/browse/NIFI-2562
Project: Apache NiFi
Issue Type: Bug
Components: Core Framework
Affects Versions: 0.6.0
Reporter: Vik
Priority: Blocker
Problem 1: UnknownHostException
When NiFi tried to ingest files into an HDFS encryption zone, it threw an
UnknownHostException.
Reason: In the Hadoop configuration files (core-site.xml and hdfs-site.xml), the
KMS hosts were listed in the following format: "[email protected];
xxxxxxx2.int.xxxx.com:16000".
Because NiFi was using older Hadoop libraries (2.6.2), it could not parse the
two hosts. Instead it treated the two hosts as a single hostname and threw an
UnknownHostException.
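To illustrate the parsing problem, here is a minimal Java sketch of how a
multi-host provider path might be split into one entry per KMS host. It is not
the Hadoop implementation. The class name KmsHosts, the method split, and the
example.com hostnames are hypothetical, and the sketch assumes that all hosts
share the single trailing port.

```java
import java.util.ArrayList;
import java.util.List;

public class KmsHosts {

    /**
     * Splits a provider path of the assumed form
     * kms://<scheme>@<host1>;<host2>:<port>/<path>
     * into one "host:port" entry per KMS host.
     */
    public static List<String> split(String providerPath) {
        // Drop the leading "kms://" prefix.
        String rest = providerPath.substring("kms://".length());

        // The authority sits between '@' and the first '/' after it.
        int at = rest.indexOf('@');
        int slash = rest.indexOf('/', at + 1);
        String authority = rest.substring(at + 1, slash < 0 ? rest.length() : slash);

        // The port (if any) follows the last ':' and applies to every host.
        int colon = authority.lastIndexOf(':');
        String port = colon < 0 ? "" : authority.substring(colon);
        String hosts = colon < 0 ? authority : authority.substring(0, colon);

        // A client that treats the whole string as one hostname would skip
        // this step, and DNS resolution of "host1;host2" would then fail.
        List<String> result = new ArrayList<>();
        for (String host : hosts.split(";")) {
            result.add(host.trim() + port);
        }
        return result;
    }

    public static void main(String[] args) {
        // Hypothetical hostnames, used only for illustration.
        String path = "kms://http@kms1.example.com;kms2.example.com:16000/kms";
        System.out.println(split(path));
    }
}
```

A client that skips the split on ";" would hand the combined string to DNS as
one hostname, which matches the behavior we saw.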
We tried two different fixes for this.
Fix 1: Changing the property in the configuration files from:
<property>
  <name>hadoop.security.key.provider.path</name>
  <value>kms://[email protected];xxxxxxxx.int.xxxx.com:16000/kms</value>
</property>
to:
<property>
  <name>hadoop.security.key.provider.path</name>
  <value>kms://[email protected]:16000/kms</value>
</property>
Fix 2: Building the NiFi nar files against the Hadoop version installed on our
system (2.6.0-cdh5.7.0).
Steps followed:
a) Change the Hadoop version in the NiFi pom file from 2.6.2 to 2.6.0-cdh5.7.0.
b) Run mvn clean package -DskipTests
c) Copy the following nar files to /opt/nifi-dev<number>/lib:
./nifi-nar-bundles/nifi-hadoop-bundle/nifi-hadoop-nar/target/nifi-hadoop-nar-1.0.0-SNAPSHOT.nar
./nifi-nar-bundles/nifi-hadoop-libraries-bundle/nifi-hadoop-libraries-nar/target/nifi-hadoop-libraries-nar-1.0.0-SNAPSHOT.nar
./nifi-nar-bundles/nifi-hbase-bundle/nifi-hbase-nar/target/nifi-hbase-nar-1.0.0-SNAPSHOT.nar
./nifi-nar-bundles/nifi-standard-services/nifi-http-context-map-bundle/nifi-http-context-map-nar/target/nifi-http-context-map-nar-1.0.0-SNAPSHOT.nar
d) Restart NiFi with bin/nifi.sh restart
These fixes resolved the UnknownHostException for us, but we then ran into
Problem 2, described below.
Problem 2: Ingesting corrupted data into the HDFS encryption zone
After the UnknownHostException was resolved, NiFi was able to ingest files into
the encryption zone, but the content of the ingested files was corrupted.
Approaches:
We tried to reproduce the error with a sample Java program that uses similar
logic and the same library, but it ingested files into the encryption zone
without any problem.
We checked the NiFi log files for the cause and found that NiFi makes HTTP
requests to the KMS to decrypt keys, but we could not investigate further
because no error is logged.
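One way to confirm and quantify the corruption might be to compare a digest of
the source file with a digest of the same file read back from the encryption
zone. The following is a minimal Java sketch using only the JDK. The class name
DigestCompare and its methods are hypothetical, and the in-memory byte arrays
stand in for the real source and read-back streams.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

public class DigestCompare {

    /** Reads the stream to its end and returns its SHA-256 digest as lowercase hex. */
    public static String sha256Hex(InputStream in) {
        try {
            MessageDigest md = MessageDigest.getInstance("SHA-256");
            byte[] buf = new byte[8192];
            int n;
            while ((n = in.read(buf)) != -1) {
                md.update(buf, 0, n);
            }
            StringBuilder hex = new StringBuilder();
            for (byte b : md.digest()) {
                hex.append(String.format("%02x", b));
            }
            return hex.toString();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        } catch (NoSuchAlgorithmException e) {
            // SHA-256 is required on every Java platform, so this is unexpected.
            throw new IllegalStateException(e);
        }
    }

    /** Returns true when both streams produce the same SHA-256 digest. */
    public static boolean sameContent(InputStream original, InputStream readBack) {
        return sha256Hex(original).equals(sha256Hex(readBack));
    }

    public static void main(String[] args) {
        // Placeholder data; in practice these would be the local source file
        // and the file read back from the encryption zone.
        byte[] original = "sample flowfile content".getBytes(StandardCharsets.UTF_8);
        byte[] readBack = original.clone();
        boolean same = sameContent(new ByteArrayInputStream(original),
                                   new ByteArrayInputStream(readBack));
        System.out.println(same ? "content matches" : "content differs");
    }
}
```

Running the same comparison against a file written by the sample Java program
and a file written by PutHDFS could show whether only the NiFi path produces a
mismatch.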
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)