Github user alopresto commented on the issue:
https://github.com/apache/nifi/pull/2983
Based on @thenatog 's comment on [PR
2980](https://github.com/apache/nifi/pull/2980#issuecomment-419574743) which
captured a Java Big Endian Byte Order Mark (BOM) being inserted in UTF-16
strings pre-hash, I added some logic to prevent that here. I closed PR 2980,
and this PR can be considered as a standalone solution to
[NIFI-5147](https://issues.apache.org/jira/browse/NIFI-5147) and
[NIFI-5566](https://issues.apache.org/jira/browse/NIFI-5566). I also opened
[NIFI-5582](https://issues.apache.org/jira/browse/NIFI-5582) which depends on
this PR.
---