Joseph Percivall created NIFI-1217:
--------------------------------------

             Summary: New processor to determine flowfile text content's 
encoding
                 Key: NIFI-1217
                 URL: https://issues.apache.org/jira/browse/NIFI-1217
             Project: Apache NiFi
          Issue Type: Improvement
            Reporter: Joseph Percivall


A file can enter the through many different means. Most of which make it almost 
impossible to find the text encoding of the file without relying on OS specific 
commands.

There is a need for a processor that can analyze the contents of a flowfile to 
determine the text encoding. As a start this library may be of help [1]. It 
uses a Mozilla 1.1 license which needs special treatment [2].

Here is the email thread discussing this [3].

[1] http://jchardet.sourceforge.net/
[2] http://www.apache.org/legal/resolved.html#category-b
[3] 
http://mail-archives.apache.org/mod_mbox/nifi-users/201511.mbox/%3C1093351346.8086538.1448382574704.JavaMail.yahoo%40mail.yahoo.com%3E



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to