Mark Ward created NIFI-11084:
--------------------------------

             Summary: Character/text data "mis-identified" by IdentifyMimeType 
processor
                 Key: NIFI-11084
                 URL: https://issues.apache.org/jira/browse/NIFI-11084
             Project: Apache NiFi
          Issue Type: Bug
          Components: Core Framework
    Affects Versions: 1.19.1, 1.15.2
         Environment: Windows Server 2019, Java 11.0.17
            Reporter: Mark Ward
         Attachments: mime_type_mis-id_file.csv

When *IdentifyMimeType* is presented with a text file that has a `.csv` 
extension and whose content begins with the two characters `P2`, the processor 
mis-identifies mime.extension as `pgm` and mime.type as 
`image/x-portable-graymap`.
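
A possible explanation, offered as an assumption rather than a confirmed root 
cause: `P2` is the magic number of the plain (ASCII) Netpbm PGM format, so a 
detector that trusts leading bytes over the filename would report a graymap 
image. The sketch below is a hypothetical illustration in Python, not NiFi or 
Tika code; the function name and the sample rows are made up.

```python
# Hypothetical illustration only; this is not NiFi or Tika code.
# Plain (ASCII) Netpbm PGM files begin with the two bytes "P2".
PGM_PLAIN_MAGIC = b"P2"


def naive_magic_guess(content: bytes) -> str:
    """Guess a MIME type from the leading bytes alone, ignoring the filename."""
    if content.startswith(PGM_PLAIN_MAGIC):
        return "image/x-portable-graymap"
    return "text/plain"


# Made-up CSV rows whose first field happens to start with "P2".
csv_content = b"P2,widget,42\nP3,gadget,17\n"
guess = naive_magic_guess(csv_content)
print(guess)
```

If the real detector behaves like this sketch, any text file whose first two 
characters are `P2` would be affected, regardless of its extension.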

The processor's *Use Filename In Detection* property is set to `true`.

An example file is attached, and the following flow can be used to reproduce 
the issue: *GetFile* > *IdentifyMimeType*, after which the output flowfile's 
attributes can be inspected.
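
For anyone without access to the attachment, a stand-in input file can be 
generated with a few lines of Python. This is a minimal sketch: the file name 
`p2_sample.csv` and the row contents are hypothetical and are not taken from 
the attached file; the only properties that matter are the `.csv` extension 
and the leading `P2`.

```python
import tempfile
from pathlib import Path


def write_sample_csv(directory: Path) -> Path:
    """Write a small CSV file whose content starts with the characters 'P2'."""
    # Hypothetical file name and rows; not the content of the Jira attachment.
    path = directory / "p2_sample.csv"
    path.write_text("P2,widget,42\nP3,gadget,17\n", encoding="ascii")
    return path


# Create the file in a fresh temporary directory and show its first two bytes.
sample = write_sample_csv(Path(tempfile.mkdtemp()))
print(sample, sample.read_bytes()[:2])
```

Pointing the *GetFile* processor's input directory at the folder printed by 
the script should feed the file into the flow described above.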

This has been tested on NiFi versions `1.15.2` and `1.19.1`, both running on a 
Windows Server 2019 instance.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
