Tika reports the content type of AR archives as "text/plain"
------------------------------------------------------------

                 Key: TIKA-697
                 URL: https://issues.apache.org/jira/browse/TIKA-697
             Project: Tika
          Issue Type: Bug
         Environment: Linux (CentOS 5.6)
            Reporter: PNS
            Priority: Trivial


The Tika.detect(InputStream) method returns "text/plain" for AR archives 
created with the Linux "Create Archive" option of Nautilus (available via 
right-clicking on a file).

The Apache Commons Compress "autodetection" code of the ArchiveStreamFactory 
looks at the first 12 bytes of the stream and correctly identifies the type as 
AR.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to