Tim Allison created TIKA-3328:
---------------------------------

             Summary: PDFs detected as matlab
                 Key: TIKA-3328
                 URL: https://issues.apache.org/jira/browse/TIKA-3328
             Project: Tika
          Issue Type: Task
            Reporter: Tim Allison
         Attachments: GHOSTSCRIPT-690494-1.pdf, pdf.js-LINK-1691-0.pdf

In one corpus, I noticed two rare cases of PDFs, both starting with '%%',
being identified as matlab because our matlab patterns look for '%%'.
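
The misdetection can be illustrated with a minimal sketch of offset-based magic matching. This is not Tika's actual detector or its real mime definitions: the rule table, the `detect` function, the type names, and the sample bytes are all assumptions made for illustration, and the sample header is invented rather than taken from the attached files.

```python
# Hypothetical, simplified magic rules: (type name, offset, magic bytes).
# These are assumptions for illustration, not Tika's real definitions.
RULES = [
    ("text/x-matlab", 0, b"%%"),        # assumed: '%%' at offset 0
    ("application/pdf", 0, b"%PDF-"),   # assumed: '%PDF-' at offset 0
]


def detect(data: bytes) -> str:
    """Return the first type whose magic bytes match at its offset."""
    for mime, offset, magic in RULES:
        if data[offset:offset + len(magic)] == magic:
            return mime
    return "application/octet-stream"


# Invented header bytes, not the content of the attached PDFs.
print(detect(b"%%PDF-1.4\n"))  # the extra leading '%' matches the matlab rule
print(detect(b"%PDF-1.4\n"))   # a conventional header matches the PDF rule
```

In this sketch the extra leading '%' shifts the PDF marker away from offset 0, so only the '%%' rule can match.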



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
