Tim Allison created TIKA-3328:
---------------------------------
Summary: PDFs detected as matlab
Key: TIKA-3328
URL: https://issues.apache.org/jira/browse/TIKA-3328
Project: Tika
Issue Type: Task
Reporter: Tim Allison
Attachments: GHOSTSCRIPT-690494-1.pdf, pdf.js-LINK-1691-0.pdfIn two rare cases in one corpus, I noticed that two PDFs, both starting with '%%' are identified as matlab because our matlab patterns look for '%%'. -- This message was sent by Atlassian Jira (v8.3.4#803005)
