Tim Allison created TIKA-3971:
---------------------------------
Summary: Distinguish eps-based Adobe Illustrator files from
pdf-based Illustrator files
Key: TIKA-3971
URL: https://issues.apache.org/jira/browse/TIKA-3971
Project: Tika
Issue Type: Task
Reporter: Tim Allison
On TIKA-2689, we plan to add parse-time detection for Illustrator files that
are based on/wrapped in PDF. Older Illustrator files were EPS or just plain
PostScript (PS). We should figure out how we want to distinguish between
these two or three formats.
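As a rough starting point for discussion, here is a minimal sketch of telling the three containers apart by their leading bytes. It is not Tika's detector and not a proposal for the final mime definitions; the function name `classify_container` and the returned labels are made up for illustration. It relies only on the well-known file signatures (`%PDF-` for PDF, `%!PS-Adobe-` for DSC-conforming PostScript, `EPSF-` in the first line for EPS, and the `C5 D0 D3 C6` DOS EPS binary header). It does not confirm that a file actually came from Illustrator; that would need a further check on the content, which is left open here.

```python
def classify_container(head: bytes) -> str:
    """Guess the container format from the first bytes of a file.

    Hypothetical helper for illustration only. Returns one of
    "pdf", "eps", "ps", or "unknown".
    """
    # DOS EPS binary header: EPS with an embedded preview image.
    if head.startswith(b"\xc5\xd0\xd3\xc6"):
        return "eps"
    # PDF header. Assumption: the header sits at offset 0; real-world
    # PDFs sometimes have leading junk, which this sketch ignores.
    if head.startswith(b"%PDF-"):
        return "pdf"
    # DSC-conforming PostScript; EPS adds "EPSF-x.y" on the same line.
    if head.startswith(b"%!PS-Adobe-"):
        first_line = head.splitlines()[0]
        return "eps" if b"EPSF-" in first_line else "ps"
    return "unknown"
```

For example, `classify_container(open(path, "rb").read(64))` would label a file by container only; whether the result should map to separate mime types or to parameters on one Illustrator type is exactly the open question in this issue.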
TIKA-2689 has some great resource links to help with this.
PRONOM has several format IDs for "Illustrator":
https://www.nationalarchives.gov.uk/PRONOM/Format/proFormatSearch.aspx?status=detailReport&id=1350
See also: https://bugs.ghostscript.com/show_bug.cgi?id=689926
--
This message was sent by Atlassian Jira
(v8.20.10#820010)