[ 
https://issues.apache.org/jira/browse/TIKA-3971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Allison updated TIKA-3971:
------------------------------
    Description: 
On TIKA-2689, we plan to add detection for Illustrator files that are based 
on/wrapped in PDF files at parse time.  Illustrator files used to be eps or 
just ps.  We should figure out how we want to distinguish between these two or 
three formats.

TIKA-2689 has some great resource links to help with this.

Pronom has a bunch of ids for "Illustrator", summarized: 
http://justsolve.archiveteam.org/wiki/Adobe_Illustrator_Artwork

One example: 
https://www.nationalarchives.gov.uk/PRONOM/Format/proFormatSearch.aspx?status=detailReport&id=1350

See also: https://bugs.ghostscript.com/show_bug.cgi?id=689926

  was:
On TIKA-2689, we plan to add detection for Illustrator files that are based 
on/wrapped in PDF files at parse time.  Illustrator files used to be eps or 
just ps.  We should figure out how we want to distinguish between these two or 
three formats.

TIKA-2689 has some great resource links to help with this.

Pronom has a bunch of ids for "Illustrator":

https://www.nationalarchives.gov.uk/PRONOM/Format/proFormatSearch.aspx?status=detailReport&id=1350

See also: https://bugs.ghostscript.com/show_bug.cgi?id=689926


> Distinguish eps-based Adobe Illustrator files from pdf-based Illustrator files
> ------------------------------------------------------------------------------
>
>                 Key: TIKA-3971
>                 URL: https://issues.apache.org/jira/browse/TIKA-3971
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Minor
>
> On TIKA-2689, we plan to add detection for Illustrator files that are based 
> on/wrapped in PDF files at parse time.  Illustrator files used to be eps or 
> just ps.  We should figure out how we want to distinguish between these two 
> or three formats.
> TIKA-2689 has some great resource links to help with this.
> Pronom has a bunch of ids for "Illustrator", summarized: 
> http://justsolve.archiveteam.org/wiki/Adobe_Illustrator_Artwork
> One example: 
> https://www.nationalarchives.gov.uk/PRONOM/Format/proFormatSearch.aspx?status=detailReport&id=1350
> See also: https://bugs.ghostscript.com/show_bug.cgi?id=689926



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to