[
https://issues.apache.org/jira/browse/TIKA-1283?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13983167#comment-13983167
]
Jukka Zitting commented on TIKA-1283:
-------------------------------------
I'm not sure if it's a good idea to extract thumbnail images as regular
embedded resource. A thumbnail is not similarly a "part of" the document like
an embedded image or an attached file. Instead a thumbnail is used to
"describe" or "visualize" a document, and thus would IMHO be better expressed
as a part of document metadata as suggested in TIKA-90.
> Add "thumbnail" as possible metadata item to TikaCoreProperties
> ---------------------------------------------------------------
>
> Key: TIKA-1283
> URL: https://issues.apache.org/jira/browse/TIKA-1283
> Project: Tika
> Issue Type: Improvement
> Components: metadata
> Reporter: Tim Allison
> Priority: Minor
>
> TIKA-90 originally requested to add thumbnails to a document's metadata.
> I'd like to have a unified way of determining whether an embedded
> document/resource is a thumbnail or a regular attachment.
> With the changes in TIKA-1223 (ooxml) and TIKA-1010 (rtf), we are now pulling
> out more thumbnails than before.
> I propose adding "tika:thumbnail" to the metadata of each thumbnail image.
> The consumer can then determine what to do with the embedded resource based
> on the metadata.
--
This message was sent by Atlassian JIRA
(v6.2#6252)