[
https://issues.apache.org/jira/browse/TIKA-442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12879532#action_12879532
]
Nick Burch commented on TIKA-442:
---------------------------------
OK, I'll work up a patch that uses these keys, hopefully some time next week
If you get a chance in the mean time, do please add those entries to the XMPDM
class ready :)
> Image extractors use inconsistent metadata keys and formats for common
> features
> -------------------------------------------------------------------------------
>
> Key: TIKA-442
> URL: https://issues.apache.org/jira/browse/TIKA-442
> Project: Tika
> Issue Type: Improvement
> Components: metadata, parser
> Affects Versions: 0.7
> Reporter: Nick Burch
> Priority: Minor
>
> Currently Tika has a number of parsers for image formats, but the way they
> return their data is inconsistent. For example:
> Jpeg: "Image Width" = "420 pixels", "Data Precision" = "8 bits"
> Gif: "width" = "420"
> Png: "width" = "420", "IHDR" = ".... bitDepth = 8 ....."
> Bmp: "width" = "420", "BitsPerSample" = "8 8 8"
> I think that the common keys, such as width and height, should be returned in
> a consistent format of key and value. If someone would like to suggest the
> namespace for this (maybe under XMDPM), and the short or long form (eg 420 vs
> 420 pixels), then I'm happy to work up a patch for this
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.