[
https://issues.apache.org/jira/browse/TIKA-1451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14182674#comment-14182674
]
Tim Allison commented on TIKA-1451:
-----------------------------------
Thank you, Chris. The credit goes to [~jukkaz] and [~gagravarr] for the
recursive parser example! I'm grateful to now have an out-of-the-box format
(w/ serializers and deserializers) that captures embedded document metadata.
As I was working on this, I was starting to think that we might want to add
some "tika:" prefixed properties to TikaCoreProperties to capture metadata
generated during processing, such as: tika:content, tika:parse_time_millis,
tika:exception, tika:parsed_by (instead of our current X-Parsed-By). In
effect, move the RecursiveParserWrapper properties to TikaCoreProperties and
add some others as necessary.
> Add Recursive Metadata Parser Wrapper output to tika-app and gui
> ----------------------------------------------------------------
>
> Key: TIKA-1451
> URL: https://issues.apache.org/jira/browse/TIKA-1451
> Project: Tika
> Issue Type: Improvement
> Reporter: Tim Allison
> Priority: Minor
> Fix For: 1.7
>
> Attachments: integrate_recursive_metadata_wrapper.patch
>
>
> It would be helpful to expose the output of the recursive metadata parser
> wrapper in the gui and in the command line for tika-app.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)