[
https://issues.apache.org/jira/browse/TIKA-1329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14027794#comment-14027794
]
Nick Burch commented on TIKA-1329:
----------------------------------
The Tika App already has something similar, for extracting out the contents. If
we want to have the Tika App or Tika Server recurse, we'll need to give some
though to where in the document we output the nested contents, how we best
report metadata or document headers, and that sort of thing.
I suspect that the result will be something a bit more fully featured than the
examples on the wiki, which are deliberately simple to provide a starting point
rather than a solution.
On a related thing, we might want to look at promoting those examples from the
wiki to the main site as "starting points", and also include links to "fully
featured" implementations as examples, such as the ones in the tika app
> Add RecursiveParserWrapper aka Jukka's (and Nick's) RecursiveMetadataParser
> ---------------------------------------------------------------------------
>
> Key: TIKA-1329
> URL: https://issues.apache.org/jira/browse/TIKA-1329
> Project: Tika
> Issue Type: Improvement
> Components: parser
> Reporter: Tim Allison
> Priority: Minor
> Fix For: 1.7
>
>
> Jukka and Nick have a great demo of parsing metadata recursively on the
> [wiki|http://wiki.apache.org/tika/RecursiveMetadata]. For TIKA-1302, I'd
> like to use something similar, and I think that others may find it useful for
> tika-app and tika-server.
> I took the code from the wiki and made some modifications. I'm not sure if
> we should put this in parsers or in a new module for "examples." Given that
> I think this would be useful for tika-app and tika-server, I'd prefer
> parsers, but I'm open to any input...including "let's not."
> I opened up a review board issue here:
> [rb|http://reviews.apache.org/r/22433]
--
This message was sent by Atlassian JIRA
(v6.2#6252)