[ https://issues.apache.org/jira/browse/TIKA-1509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14276105#comment-14276105 ]
Nick Burch commented on TIKA-1509:
----------------------------------
First up is probably some sort of composite / decorator parser that handles the
different strategies. After that, it's probably something to allow a Tika Config
XML file to include or exclude parsers for a mime type, as well as request these.
(Coding-wise, these aren't too bad, but I'm struggling to come up with good
names that don't clash with the names of what we already have...)
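
To make the composite / strategy idea a bit more concrete, here is a rough, self-contained sketch. It deliberately does not use any Tika classes: `SimpleParser`, `SelectionStrategy`, `CompositeParser`, `FIRST_BY_NAME` and `FALLBACK` are all hypothetical names invented for illustration, and the real design may well look quite different. `FIRST_BY_NAME` only approximates the "roughly alphabetic by class name" behaviour described in the issue.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.Comparator;
import java.util.List;
import java.util.Set;

// Hypothetical sketch only: none of these types are part of Tika.
public class ParserStrategySketch {

    /** Stand-in for a parser: a name, the mime types it claims, and a parse step. */
    interface SimpleParser {
        String name();
        Set<String> supportedTypes();
        String parse(String input) throws Exception;
    }

    /** Decides how to use the candidate parsers that claim a mime type. */
    interface SelectionStrategy {
        String apply(List<SimpleParser> candidates, String input) throws Exception;
    }

    /** Use only the candidate whose name sorts first. */
    static final SelectionStrategy FIRST_BY_NAME = (candidates, input) -> {
        SimpleParser first = candidates.stream()
                .min(Comparator.comparing(SimpleParser::name))
                .orElseThrow(() -> new IllegalStateException("no parser"));
        return first.parse(input);
    };

    /** Try each candidate in list order until one succeeds. */
    static final SelectionStrategy FALLBACK = (candidates, input) -> {
        Exception last = new IllegalStateException("no parser");
        for (SimpleParser p : candidates) {
            try {
                return p.parse(input);
            } catch (Exception e) {
                last = e; // remember the failure and try the next parser
            }
        }
        throw last;
    };

    /** Composite that filters parsers by mime type and delegates to a strategy. */
    static final class CompositeParser {
        private final List<SimpleParser> parsers;
        private final SelectionStrategy strategy;

        CompositeParser(List<SimpleParser> parsers, SelectionStrategy strategy) {
            this.parsers = parsers;
            this.strategy = strategy;
        }

        String parse(String mimeType, String input) throws Exception {
            List<SimpleParser> candidates = new ArrayList<>();
            for (SimpleParser p : parsers) {
                if (p.supportedTypes().contains(mimeType)) {
                    candidates.add(p);
                }
            }
            return strategy.apply(candidates, input);
        }
    }

    /** Test helper: a parser for one mime type that either fails or echoes its input. */
    static SimpleParser parser(String name, String mimeType, boolean fails) {
        return new SimpleParser() {
            public String name() { return name; }
            public Set<String> supportedTypes() { return Collections.singleton(mimeType); }
            public String parse(String input) throws Exception {
                if (fails) {
                    throw new Exception(name + " failed");
                }
                return name + ":" + input;
            }
        };
    }

    public static void main(String[] args) throws Exception {
        List<SimpleParser> parsers = new ArrayList<>();
        parsers.add(parser("AParser", "application/pdf", true));
        parsers.add(parser("BParser", "application/pdf", false));

        // AParser fails, so the fallback strategy moves on to BParser.
        System.out.println(new CompositeParser(parsers, FALLBACK).parse("application/pdf", "doc"));
    }
}
```

The point of splitting things this way is that the existing behaviour and any new behaviour are both just strategies plugged into the same composite, so users could opt in without the defaults changing.
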
When those parts are in place, users can opt into that behaviour. We can then
test it out, and get feedback!

Once we have that, we can then plan some breaking changes for 2.0 where we
alter defaults, allow for preferences / fancier ordering, make it easier to
control what you get, etc.

Let's try to keep things compatible for as long as possible :)
> Create configurable strategies for composite parsers
> ----------------------------------------------------
>
> Key: TIKA-1509
> URL: https://issues.apache.org/jira/browse/TIKA-1509
> Project: Tika
> Issue Type: Improvement
> Reporter: Tim Allison
>
> Several parsers can handle the same mime type, and we are currently ordering
> which parser is chosen (roughly) by the alphabetic order of the parser class
> name.
> Let's allow users to configure strategies for picking parsers.
> ***NOTE: this description is just a placeholder, will edit later.