[
https://issues.apache.org/jira/browse/TIKA-1509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14338344#comment-14338344
]
Luis Filipe Nassif commented on TIKA-1509:
------------------------------------------
I was thinking about the need to buffer large InputStreams, or to spool them to
a temp file, so that a second or third parser can parse them. Maybe we can
create an InputStreamSource interface and set it on TikaInputStream, so that we
could call tikaInputStream.getInputStreamSource().getInputStream() before each
additional parse, and CompositeParser would not need to create temp files for
types with two or more parsers? Users could implement the
InputStreamSource.getInputStream() method and set the InputStreamSource on
TikaInputStream. If no InputStreamSource is set, we can fall back to the temp
file approach.
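
To make the idea concrete, here is a rough sketch of what I have in mind. It
does not use any Tika classes: InputStreamSource is the interface proposed
above (it does not exist in Tika today), and SimpleParser, MultiParserSketch,
parseWithAll and readAll are hypothetical names made up for this illustration.
The temp file fallback is left out.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

// Proposed interface (an assumption, not part of the Tika API):
// each call must return a fresh stream positioned at the start of the data.
interface InputStreamSource {
    InputStream getInputStream() throws IOException;
}

// Hypothetical stand-in for a parser; real Tika parsers have a different signature.
interface SimpleParser {
    String parse(InputStream in) throws IOException;
}

final class MultiParserSketch {

    // Runs every parser on its own fresh stream obtained from the source,
    // so nothing has to be buffered or spooled between parsers.
    static List<String> parseWithAll(InputStreamSource source,
                                     List<SimpleParser> parsers) throws IOException {
        List<String> results = new ArrayList<>();
        for (SimpleParser parser : parsers) {
            try (InputStream in = source.getInputStream()) {
                results.add(parser.parse(in));
            }
        }
        return results;
    }

    // Helper that reads a whole stream as UTF-8 text.
    static String readAll(InputStream in) throws IOException {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        byte[] buf = new byte[4096];
        int n;
        while ((n = in.read(buf)) != -1) {
            out.write(buf, 0, n);
        }
        return new String(out.toByteArray(), StandardCharsets.UTF_8);
    }
}
```

A user who already has the data in a re-readable place (a file, a byte array, a
blob store) could supply something like
"InputStreamSource src = () -> new FileInputStream(file);" and each parser
would then get its own stream from the start of the data.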
> Create configurable strategies for composite parsers
> ----------------------------------------------------
>
> Key: TIKA-1509
> URL: https://issues.apache.org/jira/browse/TIKA-1509
> Project: Tika
> Issue Type: Improvement
> Reporter: Tim Allison
>
> Several parsers can handle the same mime type, and we are currently ordering
> which parser is chosen (roughly) by the alphabetic order of the parser class
> name.
> Let's allow users to configure strategies for picking parsers.
> See and contribute to full discussion here:
> http://wiki.apache.org/tika/CompositeParserDiscussion
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)