Parsers can't get at an underlying TikaInputStream to get the file if they
wanted one
-------------------------------------------------------------------------------------
Key: TIKA-645
URL: https://issues.apache.org/jira/browse/TIKA-645
Project: Tika
Issue Type: Bug
Components: parser
Affects Versions: 0.9
Reporter: Nick Burch
Spotted this with the office parser, but it should apply to parsers in general.
The user creates a TikaInputStream and passes it to the parser framework. The
Parser that is called may wish to detect that the input is a File-backed
TikaInputStream and take a shortcut by using the File instead of the InputStream.
However, what the parser actually receives is a TaggedInputStream wrapping a
CountingInputStream wrapping the original TikaInputStream. Because the outermost
object is not a TikaInputStream, the parser can't get at the file.
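
A minimal sketch of the problem, using only java.io so it runs without Tika on
the classpath. The class names FileBackedStream and PassThroughWrapper are
hypothetical stand-ins (for a File-backed TikaInputStream and for wrappers such
as TaggedInputStream / CountingInputStream respectively); they are not Tika's
actual classes, and fileOrNull is an assumed helper showing the kind of check a
parser might attempt.

```java
import java.io.ByteArrayInputStream;
import java.io.File;
import java.io.FilterInputStream;
import java.io.InputStream;

public class WrappedStreamDemo {

    // Hypothetical stand-in for a File-backed TikaInputStream.
    static class FileBackedStream extends FilterInputStream {
        private final File file;

        FileBackedStream(InputStream in, File file) {
            super(in);
            this.file = file;
        }

        File getFile() {
            return file;
        }
    }

    // Hypothetical stand-in for a wrapper that simply delegates to the
    // stream it wraps and exposes nothing about it.
    static class PassThroughWrapper extends FilterInputStream {
        PassThroughWrapper(InputStream in) {
            super(in);
        }
    }

    // The shortcut a parser would like to take: use the File if the
    // stream it was handed is File-backed, otherwise return null.
    static File fileOrNull(InputStream stream) {
        if (stream instanceof FileBackedStream) {
            return ((FileBackedStream) stream).getFile();
        }
        return null;
    }

    public static void main(String[] args) {
        File file = new File("example.doc");
        InputStream original =
                new FileBackedStream(new ByteArrayInputStream(new byte[0]), file);

        // What the parser sees: two wrappers around the original stream.
        InputStream seenByParser =
                new PassThroughWrapper(new PassThroughWrapper(original));

        System.out.println(fileOrNull(original) != null);     // true
        System.out.println(fileOrNull(seenByParser) != null); // false
    }
}
```

The instanceof check succeeds on the original stream but fails once it is
wrapped, since FilterInputStream keeps the wrapped stream in a protected field
with no public accessor.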