Don't trust streams that claim mark support
-------------------------------------------

                 Key: TIKA-388
                 URL: https://issues.apache.org/jira/browse/TIKA-388
             Project: Tika
          Issue Type: Improvement
          Components: parser
            Reporter: Jukka Zitting
            Priority: Minor


As seen on tika-dev@ and in JCR-2576, there are some InputStream 
implementations that claim to support the mark feature, but lose the mark as 
soon as the end of stream has been reached. There's no way for a client to 
detect such behaviour, so it's probably best for Tika to always use 
BufferedInputStream to wrap incoming streams when mark support is needed. This 
may cause one layer of extra buffering, but avoids problems with such broken 
streams.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to