Hi,

On 10/13/07, Bertrand Delacretaz <[EMAIL PROTECTED]> wrote:
> On 10/13/07, Keith R. Bennett <[EMAIL PROTECTED]> wrote:
> > ...In order to accomplish the reading of an original resource only once, we
> > have the RereadableInputStream....
>
> Is that used for all parsers?

I actually managed to get rid of the re-reading of the input stream in
the Microsoft parsers, see TIKA-63.

> I'd prefer this to be an internal concern of Tika, rather than putting
> the burden on the user to decide if the input can be read several
> times safely. Unless someone really needs that feature now, of course.

+1 The parser class should have the best knowledge on how many passes
will be required for reading the stream. Most of the times times I
guess a parser will either need just a single pass or will just read
the whole document to memory.

BR,

Jukka Zititng

Reply via email to