On 10/13/07, Keith R. Bennett <[EMAIL PROTECTED]> wrote:

> ...In order to accomplish the reading of an original resource only once, we
> have the RereadableInputStream....

Is that used for all parsers?

If yes, we should use it only where needed, by asking each parser
whether it needs it. Or, more precisely, whether it needs a "no-rewind",
"small-rewind" or "all-rewind" input stream.
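
A rough sketch of what asking the parser could look like. All names here
(RewindNeed, RewindAware, streamFor, the 8192-byte limit) are hypothetical
and not existing Tika API, and the in-memory copy in the "all-rewind" case
is only a stand-in for whatever RereadableInputStream actually does:

```java
import java.io.BufferedInputStream;
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;

public class RewindSketch {

    /** Hypothetical: how far back a parser may need to re-read its input. */
    enum RewindNeed { NO_REWIND, SMALL_REWIND, ALL_REWIND }

    /** Hypothetical interface a parser would implement to state its need. */
    interface RewindAware {
        RewindNeed rewindNeed();
    }

    /** Assumed cap on a "small" rewind, in bytes. */
    static final int SMALL_REWIND_LIMIT = 8192;

    /** Picks the cheapest stream that satisfies the parser's stated need. */
    static InputStream streamFor(RewindAware parser, InputStream raw) throws IOException {
        switch (parser.rewindNeed()) {
            case NO_REWIND:
                // Parser reads once, front to back: hand over the stream untouched.
                return raw;
            case SMALL_REWIND:
                // Parser may reset() after peeking at up to SMALL_REWIND_LIMIT bytes.
                InputStream buffered = new BufferedInputStream(raw, SMALL_REWIND_LIMIT);
                buffered.mark(SMALL_REWIND_LIMIT);
                return buffered;
            default:
                // ALL_REWIND: copy everything so reset() returns to the start.
                // This is where the memory or disk cost is paid.
                ByteArrayOutputStream copy = new ByteArrayOutputStream();
                byte[] chunk = new byte[4096];
                int n;
                while ((n = raw.read(chunk)) != -1) {
                    copy.write(chunk, 0, n);
                }
                return new ByteArrayInputStream(copy.toByteArray());
        }
    }
}
```

With something like this, only parsers that declare ALL_REWIND would pay
for the full copy, and the caller never has to make that choice.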

Then, we could document which parsers use which stream type, so that
people know which file types are likely to cause resource problems or
writes to disk.

I'd prefer this to be an internal concern of Tika, rather than putting
the burden on the user to decide if the input can be read several
times safely. Unless someone really needs that feature now, of course.

-Bertrand
