Hey all,

I'm running a parquet-mr build off of master and seeing some interesting behavior when using a DictionaryFilter to prune row groups. Basically, if I have an And or Or filter, the DictionaryPage object gets reused. This seems to be a problem for StreamBytesInput, because the stream is exhausted after the first toByteArray call.

My current workaround is to synchronize and reuse the byte array after the first read. I'd be curious what people think the best approach to solving this is, and whether we should be reusing the BytesInput at all.
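To make the failure mode and the workaround concrete, here is a minimal sketch using only java.io rather than the actual parquet-mr classes. CachedStreamBytes, drain, and StreamReuseDemo are hypothetical names made up for illustration; they only mimic the "read once, then reuse the array" idea and are not how StreamBytesInput is implemented.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;

/** Hypothetical helper (not a parquet-mr class): drains a stream once and caches the bytes. */
final class CachedStreamBytes {
    private final InputStream in;
    private byte[] cached; // null until the first read; guarded by "this"

    CachedStreamBytes(InputStream in) {
        this.in = in;
    }

    /** Drains the stream on the first call; later calls return the same cached array. */
    synchronized byte[] toByteArray() throws IOException {
        if (cached == null) {
            cached = drain(in);
        }
        return cached;
    }

    /** Reads all remaining bytes from the stream. */
    static byte[] drain(InputStream in) throws IOException {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        byte[] buf = new byte[4096];
        int n;
        while ((n = in.read(buf)) != -1) {
            out.write(buf, 0, n);
        }
        return out.toByteArray();
    }
}

class StreamReuseDemo {
    public static void main(String[] args) throws IOException {
        byte[] page = {1, 2, 3, 4};

        // Draining the same stream twice: the second pass finds nothing left.
        InputStream raw = new ByteArrayInputStream(page);
        System.out.println("first read:  " + CachedStreamBytes.drain(raw).length + " bytes");
        System.out.println("second read: " + CachedStreamBytes.drain(raw).length + " bytes");

        // With the cache, both calls see the full contents.
        CachedStreamBytes bytes = new CachedStreamBytes(new ByteArrayInputStream(page));
        System.out.println("cached, 1st: " + bytes.toByteArray().length + " bytes");
        System.out.println("cached, 2nd: " + bytes.toByteArray().length + " bytes");
    }
}
```

One trade-off in this sketch: every caller gets the same array instance, so a caller that mutates it affects all later readers. Returning a copy would avoid that at the cost of an extra allocation per call.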
Best,
Patrick
