Github user hyunsik commented on the pull request:
https://github.com/apache/tajo/pull/75#issuecomment-49011928
Hi @davidzchen,
We just need current reading offset in order to get the task progress.
Also, we need writing offset and the current memory buffer size in order to
estimate the output file size to be written. In addition to them, there is no
other purpose. If you know a better approach for them, feel free to suggest us.
In addition, we will ask these features to Parquet community. BTW, I think
that we need this feature right now.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---