GitHub user jihoonson commented on the pull request:
https://github.com/apache/tajo/pull/884#issuecomment-166503160
Well, there are still some issues around fetch timeouts. When I tested this
patch against a 10TB dataset, a lot of fetch timeouts occurred while
transferring intermediate data between stages. The main reason seems to be that
index lookup takes a long time (over 30 seconds on a cache miss). So, I think
the fundamental solution is to improve index search performance, which needs to
be handled in another JIRA issue.
So, I'd like to suggest increasing the fetch timeout temporarily. With a doubled
timeout (that is, 120 seconds), everything was OK against the 10TB dataset.
@hyunsik, @jinossy what do you think?