Github user hyunsik commented on the pull request:
https://github.com/apache/tajo/pull/113#issuecomment-53154532
Hi @blrunner,
If CODEGEN session variable is enabled, all queries will work by using code
generation. But, currently, we cannot expect its performance improvement. The
code generation is designed to avoid Datum objects creations during any
computations and reduce interpretation overheads like branches.
But, in order to keep the compatibility against our current Tuple and Datum
mechanism, code generation feature still create lots of Datum objects.
Currently, I'm working new tuple structure using direct memory, which uses a
sequence of bytes as a row blocks instead of an array of Datum objects. After
than, it will give big performance benefits.
Thanks!
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---