[
https://issues.apache.org/jira/browse/CALCITE-4144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17165996#comment-17165996
]
Stamatis Zampetakis edited comment on CALCITE-4144 at 7/27/20, 10:26 PM:
-------------------------------------------------------------------------
I think I understand the problem. [~zhangchenghui] is using both interpreter
mode (aka. {{BindableConvention}}) and compiler mode (aka.
{{EnumerableConvention}}) as I think it is the case for many people using the
default ruleset. Thus, I assume that some operators are in
{{BindableConvention}} and some on {{EnumerableConvention}}.
The {{BINDABLE_CACHE}} is taking care of class generation coming from the
compiler but there is no such cache for the interpreter. In my mind, I was
thinking that interpreter should never generate and compile code however, as it
is observed in this case some part of the interpreter ({{JaninoRexCompiler}})
does it.
I see 3 ways to address this issue:
* Implement a {{ScalarCompiler}} that does not generate code (such as
{{JaninoRexCompiler}}) but interprets expressions.
[FooCompiler|https://github.com/apache/calcite/blob/b306668d796a19ade127dce7b392f970cffd4b39/core/src/main/java/org/apache/calcite/interpreter/Interpreter.java#L141]
outlines how the code would look like. However, I don't know anybody pushing
in this direction so this might never actually happen.
* Make the {{JaninoRexCompiler}} to also use the BINDABLE_CACHE. This is
mostly code refactoring since all the pieces are there.
* Use exclusively {{Enumerable}} operators during planning which already
exploit the BINDABLE_CACHE.
was (Author: zabetak):
I think I understand the problem. [~zhangchenghui] is using both interpreter
mode (aka. {{BindableConvention}}) and compiler mode (aka.
{{EnumerableConvention}}) as I think it is the case for many people using the
default ruleset. Thus, I assume that some operators are in
{{BindableConvention}} and some on {{EnumerableConvention}}.
The {{BINDABLE_CACHE}} is taking care of class generation coming from the
compiler but not there is no such cache for the interpreter. In my mind, I was
thinking that interpreter should never generate and compile code however, as it
is observed in this case some part of the interpreter ({{JaninoRexCompiler}})
does it.
I see 3 ways to address this issue:
* Implement a {{ScalarCompiler}} that does not generate code (such as
{{JaninoRexCompiler}}) but interprets expressions.
[FooCompiler|https://github.com/apache/calcite/blob/b306668d796a19ade127dce7b392f970cffd4b39/core/src/main/java/org/apache/calcite/interpreter/Interpreter.java#L141]
outlines how the code would look like. However, I don't know anybody pushing
in this direction so this might never actually happen.
* Make the {{JaninoRexCompiler}} to also use the BINDABLE_CACHE. This is
mostly code refactoring since all the pieces are there.
* Use exclusively {{Enumerable}} operators during planning which already
exploit the BINDABLE_CACHE.
> Reduce code generation and class loading overhead when getScalar in
> JaninoRexCompiler.
> --------------------------------------------------------------------------------------
>
> Key: CALCITE-4144
> URL: https://issues.apache.org/jira/browse/CALCITE-4144
> Project: Calcite
> Issue Type: Improvement
> Components: core
> Affects Versions: 1.21.0
> Environment: version: calcite-core 1.21
> model: filterableTable
> Reporter: zhangchenghui
> Priority: Major
> Labels: cache, scalar
> Fix For: 1.25.0
>
> Attachments: image-2020-07-27-22-44-44-455.png,
> image-2020-07-27-22-45-25-350.png, image-2020-07-27-22-45-54-346.png,
> image-2020-07-27-22-46-18-306.png
>
> Original Estimate: 96h
> Remaining Estimate: 96h
>
> I used the FilterableTable mode in the project, but I found that the query
> was particularly slow. I used the JProfile tool to troubleshoot the thread
> time-consuming place, and then through the debug, I found that each request
> took two places:
> 1、org.apache.calcite.adapter.enumerable.EnumerableInterpretable#getBindable
> !image-2020-07-27-22-44-44-455.png!
> This place optimizes the cache settings by setting the cache size.
> 2、org.apache.calcite.interpreter.JaninoRexCompiler#baz
> !image-2020-07-27-22-45-25-350.png!
> But this place is not cached, and a new expression string is used every time
> to create it through reflection.
> JProfile tool time consumption:
> !image-2020-07-27-22-45-54-346.png!
> I originally wanted to add a layer of cache here, but found that the
> expressions generated each time are different, as follows:
> !image-2020-07-27-22-46-18-306.png!
> So you can't use the getBindable method to directly add cache.
> Is there anything you can optimize here? For example, can you generate a
> template class in advance, and then generate different objects through
> different parameter values?
> Bring tea to the boss!
> Looking forward to your reply!
--
This message was sent by Atlassian Jira
(v8.3.4#803005)