[
https://issues.apache.org/jira/browse/CALCITE-4223?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17203237#comment-17203237
]
Chunwei Lei commented on CALCITE-4223:
--------------------------------------
{quote}I don't know why Flink and Drill have not integrated their statistics
into Calcite. Maybe they didn't know how. They could have asked. Or we could
have written better documentation.
{quote}
I think it is not a good practice if users have to ask the community how to do
it and how to do it correctly. I have to admit that my proposal is not as
extendable as expected. What I want to do is that introducing column statistics
and taking advantage of them explicitly. BTW, it would be great if you can give
a PR to show how to introduce column statistics in the way you think it should
be. Thank you for your time~~
> Introducing column statistics to RelOptTable
> --------------------------------------------
>
> Key: CALCITE-4223
> URL: https://issues.apache.org/jira/browse/CALCITE-4223
> Project: Calcite
> Issue Type: Improvement
> Reporter: Chunwei Lei
> Assignee: Chunwei Lei
> Priority: Major
> Labels: pull-request-available
> Time Spent: 40m
> Remaining Estimate: 0h
>
> Many systems depend on column statistics to compute more accurate stats, such
> as NDV, average column size, and so on. It would be nice if Calcite can
> provide such an interface.
> Column statistics might include NDV, average/max column length, number of
> nulls, number of trues, number of falses and so on.
> What do you think?
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)