Fucun Chu has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16645 )

Change subject: IMPALA-10279: Import CPC functionality from DataSketches
......................................................................


Patch Set 6:

(5 comments)

Thanks for the review! Addressed the comments.

http://gerrit.cloudera.org:8080/#/c/16645/6//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/16645/6//COMMIT_MSG@11
PS6, Line 11: First, I updated our existing snapshot of DataSketches to the
            : following commit:8ae0ab5703783c8c360d48e577a6d450d3514ece
            : "https://github.com/apache/incubator-datasketches-cpp/pull/172";
> I see that your patch doesn't change any of the existing files in be/src/th
All change files have been updated


http://gerrit.cloudera.org:8080/#/c/16645/6/be/src/exprs/datasketches-test.cc
File be/src/exprs/datasketches-test.cc:

http://gerrit.cloudera.org:8080/#/c/16645/6/be/src/exprs/datasketches-test.cc@82
PS6, Line 82: get an estimate for quantiles
> I think here you don't get the quantiles estimates but the cardinality esti
Done


http://gerrit.cloudera.org:8080/#/c/16645/6/be/src/exprs/datasketches-test.cc@101
PS6, Line 101: and prints the result
> This code doesn't print the results. (and shouldn;t. I guess this is a copy
Done


http://gerrit.cloudera.org:8080/#/c/16645/6/be/src/exprs/datasketches-test.cc@114
PS6, Line 114:     EXPECT_EQ(149796, (int)sketch.get_estimate());
> I'm not 100% familiar with the characteristics of the CPC algorithm. Are yo
The test code comes from the official example:
https://datasketches.apache.org/docs/CPC/CpcCppExample.html
The estimate is 149796.506.
When lg_k is 11, the estimate is 150967.1274.


http://gerrit.cloudera.org:8080/#/c/16645/6/be/src/thirdparty/datasketches/README.md
File be/src/thirdparty/datasketches/README.md:

http://gerrit.cloudera.org:8080/#/c/16645/6/be/src/thirdparty/datasketches/README.md@12
PS6, Line 12: The git branch of the snapshot I used as a source for the files: 
2.1.0-incubating
            : The hash: c1a6f8edb49699520f248d3d02019b87429b4241
            :
            : Browse the source files here:
            : 
https://github.com/apache/incubator-datasketches-cpp/tree/2.1.0-incubating
> In case you updated the content of this folder to a newer version of Apache
Done



--
To view, visit http://gerrit.cloudera.org:8080/16645
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I1e8d0c2e80df95fa84af82c64d493df9bbb34a8c
Gerrit-Change-Number: 16645
Gerrit-PatchSet: 6
Gerrit-Owner: Fucun Chu <[email protected]>
Gerrit-Reviewer: Fucun Chu <[email protected]>
Gerrit-Reviewer: Gabor Kaszab <[email protected]>
Gerrit-Reviewer: Impala Public Jenkins <[email protected]>
Gerrit-Comment-Date: Sun, 01 Nov 2020 08:29:03 +0000
Gerrit-HasComments: Yes

Reply via email to