Fucun Chu has posted comments on this change. ( http://gerrit.cloudera.org:8080/16645 )
Change subject: IMPALA-10279: Import CPC functionality from DataSketches ...................................................................... Patch Set 6: (5 comments) Thanks for the review! Addressed the comments. http://gerrit.cloudera.org:8080/#/c/16645/6//COMMIT_MSG Commit Message: http://gerrit.cloudera.org:8080/#/c/16645/6//COMMIT_MSG@11 PS6, Line 11: First, I updated our existing snapshot of DataSketches to the : following commit:8ae0ab5703783c8c360d48e577a6d450d3514ece : "https://github.com/apache/incubator-datasketches-cpp/pull/172" > I see that your patch doesn't change any of the existing files in be/src/th All change files have been updated http://gerrit.cloudera.org:8080/#/c/16645/6/be/src/exprs/datasketches-test.cc File be/src/exprs/datasketches-test.cc: http://gerrit.cloudera.org:8080/#/c/16645/6/be/src/exprs/datasketches-test.cc@82 PS6, Line 82: get an estimate for quantiles > I think here you don't get the quantiles estimates but the cardinality esti Done http://gerrit.cloudera.org:8080/#/c/16645/6/be/src/exprs/datasketches-test.cc@101 PS6, Line 101: and prints the result > This code doesn't print the results. (and shouldn;t. I guess this is a copy Done http://gerrit.cloudera.org:8080/#/c/16645/6/be/src/exprs/datasketches-test.cc@114 PS6, Line 114: EXPECT_EQ(149796, (int)sketch.get_estimate()); > I'm not 100% familiar with the characteristics of the CPC algorithm. Are yo The test code comes from the official example: https://datasketches.apache.org/docs/CPC/CpcCppExample.html The estimate is 149796.506. When lg_k is 11, the estimate is 150967.1274. http://gerrit.cloudera.org:8080/#/c/16645/6/be/src/thirdparty/datasketches/README.md File be/src/thirdparty/datasketches/README.md: http://gerrit.cloudera.org:8080/#/c/16645/6/be/src/thirdparty/datasketches/README.md@12 PS6, Line 12: The git branch of the snapshot I used as a source for the files: 2.1.0-incubating : The hash: c1a6f8edb49699520f248d3d02019b87429b4241 : : Browse the source files here: : https://github.com/apache/incubator-datasketches-cpp/tree/2.1.0-incubating > In case you updated the content of this folder to a newer version of Apache Done -- To view, visit http://gerrit.cloudera.org:8080/16645 To unsubscribe, visit http://gerrit.cloudera.org:8080/settings Gerrit-Project: Impala-ASF Gerrit-Branch: master Gerrit-MessageType: comment Gerrit-Change-Id: I1e8d0c2e80df95fa84af82c64d493df9bbb34a8c Gerrit-Change-Number: 16645 Gerrit-PatchSet: 6 Gerrit-Owner: Fucun Chu <[email protected]> Gerrit-Reviewer: Fucun Chu <[email protected]> Gerrit-Reviewer: Gabor Kaszab <[email protected]> Gerrit-Reviewer: Impala Public Jenkins <[email protected]> Gerrit-Comment-Date: Sun, 01 Nov 2020 08:29:03 +0000 Gerrit-HasComments: Yes
