Daniel Becker has posted comments on this change.
( http://gerrit.cloudera.org:8080/21605 )

Change subject: IMPALA-13247: Support Reading Puffin files for the current 
snapshot
......................................................................


Patch Set 13:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/21605/12//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/21605/12//COMMIT_MSG@20
PS12, Line 20: 'disable_reading_puffin_stats' startup flag - when it is set to true,
             : Impala will never read Puffin stats
> I think a query option would be more convenient. We usually use query optio
As discussed offline, I kept the startup flag and also added a table property.
If another writer writes incorrect stats for a specific table, reading them can
be disabled for that table with the table property.
If many tables are affected, the startup flag can be used to disable reading
Puffin stats globally.
Unfortunately, query options are not available here because the code runs in
catalogd, and not necessarily during a query.
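
To make the intended precedence concrete, here is a minimal sketch of how the
two switches could combine. It is only an illustration, not the code in the
patch: the class name, the method name, and the table property key are
hypothetical placeholders. The only assumption taken from the discussion above
is that the global startup flag wins and the table property is consulted
per table otherwise.

```java
import java.util.Map;

// Illustrative sketch only; not the actual Impala implementation.
class PuffinStatsGate {
  // Hypothetical table property key, chosen for this example only.
  static final String DISABLE_PROP = "example.disable_reading_puffin_stats";

  // Returns true if Puffin stats should be read for a table.
  // startupFlagDisables: value of the global startup flag (true = never read).
  // tableProps: the table's properties; a "true" value for DISABLE_PROP
  // disables reading for this table only.
  static boolean shouldReadPuffinStats(
      boolean startupFlagDisables, Map<String, String> tableProps) {
    if (startupFlagDisables) return false;  // global switch takes precedence
    // Boolean.parseBoolean(null) is false, so a missing property means "read".
    return !Boolean.parseBoolean(tableProps.get(DISABLE_PROP));
  }
}
```

With this shape, a single table with bad stats from another writer can be
excluded through its property, while the startup flag covers the case where
many tables are affected.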


http://gerrit.cloudera.org:8080/#/c/21605/6/fe/src/main/java/org/apache/impala/catalog/IcebergTable.java
File fe/src/main/java/org/apache/impala/catalog/IcebergTable.java:

http://gerrit.cloudera.org:8080/#/c/21605/6/fe/src/main/java/org/apache/impala/catalog/IcebergTable.java@471
PS6, Line 471: r (Map.Entry<Integer, PuffinStatsLoader.P
> However, I tried setting this table property, then setting NDV
 > manually (as an HMS stat), and that is not capped. In this case I
 > think it would be strange to cap the Puffin NDV value while not
 > doing the same with the HMS NDV value. Probably we should leave it
 > as it is in this patch and maybe open a Jira to cap the values in
 > both cases. What do you think?

@Riza Suminto what do you think of this?



--
To view, visit http://gerrit.cloudera.org:8080/21605
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I50c1228988960a686d08a9b2942e01e366678866
Gerrit-Change-Number: 21605
Gerrit-PatchSet: 13
Gerrit-Owner: Daniel Becker <daniel.bec...@cloudera.com>
Gerrit-Reviewer: Csaba Ringhofer <csringho...@cloudera.com>
Gerrit-Reviewer: Daniel Becker <daniel.bec...@cloudera.com>
Gerrit-Reviewer: Gabor Kaszab <gaborkas...@cloudera.com>
Gerrit-Reviewer: Impala Public Jenkins <impala-public-jenk...@cloudera.com>
Gerrit-Reviewer: Noemi Pap-Takacs <npaptak...@cloudera.com>
Gerrit-Reviewer: Peter Rozsa <pro...@cloudera.com>
Gerrit-Reviewer: Riza Suminto <riza.sumi...@cloudera.com>
Gerrit-Reviewer: Zoltan Borok-Nagy <borokna...@cloudera.com>
Gerrit-Comment-Date: Mon, 23 Sep 2024 13:35:20 +0000
Gerrit-HasComments: Yes
