zanmato1984 commented on issue #44513: URL: https://github.com/apache/arrow/issues/44513#issuecomment-2550342799
> Cardinality can refer to different things. In a database context, cardinality usually refers to the number of unique values in a relational table column relative to the total number of rows in the table. So, if are both talking about the same, cardinality is presented in the report above, `cardinality_percentage = (unique_count / total_rows)*100` But "cardinality" can also represent the size of the join result which is what I originally asked about. Do you have that? (You can just run the right join and count the number of rows). And thank you for the source files. I'll try to reproduce the issue using these files in my local. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org