zanmato1984 commented on issue #44513:
URL: https://github.com/apache/arrow/issues/44513#issuecomment-2550342799

   > Cardinality can refer to different things. In a database context, 
cardinality usually refers to the number of unique values in a relational table 
column relative to the total number of rows in the table. So, if are both 
talking about the same, cardinality is presented in the report above, 
`cardinality_percentage = (unique_count / total_rows)*100`
   
   But "cardinality" can also represent the size of the join result which is 
what I originally asked about. Do you have that? (You can just run the right 
join and count the number of rows).
   
   And thank you for the source files. I'll try to reproduce the issue using 
these files in my local.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to