Github user nsyca commented on the issue:

    https://github.com/apache/spark/pull/17240
  
    @wzhfy wrote:
    *"usually big tables (fact table) have more columns than small tables, so 
cardinality and size is positively correlated"*
    
    I am aware this may not be the forum to discuss design issues, so I will 
keep it brief. In general, fact tables contain the foreign key columns that 
reference the primary keys of the dimension tables, plus a few measure columns 
(usually numbers), while the dimension tables contain the descriptions of the 
entities they represent. A fact table can therefore have many rows but few 
columns, so row count and column count need not be positively correlated. An 
example is the STORE_SALES and STORE tables in the TPC-DS schema.
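
    A rough sketch of the point, using an abbreviated subset of the STORE_SALES 
and STORE columns. The column lists are shortened and the row counts are made 
up for illustration; this is not the full TPC-DS DDL.

```python
# Abbreviated, illustrative subset of the TPC-DS STORE_SALES (fact) and
# STORE (dimension) tables. Row counts are assumed values, not real data.
store_sales = {
    "rows": 2_880_000,  # assumed: the fact table has many rows
    "columns": [
        # foreign keys pointing at dimension primary keys
        "ss_sold_date_sk", "ss_item_sk", "ss_customer_sk", "ss_store_sk",
        # a few numeric measures
        "ss_quantity", "ss_sales_price", "ss_net_profit",
    ],
}

store = {
    "rows": 12,  # assumed: the dimension table has few rows
    "columns": [
        "s_store_sk",  # primary key
        # descriptive attributes of the entity
        "s_store_id", "s_store_name", "s_number_employees", "s_floor_space",
        "s_hours", "s_manager", "s_market_desc", "s_street_name",
        "s_city", "s_state", "s_zip",
    ],
}


def summarize(table):
    """Return (row count, column count) for a table description."""
    return table["rows"], len(table["columns"])


fact_rows, fact_width = summarize(store_sales)
dim_rows, dim_width = summarize(store)

print(f"store_sales: {fact_rows} rows, {fact_width} columns")
print(f"store:       {dim_rows} rows, {dim_width} columns")
```

    In this sketch the fact table has far more rows but fewer columns than the 
dimension table, so a size estimate based on the number of columns alone would 
rank the two tables in the wrong order.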

