wangyum commented on pull request #31393:
URL: https://github.com/apache/spark/pull/31393#issuecomment-771464400


   Benchmark with production data:
   ```sql
   CREATE TABLE test11.benchmark_column_index_2 using parquet
   CLUSTERED BY (FDBK_RCVR_USER_ID) SORTED BY (FDBK_GIVER_USER_ID) into 2000 
buckets
   AS
   (SELECT * FROM test11.origin_data);
   
   SELECT col... FROM test11.benchmark_column_index_2 WHERE FDBK_GIVER_USER_ID 
= 992647107 AND ...
   ```
   
   Enable column index | Disable column index
   -- | --
   
![image](https://user-images.githubusercontent.com/5399861/106572923-fe7a3b80-6573-11eb-89d5-47a809a1642d.png)|![image](https://user-images.githubusercontent.com/5399861/106572597-97f51d80-6573-11eb-8f21-4477b05698e3.png)


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to