wangyum commented on pull request #31393: URL: https://github.com/apache/spark/pull/31393#issuecomment-771464400
Benchmark with production data: ```sql CREATE TABLE test11.benchmark_column_index_2 using parquet CLUSTERED BY (FDBK_RCVR_USER_ID) SORTED BY (FDBK_GIVER_USER_ID) into 2000 buckets AS (SELECT * FROM test11.origin_data); SELECT col... FROM test11.benchmark_column_index_2 WHERE FDBK_GIVER_USER_ID = 992647107 AND ... ``` Enable column index | Disable column index -- | -- | ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
