gabotechs commented on issue #1837: URL: https://github.com/apache/datafusion-ballista/issues/1837#issuecomment-4650466962
I don't think this happens with the standard TPC-DS query files. In https://github.com/datafusion-contrib/datafusion-distributed, the code that generates the TPC-DS Parquet files converts some columns to dictionary encoding so that the benchmarks cover more surface area. It is probably easier to reproduce this with more narrowly scoped tests.
