kevinwilfong opened a new issue, #10568: URL: https://github.com/apache/incubator-gluten/issues/10568
### Description

Today we pass only the columns we expect to read from the table as the "dataColumns" parameter of the HiveTableHandle. What HiveTableHandle expects to receive there is the full schema of the table being read.

This mostly works today because, for the file formats we support (Parquet and ORC), we hard-code the configs that force file columns to be mapped to table columns by name, so the missing columns aren't needed. However, to support mapping file columns to table columns by index (an option in Velox), or to support other file formats that are index-based (e.g. Text), we need to pass the table schema.

This shouldn't affect the current behavior, since the table schema is just a superset of what we're currently passing.

### Gluten version

None
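
To illustrate why index-based mapping needs the full table schema, here is a minimal sketch in Python. It is a toy model only: `read_by_name`, `read_by_index`, and the sample schema are hypothetical and are not Velox or Gluten APIs. It assumes an index-based reader treats the position of a column in "dataColumns" as its position in the file.

```python
# Toy model of the two mapping strategies. All names here are hypothetical
# and exist only for illustration; none of this is Velox or Gluten code.

def read_by_name(file_columns, file_row, requested):
    """Resolve each requested column by looking its name up in the file schema."""
    position = {name: i for i, name in enumerate(file_columns)}
    return {name: file_row[position[name]] for name in requested}


def read_by_index(data_columns, file_row, requested):
    """Resolve each requested column by its ordinal position in data_columns."""
    position = {name: i for i, name in enumerate(data_columns)}
    return {name: file_row[position[name]] for name in requested}


table_schema = ["id", "name", "price"]  # full table schema
file_columns = ["id", "name", "price"]  # column order stored in the file
file_row = [7, "widget", 9.5]           # one row of file data
requested = ["id", "price"]             # columns the query actually reads

# By name: the requested columns alone are enough to find the right values.
by_name = read_by_name(file_columns, file_row, requested)

# By index, given only the requested columns: "price" is assumed to sit at
# position 1, so it picks up the value that belongs to "name".
by_index_pruned = read_by_index(requested, file_row, requested)

# By index, given the full table schema: positions line up with the file.
by_index_full = read_by_index(table_schema, file_row, requested)
```

In this model, passing the full schema leaves the name-based result unchanged, which matches the expectation above that the change is a superset of the current behavior.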
