trikota-kc edited a comment on issue #2258: URL: https://github.com/apache/hudi/issues/2258#issuecomment-740555412
To anyone struggling with data type issues caused by incorrect column ordering when querying Deltastreamer output from Presto, this is what worked for me:

- Use the latest EMR release, with the jars that AWS provides on the master node.
- Set hive.parquet.use-column-names = true, then restart presto-server.
- Fix the column order with hoodie.deltastreamer.transformer.sql. When using DMS with the transformer, make sure the "Op" column comes first.
- The partition column MUST come last.
- You can also try casting the partition column to string in hoodie.deltastreamer.transformer.sql.
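As a rough illustration of the ordering advice, here is a minimal Python sketch that assembles a value for hoodie.deltastreamer.transformer.sql with "Op" first and the partition column last, cast to string. The helper name build_transformer_sql and the column names (id, name, created_date) are hypothetical, and the sketch assumes the SQL transformer's usual <SRC> placeholder for the incoming table; adjust it to your own schema.

```python
def build_transformer_sql(columns, partition_column, op_column="Op"):
    """Build a SELECT with op_column first and partition_column last.

    Assumption: the query runs against the <SRC> placeholder table used by
    the Deltastreamer SQL transformer. Column names are illustrative only.
    """
    if partition_column not in columns:
        raise ValueError(f"partition column {partition_column!r} not in columns")

    select_list = []
    # DMS change-type column goes first, if the source has one.
    if op_column in columns:
        select_list.append(op_column)
    # Remaining data columns keep their relative order.
    select_list.extend(
        c for c in columns if c not in (op_column, partition_column)
    )
    # Partition column goes last, cast to string.
    select_list.append(
        f"CAST({partition_column} AS STRING) AS {partition_column}"
    )
    return "SELECT " + ", ".join(select_list) + " FROM <SRC>"


# Hypothetical source columns, in whatever order they arrive.
sql = build_transformer_sql(["id", "created_date", "Op", "name"], "created_date")
print("hoodie.deltastreamer.transformer.sql=" + sql)
```

The printed line can be pasted into the Deltastreamer properties file. Generating the select list this way just keeps the "Op first, partition column last" rule in one place if the schema changes.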
