nverdhan commented on issue #3008: URL: https://github.com/apache/hudi/issues/3008#issuecomment-1233078942
The second problem still exists. Setting

```
hoodie.datasource.write.keygenerator.class=org.apache.hudi.keygen.NonpartitionedKeyGenerator
hoodie.datasource.hive_sync.partition_extractor_class=org.apache.hudi.hive.NonPartitionedExtractor
```

for non-partitioned data does not show any data after syncing to Glue. Queries return empty results even though the parquet file has all the columns. The schema in Glue shows "Partition(0)" against the column it was synced on, and queries probably return nothing because all partitions are empty. If I edit the Glue table schema and remove the Partition field, the data shows up correctly.

The workaround for me was to use `org.apache.hudi.keygen.SimpleKeyGenerator` with a constant column. (I was using Debezium, so I had a `db_shard_source_partition` column to partition on.)
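For anyone comparing the two setups, here is a rough sketch of how the option sets might differ, written as a plain Python dict builder so it runs without Spark. The table name `orders` and record key `id` are hypothetical, and the choice of `org.apache.hudi.hive.MultiPartKeysValueExtractor` for the workaround is an assumption, since the comment does not say which partition extractor was used.

```python
# Sketch only: builds Hudi writer/sync options as a dict; it does not talk to Spark or Glue.
KEYGEN = "hoodie.datasource.write.keygenerator.class"
EXTRACTOR = "hoodie.datasource.hive_sync.partition_extractor_class"


def hudi_sync_options(table, record_key, constant_partition_col=None):
    """Return Hudi options for a non-partitioned table, or for the
    constant-column workaround when constant_partition_col is given."""
    opts = {
        "hoodie.table.name": table,
        "hoodie.datasource.write.recordkey.field": record_key,
        "hoodie.datasource.hive_sync.enable": "true",
    }
    if constant_partition_col is None:
        # The configuration from the comment that synced to Glue but returned no rows.
        opts[KEYGEN] = "org.apache.hudi.keygen.NonpartitionedKeyGenerator"
        opts[EXTRACTOR] = "org.apache.hudi.hive.NonPartitionedExtractor"
    else:
        # Workaround: partition on a column that holds a single constant value.
        opts[KEYGEN] = "org.apache.hudi.keygen.SimpleKeyGenerator"
        opts["hoodie.datasource.write.partitionpath.field"] = constant_partition_col
        opts["hoodie.datasource.hive_sync.partition_fields"] = constant_partition_col
        # Assumption: the extractor is not named in the comment.
        opts[EXTRACTOR] = "org.apache.hudi.hive.MultiPartKeysValueExtractor"
    return opts


# Hypothetical table name and record key; the partition column is the one from the comment.
non_partitioned = hudi_sync_options("orders", "id")
workaround = hudi_sync_options("orders", "id", "db_shard_source_partition")

for key in sorted(workaround):
    print(f"{key}={workaround[key]}")
```

With PySpark, a dict like this would typically be passed as `df.write.format("hudi").options(**workaround)`; whether this resolves the Glue behaviour on a given Hudi version is not something the sketch can confirm.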
