nverdhan commented on issue #3008:
URL: https://github.com/apache/hudi/issues/3008#issuecomment-1233078942

   The second problem still exists.
   
   Setting 
   ```
   
hoodie.datasource.write.keygenerator.class=org.apache.hudi.keygen.NonpartitionedKeyGenerator
   
hoodie.datasource.hive_sync.partition_extractor_class=org.apache.hudi.hive.NonPartitionedExtractor
   ```
   for non partitioned data doesn't show data after syncing to Glue. Querying 
data shows empty results while the parquet file has all the columns. The schema 
in Glue shows "Partition(0)" against the column it was synced but probably 
doesn't show data because all partitions are empty. If I edit the Glue table 
schema and remove Partition field, it starts showing up the data correctly.
   
   The workaround for me was to use a 
`org.apache.hudi.keygen.SimpleKeyGenerator` with a constant column. (I was 
using Debezium, so I had `db_shard_source_partition` column to partition with)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to