[GitHub] [hudi] Yogashri12 opened a new issue #2017: multi-level partition

GitBox Mon, 24 Aug 2020 00:04:07 -0700


Yogashri12 opened a new issue #2017:
URL: https://github.com/apache/hudi/issues/2017



   Hi,
   I have a dataset which contain many columns.
   now i want to store this dataset based on year/month.
   
   hudi_options = {
     'hoodie.table.name': tableName,
     'hoodie.datasource.write.recordkey.field': 'ID',
     'hoodie.datasource.write.table.name': tableName,
     'hoodie.datasource.write.operation': 'upsert',
     'hoodie.datasource.write.precombine.field': 'ID',
     'hoodie.upsert.shuffle.parallelism': 2, 
     'hoodie.insert.shuffle.parallelism': 2,
     'hoodie.datasource.write.partitionpath.field':'year/month'
   }
   
   when i use partition path as year/month,it doesn't create any partition.
   
   i want my partition path as (year/month)  year--->month--->then parquet 
files.
   
   i saw your sample example where you have used (region/country/city)
   
   i am using pyspark to connect with apache Hudi
     


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]

[GitHub] [hudi] Yogashri12 opened a new issue #2017: multi-level partition

Reply via email to