Yogashri12 opened a new issue #2017:
URL: https://github.com/apache/hudi/issues/2017
Hi,
I have a dataset which contain many columns.
now i want to store this dataset based on year/month.
hudi_options = {
'hoodie.table.name': tableName,
'hoodie.datasource.write.recordkey.field': 'ID',
'hoodie.datasource.write.table.name': tableName,
'hoodie.datasource.write.operation': 'upsert',
'hoodie.datasource.write.precombine.field': 'ID',
'hoodie.upsert.shuffle.parallelism': 2,
'hoodie.insert.shuffle.parallelism': 2,
'hoodie.datasource.write.partitionpath.field':'year/month'
}
when i use partition path as year/month,it doesn't create any partition.
i want my partition path as (year/month) year--->month--->then parquet
files.
i saw your sample example where you have used (region/country/city)
i am using pyspark to connect with apache Hudi
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]