sanjiv1980 commented on issue #1067: How to do multiple partition
URL: https://github.com/apache/incubator-hudi/issues/1067#issuecomment-561023870

@vinothchandar here are my details:

1. The source data is a kind of clickstream. It is processed by an existing job (data pipeline) for enrichment and written to a location for further processing. It is in snappy.parquet format.
2. The data is already in S3 (snappy.parquet). I have to write a data pipeline that does bulk updates/deletes based on user identification. (So it is a job that would first form a Hudi dataset and then run the required updates/deletes on it.)
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]

With regards,
Apache Git Services
