bvaradar commented on issue #2406:
URL: https://github.com/apache/hudi/issues/2406#issuecomment-755176463


   I think the command line parameters are not passed correctly
   
   --hoodie-conf 
hoodie.datasource.write.keygenerator.class=org.apache.hudi.keygen.CustomKeyGenerator
   --hoodie-conf 
hoodie.datasource.write.recordkey.field=request_timestamp:TIMESTAMP
   --hoodie-conf  
hoodie.deltastreamer.source.dfs.root=s3://athena-examples-us-west-2/elb/parquet/year=2015/month=1/day=1
   --hoodie-conf  
hoodie.datasource.write.partitionpath.field=request_timestamp:TIMESTAMP
   
   On a related note, your record key and partition path are both same. This is 
ok if you are testing out a sample dataset but wont scale in real world as you 
would end-up with one record per directory.
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to