nsivabalan commented on issue #2987:
URL: https://github.com/apache/hudi/issues/2987#issuecomment-850068596


   I assume every commit updates all records. and by default if you read a 
table in hudi format in spark, it will return the latest snaphot and hence you 
are seeing just the last commit. 
   I can think of some hack that you can do. But not sure if there are some 
other elegant ways. 
   You can read the table in parquet format and retrieve all distinct commit 
times and then go from there.  // don't think this is advisable though.
   Also, if you have access to hudi-cli, you should be able to retrieve all 
commit info. 
   @n3nash @leesf @yanghua : any other ideas? 
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to