hughfdjackson commented on issue #1979:
URL: https://github.com/apache/hudi/issues/1979#issuecomment-679017460


   Hi @bvaradar - thanks for the reply! And for the suggestion.
   
   In our use case, we're interested both in incremental reads of material 
changes and in using the Hudi table with regular snapshot queries. I would 
expect 30-50% incremental reads, and 50-70% snapshot queries.
   
   If I'm understanding correctly, your suggestion would essentially lead to an 
event log of all material changes to an entity. If you do a snapshot query 
against that data, you'd end up with lots of duplicates, so each query would 
need to include de-duplication to reproduce a materialised view with the 
latest data for each entity.
   
   Is that right?
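   
   For illustration, the de-duplication described above could look like the 
following minimal sketch. It is plain Python over in-memory rows, not Hudi or 
Spark code, and the field names `entity_id` and `updated_at` and the helper 
`latest_per_entity` are hypothetical names chosen only for this example.

```python
from typing import Any, Dict, Iterable, List


def latest_per_entity(
    events: Iterable[Dict[str, Any]],
    key: str = "entity_id",        # hypothetical record-key field
    ordering: str = "updated_at",  # hypothetical ordering field
) -> List[Dict[str, Any]]:
    """Collapse an event log to the most recent row per entity."""
    latest: Dict[Any, Dict[str, Any]] = {}
    for event in events:
        current = latest.get(event[key])
        # Keep the row with the highest ordering value; on ties the later row wins.
        if current is None or event[ordering] >= current[ordering]:
            latest[event[key]] = event
    return list(latest.values())


# An event log holding two material changes for entity "a" and one for "b".
events = [
    {"entity_id": "a", "updated_at": 1, "value": "a-v1"},
    {"entity_id": "b", "updated_at": 2, "value": "b-v1"},
    {"entity_id": "a", "updated_at": 3, "value": "a-v2"},
]

snapshot = latest_per_entity(events)
```

   Every snapshot-style query over the event log would need an equivalent step 
before its results match a table that holds only the latest row per entity.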



