asp437 opened a new issue, #8229:
URL: https://github.com/apache/iceberg/issues/8229

   ### Query engine
   
   Spark
   
   ### Question
   
   I use a `MERGE INTO` query with a condition in the `WHEN MATCHED` clause to 
update values in an Iceberg table via Spark. I want to collect metrics for such 
queries to track how many rows were updated, inserted, and deleted.
   
   Is there a way to get the number of rows affected by a `MERGE INTO` query?
   
   I tried looking at the snapshot information, but it contains the number of 
rows in the affected Parquet files. These values can be much higher than the 
number of rows actually changed, e.g. when a query updates half of the rows but, 
due to the storage configuration, all data files are rewritten. This is a useful 
metric too, but I also need the number of rows changed according to the query 
logic, not the physical representation.
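
To illustrate the gap, here is a minimal sketch of reading the physical counts from a snapshot summary. It assumes the summary map carries the `added-records` and `deleted-records` keys as strings; the helper name `physical_row_counts` and the sample numbers are hypothetical and only stand in for a case where a file of 1000 rows is fully rewritten to update 500 of them.

```python
def physical_row_counts(summary):
    """Return file-level row counts from an Iceberg snapshot summary map.

    The summary can be fetched in Spark from the `snapshots` metadata table,
    e.g. SELECT summary FROM <catalog>.<db>.<table>.snapshots
    (the table name here is a placeholder). Values arrive as strings.
    """
    added = int(summary.get("added-records", 0))
    deleted = int(summary.get("deleted-records", 0))
    return {"added": added, "deleted": deleted, "net": added - deleted}


# Hypothetical summary: the whole 1000-row file is rewritten even though
# the MERGE logically updated only 500 rows.
sample_summary = {
    "added-records": "1000",
    "deleted-records": "1000",
}

counts = physical_row_counts(sample_summary)
print(counts)
```

Both counts reflect every row in the rewritten files, so the number of rows logically updated (500 in this hypothetical case) cannot be recovered from these values alone.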


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

