Reo-LEI commented on issue #3118:
URL: https://github.com/apache/iceberg/issues/3118#issuecomment-931215591


   > > Are you sure that this is the right approach? It seems to me that if you 
have so many delete files that you need to read them in parallel, you should 
rewrite and merge the delete files into data files.
   > 
   > @rdblue I think this optimization is needed. On the one hand, users usually 
need to query the latest data in the streaming case, and that data may not have 
been rewritten yet. On the other hand, as you said, the best way is to rewrite 
and merge the delete files into data files, but the rewrite itself depends on 
reading them. So we should optimize the `DeleteFilter` and read delete files in 
parallel to speed up both the read and the rewrite.
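
As a rough illustration of the idea (this is not Iceberg's actual `DeleteFilter` code), the sketch below reads several delete files concurrently and merges the deleted positions before filtering rows. The names `read_delete_file`, `load_deletes_in_parallel`, and `apply_deletes`, as well as the dict-based stand-in for a delete file, are hypothetical and chosen only for this example.

```python
from concurrent.futures import ThreadPoolExecutor


def read_delete_file(delete_file):
    # Hypothetical stand-in for reading one delete file. A real reader would
    # open the file and decode its rows; here each "file" is a dict that
    # already holds the deleted row positions.
    return set(delete_file["deleted_positions"])


def load_deletes_in_parallel(delete_files, max_workers=4):
    """Read every delete file concurrently and merge the deleted positions."""
    if not delete_files:
        return set()
    deleted = set()
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map yields results in input order and re-raises any
        # exception thrown by a worker.
        for positions in pool.map(read_delete_file, delete_files):
            deleted |= positions
    return deleted


def apply_deletes(rows, deleted_positions):
    """Keep only the rows whose position is not marked as deleted."""
    return [row for pos, row in enumerate(rows) if pos not in deleted_positions]
```

The same merged set serves both a normal read and a rewrite job, which is why speeding up the load step would help both paths.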
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


