fafacao86 opened a new issue, #7189:
URL: https://github.com/apache/paimon/issues/7189

   ### Search before asking
   
   - [x] I searched in the [issues](https://github.com/apache/paimon/issues) and found nothing similar.
   
   
   ### Motivation
   
   
   Reference: https://cwiki.apache.org/confluence/display/PAIMON/PIP-23+Introduce+bitmap+file+index

   A bitmap file index tracks the row positions of each distinct value in a column.
   Suppose we have a primary-key table with a non-primary-key column `type`, and we enable the bitmap file index on `type`. In Spark, a statement such as `DELETE FROM table WHERE type IN ('type1')` seems to first scan the Parquet files to collect the `FilePath` and row positions of the matching rows, then shuffle by `FilePath` to update the deletion vectors (DV).
   Could we use the bitmap index directly to update the DV, skipping both the file scan and the shuffle?
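
To make the idea concrete, here is a minimal sketch in Python. It is a toy in-memory model, not Paimon's actual API: the names `BitmapIndex`, `positions_to_delete`, and `update_deletion_vectors` are hypothetical, bitmaps are modeled as plain sets of file-local row positions, and it assumes every data file has a bitmap index for the filtered column.

```python
from typing import Dict, Iterable, Set

# Hypothetical model: per data file, the bitmap index maps each distinct
# column value to the set of file-local row positions holding that value.
BitmapIndex = Dict[str, Set[int]]


def positions_to_delete(index: BitmapIndex, values: Iterable[str]) -> Set[int]:
    """Union the bitmaps of all values named in the IN (...) predicate."""
    result: Set[int] = set()
    for value in values:
        result |= index.get(value, set())
    return result


def update_deletion_vectors(
    file_indexes: Dict[str, BitmapIndex],
    deletion_vectors: Dict[str, Set[int]],
    values: Iterable[str],
) -> Dict[str, Set[int]]:
    """Return new per-file deletion vectors with matching rows marked deleted.

    Only the index is consulted; no data file is read and no shuffle by
    file path is needed, because each file is handled independently.
    """
    values = list(values)
    # Copy so the existing deletion vectors are left untouched.
    updated = {path: set(dv) for path, dv in deletion_vectors.items()}
    for path, index in file_indexes.items():
        matched = positions_to_delete(index, values)
        if matched:
            updated.setdefault(path, set()).update(matched)
    return updated


# Example: DELETE FROM table WHERE type IN ('type1')
file_indexes = {
    "data-0.parquet": {"type1": {0, 3}, "type2": {1, 2}},
    "data-1.parquet": {"type2": {0, 1}},
}
existing_dv = {"data-0.parquet": {2}}
new_dv = update_deletion_vectors(file_indexes, existing_dv, ["type1"])
```

In this sketch the new deletion vector for a file is simply the union of its existing deletion vector and the bitmaps of the matching values. Whether the real index can be trusted this way for a primary-key table (for example, when a file lacks the index) is exactly the open question.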
   
   ### Solution
   
   _No response_
   
   ### Anything else?
   
   _No response_
   
   ### Are you willing to submit a PR?
   
   - [ ] I'm willing to submit a PR!

