zhangjun0x01 commented on PR #494: URL: https://github.com/apache/flink-table-store/pull/494#issuecomment-1413856322
> purpose

Hi @JingsongLi, my purpose is to improve read and write performance wherever possible. A bloom filter can speed up queries in cases where the manifest statistics cannot rule a file out. For example, suppose we have an ORC file whose `id` column has min and max values of 0 and 100 in our manifest file, and we run a SQL query like `select * from mytable where id=50`. Because 50 lies between the min and max, the statistics alone cannot skip the file. With the bloom filter we can quickly determine whether data with `id=50` may be in the ORC file, without scanning the file.
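To make the example concrete, here is a minimal sketch of the two-step check in Python. It is only an illustration: the `BloomFilter` class, the `can_skip_file` helper, and the sample ids are hypothetical, and this is neither ORC's bloom filter format nor the Table Store API.

```python
import hashlib


class BloomFilter:
    """Toy bloom filter for illustration only (not ORC's on-disk format)."""

    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, value):
        # Derive num_hashes bit positions by salting SHA-256 with a seed.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{value}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, value):
        for pos in self._positions(value):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def might_contain(self, value):
        # False means "definitely absent"; True means "possibly present".
        return all(
            self.bits[pos // 8] & (1 << (pos % 8))
            for pos in self._positions(value)
        )


def can_skip_file(value, min_id, max_id, bloom):
    """Return True if the file certainly holds no row with id == value."""
    # Step 1: min/max statistics, as kept in the manifest.
    if value < min_id or value > max_id:
        return True
    # Step 2: the value is inside [min_id, max_id], so ask the bloom filter.
    return not bloom.might_contain(value)


# Hypothetical file: ids 0, 10, ..., 100, so min=0, max=100 and 50 is present.
ids_in_file = list(range(0, 101, 10))
bloom = BloomFilter()
for i in ids_in_file:
    bloom.add(i)

min_id, max_id = min(ids_in_file), max(ids_in_file)
print(can_skip_file(200, min_id, max_id, bloom))  # outside min/max
print(can_skip_file(50, min_id, max_id, bloom))   # present in the file
print(can_skip_file(55, min_id, max_id, bloom))   # absent, but inside min/max
```

A bloom filter never gives a false negative, so a file that contains the value is never skipped. It can give a false positive, in which case the file is still read and the benefit is simply lost for that query.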
