Dandandan commented on code in PR #8951: URL: https://github.com/apache/arrow-rs/pull/8951#discussion_r2704873894
########## arrow-select/src/filter.rs: ########## @@ -37,9 +37,9 @@ use arrow_schema::*; /// [`SlicesIterator`] to copy ranges of values. Otherwise iterate /// over individual rows using [`IndexIterator`] /// -/// Threshold of 0.8 chosen based on <https://dl.acm.org/doi/abs/10.1145/3465998.3466009> +/// Threshold of 0.9 chosen based on benchmarking results /// -const FILTER_SLICES_SELECTIVITY_THRESHOLD: f64 = 0.8; +const FILTER_SLICES_SELECTIVITY_THRESHOLD: f64 = 0.9; Review Comment: We don't run with the same machine though I think @alamb is a Intel CPU and my local CPU. The details of our kernels could be very different from the paper, so I think we mainly could go with benchmarking (and improve the implementation of our code to better generate the instructions). -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
