alamb commented on PR #7454:
URL: https://github.com/apache/arrow-rs/pull/7454#issuecomment-2841787436

   > It helps part of the regression about the read record/skip record too 
dense, which is the original regression:
   > 
   > Here is the result for page cache without this PR: [#7363 
(comment)](https://github.com/apache/arrow-rs/issues/7363#issuecomment-2769850463)
 The regression will from Q24->28 and Q30 -> Q31.
   > 
   > Q30 / Q31 no regression now for current PR:
   > 
   > ```rust
   > │ QQuery 30    │  420.68ms │                       431.94ms │     no 
change │
   > │ QQuery 31    │  571.58ms │                       528.87ms │ +1.08x 
faster │
   > ```
   
   
https://github.com/apache/datafusion/blob/7b370e26fea75fcd17121272eec1bd9447b2cb8f/benchmarks/queries/clickbench/queries.sql#L31-L32
   
   These queries have a predicate like 
   ```sql
   WHERE "SearchPhrase" <> ''
   ```
   
   But `SearchPhrase` is not used except for filtering
   
   > But Q24 -> Q 28 still have regression, same with original result:
   > 
   > ```rust
   > │ QQuery 24    │  273.40ms │                       386.72ms │  1.41x 
slower │
   > │ QQuery 25    │  274.14ms │                       370.83ms │  1.35x 
slower │
   > │ QQuery 26    │  320.12ms │                       435.73ms │  1.36x 
slower │
   > │ QQuery 27    │  900.06ms │                      1354.63ms │  1.51x 
slower │
   > │ QQuery 28    │ 7812.82ms │                      9813.62ms │  1.26x 
slower │
   > ```
   
   
https://github.com/apache/datafusion/blob/7b370e26fea75fcd17121272eec1bd9447b2cb8f/benchmarks/queries/clickbench/queries.sql#L25-L29
   
   These queries have the same predicate
   ```sql
   WHERE "SearchPhrase" <> ''
   ```
   
   But in this case `SearchPhrase` is also used in the query
   
   ```
   SELECT "SearchPhrase" FROM hits WHERE "SearchPhrase" <> '' ORDER BY 
"EventTime" LIMIT 10;
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to