GitHub user shadowmmu added a comment to the discussion: [Performance] Velox Bloom Filter Inefficiency vs. Photon at 1TB Scale
Thanks @zhouyuan for you effort, Its now all clear about the BHJ, bloom filter and Optimized query plan. For the bloom filter part, need to study Velox code if we can increase the limit without breaking anything. if you have anything that might help, please share it here. For the aggregation We have observed the difference in TPCH Q18 for instance. Here is the related query plans Velox <img width="803" height="844" alt="image" src="https://github.com/user-attachments/assets/99983508-7657-42f1-bccd-0e33fcb9d5d1" /> Photon <img width="740" height="811" alt="image" src="https://github.com/user-attachments/assets/ff14863b-f47e-4beb-beb8-da57f8de7302" /> GitHub link: https://github.com/apache/incubator-gluten/discussions/11554#discussioncomment-15686574 ---- This is an automatically sent email for [email protected]. To unsubscribe, please send an email to: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
