Re: [PR] PARQUET-2361: Reduce failure rate of unit test [parquet-mr]

via GitHub Sat, 14 Oct 2023 09:11:26 -0700


amousavigourabi commented on PR #1170:
URL: https://github.com/apache/parquet-mr/pull/1170#issuecomment-1763024226


   @fengjiajie As the test now evaluates the false positive rate using 
significantly more samples than were used to build the filter or were 
provided as the NDV, might this push the bloom filter's observed false 
positive rate above the configured FPP? Perhaps we could try increasing 
the NDV, or maybe an adaptive bloom filter would be more appropriate? 
WDYT?
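
For a rough sense of how the expected rate moves once the number of distinct values inserted exceeds the NDV the filter was sized for, here is a minimal sketch. It uses the textbook Bloom filter approximation `p = (1 - e^(-k*n/m))^k`, not parquet-mr's actual split-block implementation, so the numbers are only indicative. The class and method names (`BloomFppSketch`, `optimalBits`, `optimalHashes`, `expectedFpp`) and the sample NDV and FPP values are hypothetical and do not come from the PR.

```java
public class BloomFppSketch {

    // Textbook sizing: number of bits m for n distinct values at target rate p,
    // m = -n * ln(p) / (ln 2)^2. This is an assumption for illustration only;
    // parquet-mr sizes its split-block filter with its own logic.
    static long optimalBits(long ndv, double fpp) {
        double ln2 = Math.log(2);
        return (long) Math.ceil(-ndv * Math.log(fpp) / (ln2 * ln2));
    }

    // Textbook choice of hash count: k = (m / n) * ln 2, at least 1.
    static int optimalHashes(long numBits, long ndv) {
        return Math.max(1, (int) Math.round((double) numBits / ndv * Math.log(2)));
    }

    // Approximate false positive rate after `inserted` distinct values:
    // p = (1 - e^(-k * n / m))^k.
    static double expectedFpp(long numBits, int numHashes, long inserted) {
        double exponent = -((double) numHashes * inserted) / numBits;
        return Math.pow(1.0 - Math.exp(exponent), numHashes);
    }

    public static void main(String[] args) {
        long ndv = 10_000;   // hypothetical NDV the filter is sized for
        double fpp = 0.01;   // hypothetical target FPP

        long bits = optimalBits(ndv, fpp);
        int hashes = optimalHashes(bits, ndv);

        // Compare the expected rate at the configured NDV with the rate
        // when more distinct values than the NDV end up in the filter.
        for (long inserted : new long[] {ndv, 2 * ndv, 5 * ndv}) {
            System.out.printf("inserted=%d expectedFpp=%.4f%n",
                    inserted, expectedFpp(bits, hashes, inserted));
        }
    }
}
```

Under this approximation the rate stays near the target while the inserted count matches the NDV and climbs quickly beyond it, which is the situation where raising the NDV or using an adaptive filter would help.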


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]
