siddharthteotia opened a new pull request #5077: Reduce the non-determinism in 
lucene integration test
URL: https://github.com/apache/incubator-pinot/pull/5077
 
 
   The lucene cluster integration test is written to test both offline and 
realtime (primarily) text index functionality. The goal of the test is to be 
able to see an increasingly number of hits/matches in the index for the same 
query (run in a tight loop 2000 times).
   
   The test runs the query 2000 times and starts checking for more than 0 hits 
from 300th query onwards. It also compares the number of hits (count *) with 
that of previous query. However, based on how fast the queries are completed, 
how quickly the documents are ingested and how quickly the realtime segments 
get refreshed, the check fails and we start to see greater than 0 hits a little 
after 300th query.
   
   The current PR puts a 10ms sleep time between queries to reduce the 
non-determinism. 
   
   The test has blocked our internal release builds. I'd like to think about to 
how to make this more deterministic and address that in a follow-up. For now, 
this fix should get us going.
   
   Ideally, the desired goal with the test is to see the final total number of 
hits (17k) only for the last few hundred queries. Until then we should only be 
seeing an increasing number of hits. The sleep time compromises this goal since 
we are going to see the total number of hits sooner. I will address this later.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to