jmckenzie-dev commented on PR #2554:
URL: https://github.com/apache/cassandra/pull/2554#issuecomment-1673619426

   > <cost talking>
   Ok. So part of my reaction is to the name `cost`; if we rename that to 
`split_size` I'm a lot more ok with it. But I'd think maybe we revise the 
calculation to something like:
   `num_splits = (test_count / split_size) * $AGENT_COUNT_SCALAR`, where 
`AGENT_COUNT_SCALAR` is a user-provided fudge factor, set downstream, to 
indicate that, for a given test, they have N agents they can call up. That'd 
let people dial parallelization up and down based on their own env (e.g. if 
you have double the agents that can run a unit test at `medium_executor` vs. 
what you have available for `large_executor`, having some way to provide that 
to the scripts / reference system would be nice).
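   
   A rough sketch of that calculation, in case it helps the discussion. The 
function name, the round-up, and the floor of one split are my assumptions 
for illustration, not anything in the PR:

```python
import math


def num_splits(test_count: int, split_size: int,
               agent_count_scalar: float = 1.0) -> int:
    """Sketch of num_splits = (test_count / split_size) * AGENT_COUNT_SCALAR.

    Assumptions (not from the PR): the result is rounded up so no test is
    left without a split, and at least one split is always returned.
    """
    if test_count < 0:
        raise ValueError("test_count must be non-negative")
    if split_size <= 0 or agent_count_scalar <= 0:
        raise ValueError("split_size and agent_count_scalar must be positive")
    return max(1, math.ceil(test_count / split_size * agent_count_scalar))
```

   With 1000 tests and a `split_size` of 100, a scalar of 1.0 gives 10 splits 
and a scalar of 2.0 gives 20.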
   
   OR - we could just document that and let the users figure it out. But the 
`num_splits = num_tests / hardcoded_value` approach feels like it bakes in a 
hardcoded value that is going to be env-specific.
   
   So maybe... hm. We provide a `default_tests_per_split` (or whatever) for 
each job and document that folks may want to adjust it, and the lower they 
drive that number the more splits there are, so the more they can expect the 
suite to parallelize if agents are available?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


