[GitHub] [arrow] vertexclique edited a comment on pull request #8635: ARROW-10551: [Rust] Fix unreproducible benches

2020-11-11 Thread GitBox


vertexclique edited a comment on pull request #8635:
URL: https://github.com/apache/arrow/pull/8635#issuecomment-725422469


   > If the argument is that two runs of the same benchmark do not agree due to 
randomness, then I do not understand how having the same seed per thread or 
different seeds per thread changes this behavior: the seed will still be set 
when the process starts and will be different per run.
   
   I don't think that it should be different per run per benchmark, that's why 
it is variating.
   
https://github.com/bheisler/criterion.rs/blob/4499ee5e32f59c0c15eb046dd7a2dfd0c15a96f7/src/stats/univariate/sample.rs#L210
   
   > Seeding the random number generators can be done using the existing rand 
crate, in the following way (this took me longer to figure out from the rand 
crate's documentation than I would like to admit):
   
   Yeah, I didn't want to write a const compiled seed and people forget to use 
the same seed every time. My implementation does some extra faster stuff 
compared to rand too. Here:
   https://docs.rs/bastion-utils/0.3.2/src/bastion_utils/math.rs.html#4-28
   
   But I am ok with that too if we won't forget it places :smile: 



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [arrow] vertexclique edited a comment on pull request #8635: ARROW-10551: [Rust] Fix unreproducible benches

2020-11-11 Thread GitBox


vertexclique edited a comment on pull request #8635:
URL: https://github.com/apache/arrow/pull/8635#issuecomment-725422469


   > If the argument is that two runs of the same benchmark do not agree due to 
randomness, then I do not understand how having the same seed per thread or 
different seeds per thread changes this behavior: the seed will still be set 
when the process starts and will be different per run.
   
   I don't think that it should be different per run per benchmark, that's why 
it is variating.
   
https://github.com/bheisler/criterion.rs/blob/4499ee5e32f59c0c15eb046dd7a2dfd0c15a96f7/src/stats/univariate/sample.rs#L210
   
   > Seeding the random number generators can be done using the existing rand 
crate, in the following way (this took me longer to figure out from the rand 
crate's documentation than I would like to admit):
   
   Yeah, I didn't want to write a const compiled seed and people forget to use 
the same seed every time. My implementation does some extra faster stuff 
compared to rand too. Here:
   https://docs.rs/bastion-utils/0.3.2/src/bastion_utils/math.rs.html#4-28
   
   But I am ok with that too if we won't forget it in benches :smile: 



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [arrow] vertexclique edited a comment on pull request #8635: ARROW-10551: [Rust] Fix unreproducible benches

2020-11-11 Thread GitBox


vertexclique edited a comment on pull request #8635:
URL: https://github.com/apache/arrow/pull/8635#issuecomment-725422469


   > If the argument is that two runs of the same benchmark do not agree due to 
randomness, then I do not understand how having the same seed per thread or 
different seeds per thread changes this behavior: the seed will still be set 
when the process starts and will be different per run.
   
   I don't think that it should be different per run per benchmark, that's why 
it is variating.
   
https://github.com/bheisler/criterion.rs/blob/4499ee5e32f59c0c15eb046dd7a2dfd0c15a96f7/src/stats/univariate/sample.rs#L210
   
   > Seeding the random number generators can be done using the existing rand 
crate, in the following way (this took me longer to figure out from the rand 
crate's documentation than I would like to admit):
   
   Yeah, I didn't want to write a const compiled seed and people forget to use 
the same seed every time. My implementation does some extra faster stuff 
compared to rand too. Here:
   https://docs.rs/bastion-utils/0.3.2/src/bastion_utils/math.rs.html#4-28
   
   But I am ok with that too.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [arrow] vertexclique edited a comment on pull request #8635: ARROW-10551: [Rust] Fix unreproducible benches

2020-11-11 Thread GitBox


vertexclique edited a comment on pull request #8635:
URL: https://github.com/apache/arrow/pull/8635#issuecomment-725422469


   > If the argument is that two runs of the same benchmark do not agree due to 
randomness, then I do not understand how having the same seed per thread or 
different seeds per thread changes this behavior: the seed will still be set 
when the process starts and will be different per run.
   
   I don't think that it should be different per run per benchmark, that's why 
it is variating.
   
https://github.com/bheisler/criterion.rs/blob/4499ee5e32f59c0c15eb046dd7a2dfd0c15a96f7/src/stats/univariate/sample.rs#L210
   
   > Seeding the random number generators can be done using the existing rand 
crate, in the following way (this took me longer to figure out from the rand 
crate's documentation than I would like to admit):
   
   Yeah, I didn't want to write a const compiled seed and people forget to use 
the same seed every time. My implementation does some extra faster stuff 
compared to rand too. Here:
   https://docs.rs/bastion-utils/0.3.2/src/bastion_utils/math.rs.html#4-28



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org