MaxGekk commented on issue #20793: [WIP][SPARK-23643] Shrinking the buffer in hashSeed up to size of the seed parameter URL: https://github.com/apache/spark/pull/20793#issuecomment-470864451 There are still failing tests: - PowerIterationClusteringSuite I don't see how the tests depend on seed. It looks like it is embedded into implementation. I can only change expected results. - LogisticRegressionSuite, GeneralizedLinearRegressionSuite There is R code which generates expected results but I haven't figured out how I could run it on input data generated for another seed. Need somehow extract and save input to csv files. It would be nice to have reference code which can be just executed easily.
---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] With regards, Apache Git Services --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
