Github user WeichenXu123 commented on the issue:
https://github.com/apache/spark/pull/19621
@felixcheung Another failing test case is `spark.mlp` in SparkR. It also uses
`RFormula`, so it also produces nondeterministic results; see class
`MultilayerPerceptronClassifierWrapper`, line 78:
```
val rFormula = new RFormula()
  .setFormula(formula)
  .setForceIndexLabel(true)
  .setHandleInvalid(handleInvalid)
```
The wrapper cannot set the string order, and the default `frequencyDesc` order
produces nondeterministic results.
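To illustrate the concern, here is a minimal sketch in plain Scala, not the actual `StringIndexer` code. It assumes the instability comes from labels with equal frequency, whose relative order a frequency-only sort does not pin down. `LabelOrderSketch` and `indexLabels` are hypothetical names used only for this example. With an explicit tie-break, the index no longer depends on row order:

```scala
// Hypothetical helper, not Spark code: build a label -> index map,
// most frequent label first.
object LabelOrderSketch {
  // Sort by descending frequency, then alphabetically on ties, so the
  // result does not depend on the order in which labels are seen.
  def indexLabels(labels: Seq[String]): Map[String, Int] = {
    val counts: Map[String, Int] =
      labels.groupBy(identity).map { case (label, group) => (label, group.size) }
    counts.toSeq
      .sortBy { case (label, count) => (-count, label) }
      .map { case (label, _) => label }
      .zipWithIndex
      .toMap
  }
}
```

Without the second sort key, labels with the same count could come out in either order, which is the kind of change a test with hard-coded expected values would trip over.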
If I only modify the test case for `spark.mlp`, those tests will probably
break again whenever the `StringIndexer` implementation changes. What do you
think?