GitHub user WeichenXu123 commented on the issue:

    https://github.com/apache/spark/pull/19621
  
    @felixcheung Another test case fails: `spark.mlp` in SparkR. It also uses 
`RFormula`, so it also produces nondeterministic results; see class 
`MultilayerPerceptronClassifierWrapper`, line 78:
    ```
    val rFormula = new RFormula()
          .setFormula(formula)
          .setForceIndexLabel(true)
          .setHandleInvalid(handleInvalid)
    ```
    The string order cannot be set here, and the default `frequencyDesc` order 
leads to nondeterministic results.
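
    To illustrate the concern, here is a minimal sketch in plain Scala. It is 
not Spark's `StringIndexer` code; `FrequencyDescSketch` and both helpers are 
hypothetical names, and tie handling by first-seen order is an assumption made 
only for the illustration. It shows that when two labels have the same count, 
a frequency-descending order alone does not fix which label gets which index:

```scala
// Hypothetical illustration only, not Spark's StringIndexer implementation.
object FrequencyDescSketch {

  // Index labels by descending frequency. Labels with equal counts keep the
  // order in which they were first seen, so the result depends on row order.
  def frequencyDescIndex(labels: Seq[String]): Map[String, Int] = {
    val counts = labels.groupBy(identity).map { case (k, v) => k -> v.size }
    // distinct keeps first-seen order; sortBy is a stable sort
    val ordered = labels.distinct.sortBy(label => -counts(label))
    ordered.zipWithIndex.toMap
  }

  // Same ordering, but ties are broken by the label itself, so the result
  // depends only on the counts and the label values.
  def frequencyDescIndexTieBroken(labels: Seq[String]): Map[String, Int] = {
    val counts = labels.groupBy(identity).map { case (k, v) => k -> v.size }
    val ordered = labels.distinct.sortBy(label => (-counts(label), label))
    ordered.zipWithIndex.toMap
  }
}
```

    With `Seq("a", "b", "a", "b", "c")` the first helper maps `a` to index 0, 
while with `Seq("b", "a", "b", "a", "c")` it maps `a` to index 1, although the 
counts are identical. The tie-broken variant gives the same mapping for both 
inputs.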
    
    If I only modify the test case for `spark.mlp`, those tests will probably 
break again whenever the `StringIndexer` implementation changes in the future. 
What do you think?

