[GitHub] spark pull request #19715: [SPARK-22397][ML]add multiple columns support to ...

MLnick Fri, 08 Dec 2017 03:42:30 -0800

Github user MLnick commented on a diff in the pull request:

    https://github.com/apache/spark/pull/19715#discussion_r154571718
  
    --- Diff: 
mllib/src/main/scala/org/apache/spark/ml/feature/QuantileDiscretizer.scala ---
    @@ -105,9 +107,11 @@ private[feature] trait QuantileDiscretizerBase extends 
Params
      * possible that the number of buckets used will be smaller than this 
value, for example, if there
      * are too few distinct values of the input to create enough distinct 
quantiles.
      * Since 2.3.0,
    - * `QuantileDiscretizer` can also map multiple columns at once. Whether it 
goes to map a column or
    - * multiple columns, it depends on which parameter of `inputCol` and 
`inputCols` is set. When both
    - * are set, a log warning will be printed and by default it chooses 
`inputCol`.
    + * `QuantileDiscretizer ` can map multiple columns at once by setting the 
`inputCols` parameter.
    + * Note that when both the `inputCol` and `inputCols` parameters are set, 
a log warning will be
    + * printed and only `inputCol` will take effect, while `inputCols` will be 
ignored. To specify
    + * the number of bucketsfor each column , the `numBucketsArray ` parameter 
can be set, or if the
    --- End diff --
    
    "bucketsfor" -> "buckets for"
    
    and remove the leading space from " number of buckets ..." on next line



---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] spark pull request #19715: [SPARK-22397][ML]add multiple columns support to ...

Reply via email to