Github user MLnick commented on a diff in the pull request:
https://github.com/apache/spark/pull/19715#discussion_r154571718
--- Diff:
mllib/src/main/scala/org/apache/spark/ml/feature/QuantileDiscretizer.scala ---
@@ -105,9 +107,11 @@ private[feature] trait QuantileDiscretizerBase extends
Params
* possible that the number of buckets used will be smaller than this
value, for example, if there
* are too few distinct values of the input to create enough distinct
quantiles.
* Since 2.3.0,
- * `QuantileDiscretizer` can also map multiple columns at once. Whether it
goes to map a column or
- * multiple columns, it depends on which parameter of `inputCol` and
`inputCols` is set. When both
- * are set, a log warning will be printed and by default it chooses
`inputCol`.
+ * `QuantileDiscretizer ` can map multiple columns at once by setting the
`inputCols` parameter.
+ * Note that when both the `inputCol` and `inputCols` parameters are set,
a log warning will be
+ * printed and only `inputCol` will take effect, while `inputCols` will be
ignored. To specify
+ * the number of bucketsfor each column , the `numBucketsArray ` parameter
can be set, or if the
--- End diff --
"bucketsfor" -> "buckets for"
and remove the leading space from " number of buckets ..." on next line
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]