viirya commented on a change in pull request #25812: [SPARK-22796][PYTHON][ML] 
Add multiple columns support to PySpark QuantileDiscretizer
URL: https://github.com/apache/spark/pull/25812#discussion_r325393980
 
 

 ##########
 File path: python/pyspark/ml/feature.py
 ##########
 @@ -1959,13 +1959,18 @@ def getDegree(self):
 
 
 @inherit_doc
-class QuantileDiscretizer(JavaEstimator, HasInputCol, HasOutputCol, 
HasHandleInvalid,
-                          JavaMLReadable, JavaMLWritable):
+class QuantileDiscretizer(JavaEstimator, HasInputCol, HasOutputCol, 
HasInputCols, HasOutputCols,
+                          HasHandleInvalid, JavaMLReadable, JavaMLWritable):
     """
     `QuantileDiscretizer` takes a column with continuous features and outputs 
a column with binned
     categorical features. The number of bins can be set using the 
:py:attr:`numBuckets` parameter.
     It is possible that the number of buckets used will be less than this 
value, for example, if
     there are too few distinct values of the input to create enough distinct 
quantiles.
+    Since 3.0.0, :py:class:`QuantileDiscretizer` can map multiple columns at 
once by setting the
 
 Review comment:
   above there is a ``` `QuantileDiscretizer` ```, add :py:class: to make it 
consistent?

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to