HyukjinKwon commented on a change in pull request #29743:
URL: https://github.com/apache/spark/pull/29743#discussion_r487597235
##########
File path:
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/CountMinSketchAgg.scala
##########
@@ -37,15 +37,22 @@ import org.apache.spark.util.sketch.CountMinSketch
* @param confidenceExpression confidence, must be positive and less than 1.0
* @param seedExpression random seed
*/
+// scalastyle:off nonascii
@ExpressionDescription(
usage = """
_FUNC_(col, eps, confidence, seed) - Returns a count-min sketch of a
column with the given esp,
confidence and seed. The result is an array of bytes, which can be
deserialized to a
`CountMinSketch` before usage. Count-min sketch is a probabilistic data
structure used for
cardinality estimation using sub-linear space.
""",
+ examples = """
+ Examples:
+ > SELECT _FUNC_(col, 0.1d, 0.9d, 0) FROM VALUES (1), (2), (1) AS
tab(col);
+ ]�Z0jl����M��_
+ """,
Review comment:
cc @viirya FYI
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]