jagadesh-kiran commented on a change in pull request #23946:
[SPARK-26860][PySpark] Fix for RangeBetween docs appear to be wrong
URL: https://github.com/apache/spark/pull/23946#discussion_r261948172
##########
File path: python/pyspark/sql/window.py
##########
@@ -127,6 +156,37 @@ def rangeBetween(start, end):
and ``Window.currentRow`` to specify special boundary values, rather
than using integral
values directly.
+ A range-based boundary is based on the actual value of the ORDER BY
+ expression(s). An offset is used to alter the value of the ORDER BY
expression, for
+ instance if the current order by expression has a value of 10 and the
lower bound offset
+ is -3, the resulting lower bound for the current row will be 10 - 3 =
7. This however puts a
+ number of constraints on the ORDER BY expressions: there can be only
one expression and this
+ expression must have a numerical data type. An exception can be made
when the offset is
+ unbounded, because no value modification is needed, in this case
multiple and non-numeric
+ ORDER BY expression are allowed.
+
+ {{{
Review comment:
Thanks @HyukjinKwon will update the same
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
With regards,
Apache Git Services
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]