[GitHub] [spark] zhengruifeng opened a new pull request, #41180: [SPARK-43502][PYTHON][CONNECT] `DataFrame.drop` should support empty column

via GitHub Mon, 15 May 2023 20:19:11 -0700


zhengruifeng opened a new pull request, #41180:
URL: https://github.com/apache/spark/pull/41180


   ### What changes were proposed in this pull request?
   Make `DataFrame.drop` support empty column
   
   
   ### Why are the changes needed?
   to be consistent with vanilla PySpark
   
   
   ### Does this PR introduce _any_ user-facing change?
   yes
   
   ```
   In [1]: df = spark.createDataFrame([(1, 21), (2, 30)], ("id", "age"))
   
   In [2]: df.drop()
   ```
   
   before:
   ```
   In [2]: df.drop()
   ---------------------------------------------------------------------------
   PySparkValueError                         Traceback (most recent call last)
   Cell In[2], line 1
   ----> 1 df.drop()
   
   File ~/Dev/spark/python/pyspark/sql/connect/dataframe.py:449, in 
DataFrame.drop(self, *cols)
       444     raise PySparkTypeError(
       445         error_class="NOT_COLUMN_OR_STR",
       446         message_parameters={"arg_name": "cols", "arg_type": 
type(cols).__name__},
       447     )
       448 if len(_cols) == 0:
   --> 449     raise PySparkValueError(
       450         error_class="CANNOT_BE_EMPTY",
       451         message_parameters={"item": "cols"},
       452     )
       454 return DataFrame.withPlan(
       455     plan.Drop(
       456         child=self._plan,
      (...)
       459     session=self._session,
       460 )
   
   PySparkValueError: [CANNOT_BE_EMPTY] At least one cols must be specified.
   ```
   
   after
   ```
   In [2]: df.drop()
   Out[2]: DataFrame[id: bigint, age: bigint]
   ```
   
   
   ### How was this patch tested?
   enabled UT


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] zhengruifeng opened a new pull request, #41180: [SPARK-43502][PYTHON][CONNECT] `DataFrame.drop` should support empty column

Reply via email to