[GitHub] [spark] xinrong-databricks opened a new pull request #32596: [SPARK-35338][PYTHON] Separate arithmetic operations into data type based structures

GitBox Wed, 19 May 2021 17:07:34 -0700


xinrong-databricks opened a new pull request #32596:
URL: https://github.com/apache/spark/pull/32596

### What changes were proposed in this pull request?

The PR is proposed for **pandas APIs on Spark**, in order to separate
arithmetic operations shown as below into data-type-based structures.
`__add__, __sub__, __mul__, __truediv__, __floordiv__, __pow__, __mod__,
__radd__, __rsub__, __rmul__, __rtruediv__, __rfloordiv__, __rpow__,__rmod__`

DataTypeOps and subclasses are introduced.

The existing behaviors of each arithmetic operation should be preserved.

### Why are the changes needed?

Currently, the same arithmetic operation of all data types is defined in one
function, so it’s difficult to extend the behavior change based on the data
types.

Introducing DataTypeOps would be the foundation for [pandas APIs on Spark:
Separate basic operations into data type based
structures.](https://docs.google.com/document/d/12MS6xK0hETYmrcl5b9pX5lgV4FmGVfpmcSKq--_oQlc/edit?usp=sharing).

### Does this PR introduce _any_ user-facing change?

No.

### How was this patch tested?

Tests are introduced under pyspark.pandas.tests.data_type_ops. One test file
per DataTypeOps class.

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] [spark] xinrong-databricks opened a new pull request #32596: [SPARK-35338][PYTHON] Separate arithmetic operations into data type based structures

Reply via email to