Hi all,

In https://github.com/apache/spark/pull/47233, we are looking to add a
Spark DataFrame API that is functionally equivalent to Spark SQL's UPDATE
statement.

There are open discussions on the PR about the location and format of the
API, and we wanted to ask on the dev list to get more opinions.

One consideration is that UPDATE in SQL is an isolated, terminal operation
that works only on DSV2 tables and cannot be chained to other operations.

I made a quick write-up of the background and the options under discussion in
https://docs.google.com/document/d/1AjkxOU06pFEzFmSbepfxdHoUGtvNAk6X1WY3zHGTW_o/edit.
It is my first one, so please let me know if I missed something.

I look forward to hearing thoughts from more Spark devs, whether on the PR,
in the document, or in reply to this email.

Thank you,
Szehon
