Nick Hryhoriev created SPARK-42034:
--------------------------------------
Summary: QueryExecutionListener and Observation API, df.observe do
not work with `foreach` action.
Key: SPARK-42034
URL: https://issues.apache.org/jira/browse/SPARK-42034
Project: Spark
Issue Type: Bug
Components: SQL
Affects Versions: 3.3.1, 3.2.2, 3.1.3
Environment: Tested locally and on YARN in cluster mode.
Spark 3.3.1, 3.2.2, and 3.1.1.
YARN 2.9.2 and 3.2.1.
Reporter: Nick Hryhoriev
The Observation API, the {{observe}} DataFrame transformation, and custom
QueryExecutionListener implementations do not work with the {{foreach}} or
{{foreachPartition}} actions.
This is because QueryExecutionListener callbacks are not triggered for queries
whose action is {{foreach}} or {{foreachPartition}}.
However, the SQL tab of the Spark UI still treats such a query as a SQL query
and shows its query plans and related details.
Here is the code to reproduce it:
https://gist.github.com/GrigorievNick/e7cf9ec5584b417d9719e2812722e6d3
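
For readers who cannot open the gist, the following is a minimal sketch of the kind of comparison described above. It is not the reporter's code. The object name, the metric name "row_stats", the local master setting, and the two-second wait are illustrative assumptions. It uses only the string-named {{observe}} overload and {{QueryExecution.observedMetrics}}, so it does not depend on the {{Observation}} class.

```scala
import org.apache.spark.sql.{Row, SparkSession}
import org.apache.spark.sql.execution.QueryExecution
import org.apache.spark.sql.functions.{count, lit}
import org.apache.spark.sql.util.QueryExecutionListener

// Hypothetical sketch: compares listener callbacks for collect() and foreach().
object ObserveForeachSketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[2]")            // assumption: local run, not YARN
      .appName("SPARK-42034-sketch")
      .getOrCreate()

    // Log every callback together with the metrics attached to the query.
    spark.listenerManager.register(new QueryExecutionListener {
      override def onSuccess(funcName: String, qe: QueryExecution, durationNs: Long): Unit =
        println(s"onSuccess action=$funcName metrics=${qe.observedMetrics}")
      override def onFailure(funcName: String, qe: QueryExecution, exception: Exception): Unit =
        println(s"onFailure action=$funcName error=${exception.getMessage}")
    })

    val observed = spark.range(100).toDF("id")
      .observe("row_stats", count(lit(1)).as("rows"))

    // Baseline: an action for which the listener is expected to be called.
    observed.collect()

    // Action under test. The explicitly typed function avoids the overload
    // ambiguity between the Scala and Java variants of foreach.
    val noOp: Row => Unit = _ => ()
    observed.foreach(noOp)

    // Listener events are delivered asynchronously, so wait briefly
    // before shutting down (crude, but enough for a sketch).
    Thread.sleep(2000)
    spark.stop()
  }
}
```

On the affected versions, the report implies that a callback is logged for {{collect}} but none for {{foreach}}. Running the sketch against a given Spark version shows which actions actually reach the listener.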