Jungtaek Lim created SPARK-45178:
------------------------------------
Summary: Fallback to use single batch executor for
Trigger.AvailableNow with unsupported sources rather than using wrapper
Key: SPARK-45178
URL: https://issues.apache.org/jira/browse/SPARK-45178
Project: Spark
Issue Type: Bug
Components: Structured Streaming
Affects Versions: 4.0.0
Reporter: Jungtaek Lim
We have observed the case where wrapper implementation of Trigger.AvailableNow (
AvailableNowDataStreamWrapper and subclasses) is not fully compatible with 3rd
party data source and brought up correctness issue.
While we could persuade 3rd party data source to support Trigger.AvailableNow,
pursuing all 3rd parties to do this is too aggressive and challenging goal we
never be able to make. Also, it may not be also possible to come up with the
wrapper implementation which would have zero issue with any arbitrary source.
As a mitigation, we want to make a slight behavioral change for such case,
falling back to single batch execution (a.k.a. Trigger.Once) rather than using
wrapper implementation. The exact behavior between Trigger.AvailableNow and
Trigger.Once are different so it's technically behavioral change, but it's
probably lot less surprised than failing the query.
For extreme case where users are confident that there will be no issue at all
on using wrapper, we will come up with a flag to provide the previous behavior.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]