GitHub user tdas opened a pull request:
https://github.com/apache/spark/pull/18024
[SPARK-20792][SS] Support same timeout operations in mapGroupsWithState
function in batch queries as in streaming queries
## What changes were proposed in this pull request?
Currently, in the batch queries, timeout is disabled (i.e.
GroupStateTimeout.NoTimeout) which means any GroupState.setTimeout*** operation
would throw UnsupportedOperationException. This makes it weird when converting
a streaming query into a batch query by changing the input DF from streaming to
a batch DF. If the timeout was enabled and used, then the batch query will
start throwing UnsupportedOperationException.
This creates the dummy state in batch queries with the provided timeoutConf
so that it behaves in the same way.
## How was this patch tested?
Additional tests
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/tdas/spark SPARK-20792
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/18024.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #18024
commit eef789fe1fd04a98b4d82da6864ca4f4b23c2bfb
Author: Tathagata Das
Date: 2017-05-18T05:31:44Z
Fixed bug
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org