[ 
https://issues.apache.org/jira/browse/SPARK-38809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jungtaek Lim updated SPARK-38809:
---------------------------------
        Parent: SPARK-38684
    Issue Type: Sub-task  (was: Bug)

> Implement option to skip null values in symmetric hash impl of stream-stream 
> joins
> ----------------------------------------------------------------------------------
>
>                 Key: SPARK-38809
>                 URL: https://issues.apache.org/jira/browse/SPARK-38809
>             Project: Spark
>          Issue Type: Sub-task
>          Components: Structured Streaming
>    Affects Versions: 3.2.1
>            Reporter: Anish Shrigondekar
>            Assignee: Anish Shrigondekar
>            Priority: Major
>             Fix For: 3.3.0, 3.2.2
>
>
> Implement option to skip null values in symmetric hash impl of stream-stream 
> joins
>  * In the symmetric has join state manager, we can receive entries with null 
> values for a key and that caused the `removeByValue` and get iterators to 
> fail and run into the NullPointerException.
>  * This is possible if the state recovered is written from a old spark 
> version or its corrupted on disk. Since we don't have a utility to query this 
> state, we would like to provide a conf option to skip nulls for the symmetric 
> hash impl in stream stream joins.
>  



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to