[ 
https://issues.apache.org/jira/browse/SPARK-54264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Siying Dong updated SPARK-54264:
--------------------------------
    Description: The dedup operator currently calls RocksDB's Get() only to 
check whether a key is already present. It can call keyExists() instead, which 
serves exactly that purpose and is faster.

> DeDup Operator can use RocksDB's keyExists()
> --------------------------------------------
>
>                 Key: SPARK-54264
>                 URL: https://issues.apache.org/jira/browse/SPARK-54264
>             Project: Spark
>          Issue Type: Task
>          Components: Structured Streaming
>    Affects Versions: 4.1.0
>            Reporter: Siying Dong
>            Priority: Major
>
> The dedup operator currently calls RocksDB's Get() only to check whether a 
> key is already present. It can call keyExists() instead, which serves exactly 
> that purpose and is faster.
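
A minimal sketch of the idea described above, not Spark's or RocksDB's actual
code. CountingStore, key_exists(), dedup() and PLACEHOLDER are hypothetical
names for an in-memory stand-in; the counter only models the assumption that a
Get()-style lookup hands back the stored value while an existence check does
not.

```python
PLACEHOLDER = b"\x01"  # hypothetical marker value stored for every seen key


class CountingStore:
    """In-memory stand-in for a key-value state store (illustration only)."""

    def __init__(self):
        self._data = {}
        self.values_materialized = 0  # how many times get() returned a value

    def put(self, key: bytes, value: bytes) -> None:
        self._data[key] = value

    def get(self, key: bytes):
        # Get()-style lookup: returns the stored value, or None if absent.
        value = self._data.get(key)
        if value is not None:
            self.values_materialized += 1
        return value

    def key_exists(self, key: bytes) -> bool:
        # Existence check: answers yes/no without returning the value.
        return key in self._data


def dedup(store: CountingStore, keys, use_key_exists: bool):
    """Emit each key the first time it is seen; drop later duplicates."""
    emitted = []
    for key in keys:
        if use_key_exists:
            seen = store.key_exists(key)
        else:
            seen = store.get(key) is not None  # value is fetched, then ignored
        if not seen:
            store.put(key, PLACEHOLDER)
            emitted.append(key)
    return emitted
```

Both variants emit the same rows; the only difference in this sketch is that
the get()-based path fetches a value it never uses.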



