[ https://issues.apache.org/jira/browse/SPARK-27494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Shixiong Zhu updated SPARK-27494:
---------------------------------
Description:
Right now Kafka source v2 doesn't support null keys or values.
* When it processes a record with a null key, the keys of all following records
in the same partition are also returned as null. This is a correctness bug.
* When it processes a record with a null value, it throws a
NullPointerException (NPE).
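The null-key symptom is what one would expect if a per-partition row buffer is reused across records and its null flags are never cleared between them. The toy model below is only a sketch of that failure pattern; it is an assumption for illustration, not the actual Spark code. `ReusedRowBuffer`, `zero_out_null_bits`, and `convert_keys` are hypothetical names.

```python
class ReusedRowBuffer:
    """Toy stand-in for a row buffer that is reused for every record."""

    def __init__(self, num_fields):
        self.null_bits = [False] * num_fields
        self.values = [None] * num_fields

    def reset(self):
        # Assumed flaw: only the payload is cleared, the null flags survive.
        self.values = [None] * len(self.values)

    def zero_out_null_bits(self):
        self.null_bits = [False] * len(self.null_bits)

    def set_null_at(self, index):
        self.null_bits[index] = True

    def write(self, index, value):
        # Writing a value does not clear a previously set null flag.
        self.values[index] = value

    def get(self, index):
        return None if self.null_bits[index] else self.values[index]


def convert_keys(keys, clear_null_bits):
    """Convert the keys of one partition, reusing a single buffer."""
    buf = ReusedRowBuffer(num_fields=1)
    out = []
    for key in keys:
        buf.reset()
        if clear_null_bits:
            buf.zero_out_null_bits()
        if key is None:
            buf.set_null_at(0)
        else:
            buf.write(0, key)
        out.append(buf.get(0))
    return out


partition_keys = [b"a", None, b"b", b"c"]
print(convert_keys(partition_keys, clear_null_bits=False))  # stale null flag
print(convert_keys(partition_keys, clear_null_bits=True))   # flags cleared
```

In this model, one null key makes every later key in the same partition read back as null unless the null flags are cleared before each record.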
The workaround is to set the SQL conf
"spark.sql.streaming.disabledV2MicroBatchReaders" to
"org.apache.spark.sql.kafka010.KafkaSourceProvider" so that the v1 source is
used.
was: Right now Kafka source v2 doesn't support null values. The issue is in
org.apache.spark.sql.kafka010.KafkaRecordToUnsafeRowConverter.toUnsafeRow, which
doesn't handle null values.
> Null keys/values don't work in Kafka source v2
> ----------------------------------------------
>
> Key: SPARK-27494
> URL: https://issues.apache.org/jira/browse/SPARK-27494
> Project: Spark
> Issue Type: Bug
> Components: Structured Streaming
> Affects Versions: 2.4.1
> Reporter: Shixiong Zhu
> Assignee: Genmao Yu
> Priority: Major
> Labels: correctness
> Fix For: 3.0.0, 2.4.3
>
>
> Right now Kafka source v2 doesn't support null keys or values.
> * When it processes a record with a null key, the keys of all following
> records in the same partition are also returned as null. This is a
> correctness bug.
> * When it processes a record with a null value, it throws a
> NullPointerException (NPE).
> The workaround is to set the SQL conf
> "spark.sql.streaming.disabledV2MicroBatchReaders" to
> "org.apache.spark.sql.kafka010.KafkaSourceProvider" so that the v1 source is
> used.
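
One way to apply the workaround at submit time is to pass the conf through spark-submit's `--conf key=value` option. The sketch below only assembles that command line. The conf key and value are taken from the description above; the helper name `spark_submit_args` and the application file `my_streaming_job.py` are placeholders, not part of the issue.

```python
import shlex

# Conf key and value exactly as given in the issue description.
CONF_KEY = "spark.sql.streaming.disabledV2MicroBatchReaders"
V1_FALLBACK = "org.apache.spark.sql.kafka010.KafkaSourceProvider"


def spark_submit_args(app, extra_conf=None):
    """Build a spark-submit argument list that disables the Kafka v2 reader."""
    conf = {CONF_KEY: V1_FALLBACK}
    conf.update(extra_conf or {})
    args = ["spark-submit"]
    for key, value in conf.items():
        args += ["--conf", f"{key}={value}"]
    args.append(app)
    return args


# "my_streaming_job.py" is a hypothetical application file.
print(shlex.join(spark_submit_args("my_streaming_job.py")))
```

Inside an already running session, calling `spark.conf.set(CONF_KEY, V1_FALLBACK)` before the streaming query is started should have the same effect, assuming the conf is read when the query starts.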