Jean-Daniel Cryans created KUDU-2250:
----------------------------------------
Summary: Document odd interaction between upserts and Spark
Datasets
Key: KUDU-2250
URL: https://issues.apache.org/jira/browse/KUDU-2250
Project: Kudu
Issue Type: Task
Components: spark
Affects Versions: 1.6.0
Reporter: Jean-Daniel Cryans
We need to document a specific behavior of Spark Datasets that runs contrary to
how Kudu works.
Say you have 3 columns "k, x, y" where k is the primary key.
You run a first insert on a row "k=1, x=2, y=3".
Now you upsert "k=1, y=4".
Using any Kudu API, the full row would now be "k=1, x=2, y=4" but with Datasets
you have "k=1, x=*NULL*, y=4". This means that Datasets put a null value when
some columns aren't specified.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)