[
https://issues.apache.org/jira/browse/HUDI-2208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17395119#comment-17395119
]
ASF GitHub Bot commented on HUDI-2208:
--------------------------------------
pengzhiwei2018 edited a comment on pull request #3328:
URL: https://github.com/apache/hudi/pull/3328#issuecomment-894613109
Hi @nsivabalan, the PR has been updated with the following changes:
Added an "upsert" mode for insert.mode. We now have 3 insert modes:
- upsert
In upsert mode for insert into, a record that duplicates an existing primary key
updates the existing record. This is the default insert mode for pk-tables.
- strict
In strict mode for insert into, we guarantee pk uniqueness for
COW pk-tables. For MOR pk-tables, it behaves the same as "upsert" mode.
- non-strict
In non-strict mode for insert into, we use the insert operation to write data,
which allows duplicate records to be written.
The default insert mode is `upsert`.
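
To make the difference between the three modes concrete, here is a toy model in plain Python. It is only a sketch of the semantics described above, not Hudi's implementation: the names `insert_into`, `DuplicateKeyError`, and `table_type` are hypothetical, a table is modelled as a list of `(pk, value)` tuples, and raising an error on a duplicate key in strict mode is an assumption about how the uniqueness guarantee might surface.

```python
from typing import List, Tuple

Row = Tuple[int, str]  # (primary key, value); hypothetical row shape


class DuplicateKeyError(Exception):
    """Hypothetical error for the toy strict mode (an assumption, not Hudi's API)."""


def insert_into(table: List[Row], rows: List[Row],
                mode: str = "upsert", table_type: str = "cow") -> List[Row]:
    """Return a new table after writing `rows` under the given insert mode."""
    if mode not in ("upsert", "strict", "non-strict"):
        raise ValueError(f"unknown insert mode: {mode}")

    if mode == "non-strict":
        # Plain insert: duplicates on the primary key are allowed.
        return table + rows

    if mode == "strict" and table_type == "cow":
        # Assumed behaviour: reject any row whose key already exists.
        seen = {pk for pk, _ in table}
        for pk, _ in rows:
            if pk in seen:
                raise DuplicateKeyError(f"duplicate primary key: {pk}")
            seen.add(pk)
        return table + rows

    # "upsert", or "strict" on a MOR table: the incoming row replaces
    # the existing row that has the same primary key.
    merged = dict(table)
    merged.update(rows)
    return list(merged.items())


existing = [(1, "a"), (2, "b")]
print(insert_into(existing, [(2, "B"), (3, "c")]))            # default: upsert
print(insert_into(existing, [(2, "B")], mode="non-strict"))   # keeps both rows for pk 2
```

With the default mode the row for key 2 is overwritten, while non-strict mode keeps both rows for key 2. Strict mode on the toy COW table raises `DuplicateKeyError` for the same input, and on the toy MOR table it falls through to the upsert path.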
> [SQL] Support Bulk Insert For Spark Sql
> ---------------------------------------
>
> Key: HUDI-2208
> URL: https://issues.apache.org/jira/browse/HUDI-2208
> Project: Apache Hudi
> Issue Type: Sub-task
> Reporter: pengzhiwei
> Assignee: pengzhiwei
> Priority: Blocker
> Labels: pull-request-available, release-blocker
> Fix For: 0.9.0
>
>
> Support the bulk insert for spark sql