[
https://issues.apache.org/jira/browse/HUDI-5729?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Danny Chen closed HUDI-5729.
----------------------------
Fix Version/s: 0.13.1, 0.14.0
Resolution: Fixed
Fixed via master branch: 688d947e44b4894b951162a3226bbc51f1fb7b4f
> BulkInsert recordKey contains timestamp field is fixed as a time string
> ------------------------------------------------------------------------
>
> Key: HUDI-5729
> URL: https://issues.apache.org/jira/browse/HUDI-5729
> Project: Apache Hudi
> Issue Type: Bug
> Components: flink
> Reporter: sandy du
> Assignee: sandy du
> Priority: Major
> Labels: pull-request-available
> Fix For: 0.13.1, 0.14.0
>
>
> When the recordKey contains a timestamp field, bulkInsert mode generates a
> recordKey like “id:11,ts:2022-10-12T18:30:03”.
> Upsert mode, however, depending on the config
> “hoodie.datasource.write.keygenerator.consistent.logical.timestamp.enabled”,
> generates a recordKey of either “id:11,ts:2022-10-12T18:30:03”
> or “id:11,ts:167462460000”.
> In this case, an upsert after a bulkInsert can produce a different recordKey
> for the same row, which can cause duplicate data.
>
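The mismatch described above can be illustrated with a minimal sketch. This is not Hudi code: `record_key` and its `as_time_string` flag are hypothetical names used only for illustration, the timestamp is assumed to be in UTC, and no claim is made here about which value of the Hudi config maps to which format.

```python
from datetime import datetime, timezone


def record_key(row_id: int, ts: datetime, as_time_string: bool) -> str:
    """Build a composite key of the form 'id:<id>,ts:<timestamp>'.

    Hypothetical helper for illustration only, not part of Hudi.
    """
    if as_time_string:
        # Timestamp rendered as a time string.
        ts_part = ts.strftime("%Y-%m-%dT%H:%M:%S")
    else:
        # Timestamp rendered as epoch milliseconds.
        ts_part = str(int(ts.timestamp()) * 1000)
    return f"id:{row_id},ts:{ts_part}"


# Assumed example row; the timezone (UTC) is an assumption of this sketch.
ts = datetime(2022, 10, 12, 18, 30, 3, tzinfo=timezone.utc)

# One write path always renders the time string ...
bulk_insert_key = record_key(11, ts, as_time_string=True)
# ... while the other may render epoch milliseconds for the same row.
upsert_key = record_key(11, ts, as_time_string=False)

# Different keys for the same logical row mean the later write is treated
# as a new record instead of an update, which yields a duplicate.
keys_match = bulk_insert_key == upsert_key
```

Because the two keys differ for the same logical row, a key-based lookup during the second write would not find the first record, which is how the duplicate arises.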
--
This message was sent by Atlassian Jira
(v8.20.10#820010)