[ https://issues.apache.org/jira/browse/HUDI-2282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
pengzhiwei reassigned HUDI-2282: -------------------------------- Assignee: pengzhiwei > Insert for an already existing record throws DuplicateKeyException with > primary keyed spark sql table > ----------------------------------------------------------------------------------------------------- > > Key: HUDI-2282 > URL: https://issues.apache.org/jira/browse/HUDI-2282 > Project: Apache Hudi > Issue Type: Sub-task > Reporter: sivabalan narayanan > Assignee: pengzhiwei > Priority: Major > > [https://gist.github.com/nsivabalan/9837a90b1481c479a9c600bf16bafa57] > > {code:java} > create table hudi_test4 (id int, name string, price double, ts long) using > hudi options(primaryKey = 'id', precombineField = 'ts') partitioned by (name) > location 'file:///tmp/hudi_testing/hudi_test4'; > insert into hudi_test4 values(1, 100.0, 100000010, "abc");insert into > hudi_test4 values(2, 200.0, 200000010, "abc");insert into hudi_test4 > values(3, 300.0, 300000010, "abc"); > // this throws DuplicateKeyException ? Even though operation w/ hudi is > "upsert". > insert into hudi_test4 values(1, 600.0, 600000010, "abc"); > {code} > > > -- This message was sent by Atlassian Jira (v8.3.4#803005)