Hi, I'm new to the community, but I've kept an eye on this project for a while now. Just a quick comment on insertion/deletion: do we expect multiple modifications of a single row (identified by its primary key) within one snapshot? For example, the row with pk(x) is inserted (1st time), deleted, and then inserted again (2nd time) in a single snapshot. How would such a case be recorded in a snapshot?
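To make the case concrete, here is a small Python sketch of what I mean. The `Op` type and the `net_state` function are made up for illustration and are not Iceberg APIs. It only shows that the final state of pk(x) depends on the order in which the operations are applied, which is why I'm asking how that order would be captured inside one snapshot.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class Op:
    """Hypothetical row-level operation keyed by primary key."""
    kind: str                    # "insert" or "delete"
    pk: str
    value: Optional[str] = None  # payload, only used for inserts


def net_state(ops: List[Op]) -> Dict[str, str]:
    """Replay ops in list order and return pk -> value for rows still live."""
    live: Dict[str, str] = {}
    for op in ops:
        if op.kind == "insert":
            live[op.pk] = op.value
        elif op.kind == "delete":
            live.pop(op.pk, None)
    return live


# insert (1st), delete, insert (2nd), all within one snapshot
ops = [Op("insert", "x", "v1"), Op("delete", "x"), Op("insert", "x", "v2")]

print(net_state(ops))                       # row x survives with the 2nd value
print(net_state([ops[0], ops[2], ops[1]]))  # same ops, delete applied last
```

If the snapshot only keeps a set of inserted rows and a set of deleted keys, without their relative order, the two replays above cannot be told apart.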
Best!
Jaguar Xiong

马进 <majin1...@163.com> wrote on Mon, Apr 6, 2020 at 9:08 PM:

> Hey guys,
>
> I wrote two proposals about upsertion/deletion for tables with a primary key:
>
> 1. dealing mutable data with primary key in iceberg
>    <https://docs.google.com/document/d/1FlTTJTiCMKMCapJBE7iOaEm13QlkSQdSfIGSfm34tXA/edit?usp=sharing>
> 2. Write conficts in upserting/deleteing situation of iceberg
>    <https://docs.google.com/document/d/17wO3b0WEW1oC2EIFnIAfTidMYsAwC7IxHf9SQKI3dO0/edit?usp=sharing>
>
> The first one focuses on the upsertion/deletion design for tables with a pk,
> which covers most production scenarios, and of course aims for good
> read and write performance. The second discusses write-conflict
> situations and solutions for upsertion/deletion.
>
> I raised these two proposals to meet our production demands. I have talked
> with Opennix and made a rough plan for this, and I anticipate there will
> be discussions about the goals and roadmap of Iceberg and its relation to Hudi,
> Delta, etc. I would be grateful for help making these things clear and for
> your valuable opinions.
>
> Thanks.

--
jaguar·run for ever