danny0405 commented on code in PR #10087: URL: https://github.com/apache/hudi/pull/10087#discussion_r1393618310
########## website/docs/concurrency_control.md: ########## @@ -61,6 +68,18 @@ With multiple writers using OCC, these are the write guarantees to expect: - *INCREMENTAL PULL Guarantee*: Data consumption and checkpoints are NEVER out of order. If there are inflight commits (due to multi-writing), incremental queries will not expose the completed commits following the inflight commits. +#### Multi Writer Guarantees with Non-Blocking Concurrency Control Mode (Experimental) + +`NON_BLOCKING_CONCURRENCY_CONTROL`, offers the same set of guarantees as mentioned in the case of OCC but without +explicit locks for serializing the writes. Lock is only needed for writing the commit metadata to the Hudi timeline. The +completion time for the commits reflects the serialization order and file slicing is done based on completion time. +Multiple writers can operate on the table with non-blocking conflict resolution. The writers can write into the same +file group with the conflicts resolved automatically by the query reader and the compactor. The new concurrency mode is +currently available for preview in version 1.0.0-beta only with following caveats: + +- It is only supported for Flink writers currently. Review Comment: Spark writers also support it right? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
