bhasudha commented on code in PR #9372:
URL: https://github.com/apache/hudi/pull/9372#discussion_r1307277531


##########
website/docs/concurrency_control.md:
##########
@@ -2,105 +2,126 @@
 title: "Concurrency Control"
 summary: In this page, we will discuss how to perform concurrent writes to 
Hudi Tables.
 toc: true
+toc_min_heading_level: 2
+toc_max_heading_level: 4
 last_modified_at: 2021-03-19T15:59:57-04:00
 ---
+Concurrency control defines how different writers/readers coordinate access to 
the table. Hudi ensures atomic writes, by way of publishing commits atomically 
to the timeline, stamped with an instant time that denotes the time at which 
the action is deemed to have occurred. Unlike general purpose file version 
control, Hudi draws clear distinction between writer processes (that issue 
user’s upserts/deletes), table services (that write data/metadata to 
optimize/perform bookkeeping) and readers (that execute queries and read data). 
Hudi provides snapshot isolation between all three types of processes, meaning 
they all operate on a consistent snapshot of the table. Hudi provides 
optimistic concurrency control (OCC) between writers, while providing 
lock-free, non-blocking MVCC based concurrency control between writers and 
table-services and between different table services.
 
-In this section, we will cover Hudi's concurrency model and describe ways to 
ingest data into a Hudi Table from multiple writers; using the [Hudi 
Streamer](#hudi-streamer) tool as well as 
-using the [Hudi datasource](#datasource-writer).
+In this section, we will discuss the different concurrency controls supported by Hudi and how they are leveraged to provide flexible deployment models; we will cover multi-writing, a popular deployment model; finally, we will describe ways to ingest data into a Hudi Table from multiple writers, using tools like DeltaStreamer, the Hudi datasource, Spark Structured Streaming and Spark SQL.
 
-## Supported Concurrency Controls
 
-- **MVCC** : Hudi table services such as compaction, cleaning, clustering 
leverage Multi Version Concurrency Control to provide snapshot isolation
-between multiple table service writers and readers. Additionally, using MVCC, 
Hudi provides snapshot isolation between an ingestion writer and multiple 
concurrent readers. 
-  With this model, Hudi supports running any number of table service jobs 
concurrently, without any concurrency conflict. 
-  This is made possible by ensuring that scheduling plans of such table 
services always happens in a single writer mode to ensure no conflict and 
avoids race conditions.
+## Deployment models with supported concurrency controls
 
-- **[NEW] OPTIMISTIC CONCURRENCY** : Write operations such as the ones 
described above (UPSERT, INSERT) etc, leverage optimistic concurrency control 
to enable multiple ingestion writers to
-the same Hudi Table. Hudi supports `file level OCC`, i.e., for any 2 commits 
(or writers) happening to the same table, if they do not have writes to 
overlapping files being changed, both writers are allowed to succeed. 
-  This feature is currently *experimental* and requires either Zookeeper or 
HiveMetastore to acquire locks.
+### Model A: Single writer with inline table services
 
-It may be helpful to understand the different guarantees provided by [write 
operations](/docs/write_operations/) via Hudi datasource or the Hudi Streamer.
+This is the simplest form of concurrency, meaning there is no concurrency at all in the write processes. In this model, Hudi eliminates the need for concurrency control and maximizes throughput by supporting table services out of the box, running them inline after every write to the table. Execution plans are idempotent, persisted to the timeline and auto-recover from failures. For most simple use cases, this means just writing is sufficient to get a well-managed table that needs no concurrency control.
 
-## Single Writer Guarantees
+Although there is no actual concurrent writing in this model, there is still a need to provide snapshot isolation between readers and writers. **MVCC** is leveraged to provide such isolation between the ingestion writer and multiple readers, as well as between multiple table service writers and readers. Writes to the table, whether from ingestion or from table services, produce versioned data that becomes available to readers only after the writes are committed. Until then, readers can access only the previous version of the data.
 
- - *UPSERT Guarantee*: The target table will NEVER show duplicates.
- - *INSERT Guarantee*: The target table wilL NEVER have duplicates if 
[dedup](/docs/configurations#hoodiedatasourcewriteinsertdropduplicates) is 
enabled.
- - *BULK_INSERT Guarantee*: The target table will NEVER have duplicates if 
[dedup](/docs/configurations#hoodiedatasourcewriteinsertdropduplicates) is 
enabled.
- - *INCREMENTAL PULL Guarantee*: Data consumption and checkpoints are NEVER 
out of order.
+A single writer with all table services such as cleaning, clustering, compaction, etc., running inline (as with DeltaStreamer sync-once mode and Spark Datasource with default configs) requires no additional configs.
 
-## Multi Writer Guarantees
+#### Single Writer Guarantees
 
-With multiple writers using OCC, some of the above guarantees change as follows
+In this model, the following are the guarantees on [write 
operations](https://hudi.apache.org/docs/write_operations/) to expect:
 
 - *UPSERT Guarantee*: The target table will NEVER show duplicates.
-- *INSERT Guarantee*: The target table MIGHT have duplicates even if 
[dedup](/docs/configurations#hoodiedatasourcewriteinsertdropduplicates) is 
enabled.
-- *BULK_INSERT Guarantee*: The target table MIGHT have duplicates even if 
[dedup](/docs/configurations#hoodiedatasourcewriteinsertdropduplicates) is 
enabled.
+- *INSERT Guarantee*: The target table will NEVER have duplicates if [dedup](https://hudi.apache.org/docs/configurations#hoodiedatasourcewriteinsertdropduplicates) is enabled.
+- *BULK_INSERT Guarantee*: The target table will NEVER have duplicates if 
[dedup](https://hudi.apache.org/docs/configurations#hoodiedatasourcewriteinsertdropduplicates)
 is enabled.
+- *INCREMENTAL PULL Guarantee*: Data consumption and checkpoints are NEVER out 
of order.
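The UPSERT guarantee can be illustrated with a toy sketch (plain Python, not Hudi code): because records are resolved by record key, rewriting an existing key replaces the previous version instead of adding a duplicate row.

```python
# Toy illustration of the UPSERT guarantee (not Hudi code):
# the table is keyed by record key, so re-writing a key can
# never introduce a duplicate row.
def upsert(table: dict, records: list) -> dict:
    for r in records:
        table[r["key"]] = r  # existing key is overwritten, never duplicated
    return table

table = {}
upsert(table, [{"key": "a", "val": 1}, {"key": "b", "val": 2}])
upsert(table, [{"key": "a", "val": 3}])  # updates "a" in place

assert len(table) == 2          # still two rows, no duplicate for "a"
assert table["a"]["val"] == 3   # "a" reflects the latest write
```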
+
+
+### Model B: Single writer with async table services
+
+Hudi provides the option of running the table services in an async fashion, where most of the heavy lifting (e.g., actually rewriting the columnar data via the compaction service) is done asynchronously. In this model, the async deployment eliminates any repeated wasteful retries and optimizes the table using clustering techniques while a single writer consumes the writes to the table without being blocked by such table services. This model avoids the need for taking an [external lock](#external-locking-and-lock-providers) to control concurrency and avoids the need to separately orchestrate and monitor offline table services jobs.
+
+A single writer along with async table services runs in the same process. For example, you can have a DeltaStreamer in continuous mode write to a MOR table using async compaction; you can use Spark Streaming (where [compaction](https://hudi.apache.org/docs/compaction) is async by default); and you can use Flink streaming or your own job setup and enable async table services inside the same writer.
+
+Hudi leverages **MVCC** in this model to support running any number of table service jobs concurrently, without any concurrency conflict. This is made possible by ensuring that Hudi's ingestion writer and async table services coordinate among themselves to avoid conflicts and race conditions. The same single writer guarantees described in Model A above can be achieved in this model as well.
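The MVCC behavior described above can be sketched with a toy timeline (plain Python, not Hudi internals): writers produce new versions, but readers only ever see versions whose commit has already been published.

```python
# Toy sketch of MVCC snapshot isolation (not Hudi internals):
# writers publish new versions to a timeline, and readers only
# see versions whose commit is already on the timeline.
class Timeline:
    def __init__(self):
        self.versions = []      # committed (instant, data) pairs
        self.inflight = None    # an uncommitted write in progress

    def begin_write(self, data):
        self.inflight = data    # invisible to readers until committed

    def commit(self, instant):
        self.versions.append((instant, self.inflight))
        self.inflight = None

    def read_latest(self):
        # readers never observe the in-flight write
        return self.versions[-1][1] if self.versions else None

t = Timeline()
t.begin_write({"rows": 10})
assert t.read_latest() is None           # nothing committed yet
t.commit("001")
t.begin_write({"rows": 20})
assert t.read_latest() == {"rows": 10}   # reader still sees version 001
t.commit("002")
assert t.read_latest() == {"rows": 20}   # new version visible only after commit
```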
+
+### Model C: Multi-writer
+
+It is not always possible to serialize all write operations to a table (such as UPSERT, INSERT or DELETE) into the same write process; therefore, multi-writing capability may be required. In multi-writing, disparate distributed processes run in parallel or in overlapping time windows to write to the same table. In such cases, an external locking mechanism becomes necessary to coordinate concurrent accesses. Here are a few different scenarios that would all fall under multi-writing:
+
+- Multiple ingestion writers to the same table: for instance, two Spark Datasource writers working on different sets of partitions from a source Kafka topic.
+- Multiple ingestion writers to the same table, including one writer with async table services: for example, a DeltaStreamer with async compaction for regular ingestion and a Spark Datasource writer for backfilling.
+- A single ingestion writer and a separate compaction (HoodieCompactor) or clustering (HoodieClusteringJob) job: this is considered multi-writing because the jobs do not run in the same process.
+
+Hudi's concurrency model intelligently differentiates actual writing to the 
table from table services that manage or optimize the table. Hudi offers 
similar **optimistic concurrency control across multiple writers**, but **table 
services can still execute completely lock-free and async** as long as they run 
in the same process as one of the writers.
+For multi-writing, Hudi leverages file-level optimistic concurrency control (OCC). For example, when two writers write to non-overlapping files, both writes are allowed to succeed. However, when the writes from different writers overlap (touch the same set of files), only one of them will succeed. Please note that this feature is currently experimental and requires external lock providers to acquire locks briefly at critical sections during the write. More on lock providers below.
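File-level OCC can be sketched as a simple conflict check (plain Python, not Hudi code): each writer tracks the file groups it touched, and at commit time a writer passes only if its file set is disjoint from those of already-committed writes.

```python
# Toy sketch of file-level optimistic concurrency control:
# a writer may commit only if the files it changed do not
# overlap with files changed by already-committed writes.
def can_commit(my_files: set, committed_files: set) -> bool:
    return my_files.isdisjoint(committed_files)

committed = set()
w1 = {"file_1", "file_2"}
w2 = {"file_3"}            # disjoint from w1 -> both succeed
w3 = {"file_2", "file_4"}  # overlaps w1 -> must fail and retry

assert can_commit(w1, committed)
committed |= w1            # w1 commits first
assert can_commit(w2, committed)
committed |= w2            # w2 also succeeds (no overlap)
assert not can_commit(w3, committed)  # w3 conflicts on file_2
```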
+
+#### Multi Writer Guarantees
+
+With multiple writers using OCC, these are the write guarantees to expect:
+
+- *UPSERT Guarantee*: The target table will NEVER show duplicates.
+- *INSERT Guarantee*: The target table MIGHT have duplicates even if dedup is 
enabled.
+- *BULK_INSERT Guarantee*: The target table MIGHT have duplicates even if 
dedup is enabled.
 - *INCREMENTAL PULL Guarantee*: Data consumption and checkpoints MIGHT be out 
of order due to multiple writer jobs finishing at different times.
 
+
 ## Enabling Multi Writing
 
-The following properties are needed to be set properly to turn on optimistic 
concurrency control.
+The following properties need to be set appropriately to turn on optimistic concurrency control and achieve multi-writing.
 
 ```
 hoodie.write.concurrency.mode=optimistic_concurrency_control
-hoodie.cleaner.policy.failed.writes=LAZY
 hoodie.write.lock.provider=<lock-provider-classname>
 ```
 
-There are 4 different lock providers that require different configurations to 
be set.
+| Config Name | Default | Description |
+| ----------- | ------- | ----------- |
+| hoodie.write.concurrency.mode | SINGLE_WRITER (Optional) | <u>[Concurrency modes](https://github.com/apache/hudi/blob/c387f2a6dd3dc9db2cd22ec550a289d3a122e487/hudi-common/src/main/java/org/apache/hudi/common/model/WriteConcurrencyMode.java)</u> for write operations.<br />Possible values:<br /><ul><li>`SINGLE_WRITER`: Only one active writer to the table. Maximizes throughput.</li><li>`OPTIMISTIC_CONCURRENCY_CONTROL`: Multiple writers can operate on the table with lazy conflict resolution using locks. This means that only one writer succeeds if multiple writers write to the same file group.</li></ul><br />`Config Param: WRITE_CONCURRENCY_MODE` |
+| hoodie.write.lock.provider | org.apache.hudi.client.transaction.lock.ZookeeperBasedLockProvider (Optional) | Lock provider class name; users can provide their own implementation of LockProvider, which should be a subclass of org.apache.hudi.common.lock.LockProvider<br /><br />`Config Param: LOCK_PROVIDER_CLASS_NAME`<br />`Since Version: 0.8.0` |
 
-**`FileSystem`** based lock provider
+### External Locking and lock providers
 
-FileSystem based lock provider supports multiple writers cross different 
jobs/applications based on atomic create/delete operations of the underlying 
filesystem.
-
-:::note
-FileSystem based lock provider is not supported with cloud storage like S3 or 
GCS.
-:::
-
-```
-hoodie.write.lock.provider=org.apache.hudi.client.transaction.lock.FileSystemBasedLockProvider
-hoodie.write.lock.filesystem.path (optional)
-hoodie.write.lock.filesystem.expire (optional)
-```
-
-When using the FileSystem based lock provider, by default, the lock file will 
store into `hoodie.base.path`+`/.hoodie/lock`. You may use a custom folder to 
store the lock file by specifying `hoodie.write.lock.filesystem.path`.
+As can be seen above, a lock provider needs to be configured in multi-writing scenarios. External locking is typically used in conjunction with optimistic concurrency control because it provides a way to prevent conflicts that might occur when two or more transactions (commits, in our case) attempt to modify the same resource concurrently. When a transaction attempts to modify a resource that is currently locked by another transaction, it must wait until the lock is released before proceeding.
 
-In case the lock cannot release during job crash, you can set 
`hoodie.write.lock.filesystem.expire` (lock will never expire by default). You 
may also delete lock file manually in such situation.
+In the case of multi-writing in Hudi, locks are acquired on the Hudi table for a very short duration during specific phases (such as just before committing the writes or before scheduling table services) instead of being held for the entire span of the write. This approach allows multiple writers to work on the same table simultaneously, increasing concurrency while avoiding conflicts.
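This short-critical-section pattern can be sketched with ordinary threads and an in-process lock (plain Python, not Hudi's actual LockProvider API): each writer does its heavy lifting lock-free and holds the lock only while publishing its commit.

```python
# Toy sketch of external locking around the commit step only:
# writers do the expensive work without the lock and hold it just
# long enough to publish the commit, mirroring the short critical
# sections described above (not Hudi's actual LockProvider API).
import threading

table_lock = threading.Lock()
timeline = []  # stand-in for the table's commit timeline

def write(writer_id: str, data: str):
    prepared = f"{writer_id}:{data}"   # heavy lifting, done lock-free
    with table_lock:                   # brief critical section
        timeline.append(prepared)      # atomically publish the commit

threads = [threading.Thread(target=write, args=(f"w{i}", "data")) for i in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()

assert len(timeline) == 4  # all four commits published exactly once
```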
 
-**`Zookeeper`** based lock provider
+There are 4 different lock providers, each requiring different configurations. Please refer to the comprehensive locking configs [here](https://hudi.apache.org/docs/next/configurations#LOCK).
 
+#### Zookeeper based lock provider
 ```
 
hoodie.write.lock.provider=org.apache.hudi.client.transaction.lock.ZookeeperBasedLockProvider
-hoodie.write.lock.zookeeper.url
-hoodie.write.lock.zookeeper.port
-hoodie.write.lock.zookeeper.lock_key
-hoodie.write.lock.zookeeper.base_path
 ```
+Following are the basic configs required to set up this lock provider:
 
-**`HiveMetastore`** based lock provider
+| Config Name | Default | Description |
+| ----------- | ------- | ----------- |
+| hoodie.write.lock.zookeeper.base_path | N/A **(Required)** | The base path on Zookeeper under which to create lock-related ZNodes. This should be the same for all concurrent writers to the same table<br /><br />`Config Param: ZK_BASE_PATH`<br />`Since Version: 0.8.0` |
+| hoodie.write.lock.zookeeper.port | N/A **(Required)** | Zookeeper port to connect to.<br /><br />`Config Param: ZK_PORT`<br />`Since Version: 0.8.0` |
+| hoodie.write.lock.zookeeper.url | N/A **(Required)** | Zookeeper URL to connect to.<br /><br />`Config Param: ZK_CONNECT_URL`<br />`Since Version: 0.8.0` |
+
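Putting the above together, a writer enabling multi-writing with the Zookeeper lock provider might set the following (the Zookeeper host, port, and base path values here are illustrative placeholders):

```
hoodie.write.concurrency.mode=optimistic_concurrency_control
hoodie.write.lock.provider=org.apache.hudi.client.transaction.lock.ZookeeperBasedLockProvider
hoodie.write.lock.zookeeper.url=zk-host
hoodie.write.lock.zookeeper.port=2181
hoodie.write.lock.zookeeper.base_path=/hudi/locks
```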
+#### HiveMetastore based lock provider
 
 ```
 
hoodie.write.lock.provider=org.apache.hudi.hive.transaction.lock.HiveMetastoreBasedLockProvider
-hoodie.write.lock.hivemetastore.database
-hoodie.write.lock.hivemetastore.table
 ```
+Following are the basic configs required to set up this lock provider:
 
-`The HiveMetastore URI's are picked up from the hadoop configuration file 
loaded during runtime.`
-
-**`Amazon DynamoDB`** based lock provider
+| Config Name | Default | Description |
+| ----------- | ------- | ----------- |
+| hoodie.write.lock.hivemetastore.database | N/A **(Required)** | For Hive based lock provider, the Hive database to acquire lock against<br /><br />`Config Param: HIVE_DATABASE_NAME`<br />`Since Version: 0.8.0` |
+| hoodie.write.lock.hivemetastore.table | N/A **(Required)** | For Hive based lock provider, the Hive table to acquire lock against<br /><br />`Config Param: HIVE_TABLE_NAME`<br />`Since Version: 0.8.0` |
 
-Amazon DynamoDB based lock provides a simple way to support multi writing 
across different clusters.  You can refer to the
-[DynamoDB based Locks 
Configurations](https://hudi.apache.org/docs/configurations#DynamoDB-based-Locks-Configurations)
-section for the details of each related configuration knob.
+The HiveMetastore URIs are picked up from the Hadoop configuration file loaded at runtime.
 
+#### Amazon DynamoDB based lock provider
 ```
 
hoodie.write.lock.provider=org.apache.hudi.aws.transaction.lock.DynamoDBBasedLockProvider
-hoodie.write.lock.dynamodb.table (required)
-hoodie.write.lock.dynamodb.partition_key (optional)
-hoodie.write.lock.dynamodb.region (optional)
-hoodie.write.lock.dynamodb.endpoint_url (optional)
-hoodie.write.lock.dynamodb.billing_mode (optional)
 ```
+The Amazon DynamoDB based lock provider provides a simple way to support multi-writing across different clusters.  You can refer to the
Review Comment:
   Yes. I wanted to avoid all configs and point to the advanced configs section, and keep this page to only essential configs. 



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
